Let's say I have a test folder (test.domain.com) and I don't want search engines to crawl it. Do I need to have a robots.txt in the test folder, or can I just place a robots.txt in the root and disallow the test folder?
Robots.txt by subdomain and protocol: Google handles robots.txt files by subdomain and by protocol. For example, a site can have one robots.txt file sitting on the non-www version and a completely different one sitting on the www version.
Use Disallow in a robots.txt file. But remember it is your subdomain, so the robots.txt file should sit on the subdomain itself; don't put Disallow: / in your main domain's robots.txt, or your main site will be blocked from crawling as well.
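As a rough sketch (assuming the test subdomain is served from its own document root), the file served at test.domain.com could block everything while the one at domain.com stays permissive:

    # robots.txt served at http://test.domain.com/robots.txt
    User-agent: *
    Disallow: /

    # robots.txt served at http://domain.com/robots.txt
    User-agent: *
    Disallow:

An empty Disallow line means nothing is blocked, so the main site remains fully crawlable.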
A robots.txt file tells search engine crawlers which URLs the crawler can access on your site. This is used mainly to avoid overloading your site with requests; it is not a mechanism for keeping a web page out of Google. To keep a web page out of Google, block indexing with noindex or password-protect the page.
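If the goal is to keep the test pages out of Google's index entirely (not just uncrawled), a noindex signal is the usual route. A minimal sketch, assuming you can edit the pages or the server configuration on the test subdomain:

    <!-- in the <head> of each page on test.domain.com -->
    <meta name="robots" content="noindex">

    # or as an HTTP response header (e.g. via Apache mod_headers)
    Header set X-Robots-Tag "noindex"

Note that crawlers have to be able to fetch a page to see its noindex, so don't also block those URLs in robots.txt if you rely on this.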
One of the most common and useful ways to use your robots.txt file is to limit search engine bot access to parts of your website. This can help maximize your crawl budget and prevent unwanted pages from winding up in the search results.
Each subdomain is generally treated as a separate site and requires its own robots.txt file.
If your test folder is configured as a virtual host, you need a robots.txt in your test folder as well (this is the most common setup).
But if you route the subdomain's traffic through an .htaccess file, you could modify it to always serve the robots.txt from the root of your main domain.
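Conversely, if the hosts share one document root but you want a stricter robots.txt on the test subdomain, a rewrite rule can hand out a different file per host. A rough sketch, assuming Apache with mod_rewrite enabled; the filename robots_test.txt is just an illustrative name:

    RewriteEngine On
    # When robots.txt is requested on the test subdomain,
    # serve a separate blocking file instead
    RewriteCond %{HTTP_HOST} ^test\.domain\.com$ [NC]
    RewriteRule ^robots\.txt$ /robots_test.txt [L]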
Anyway, from my experience it's better to be safe than sorry, so put robots.txt files (especially ones that deny access) on every domain and subdomain you need to protect. And double-check that you're getting the right file when accessing:
http://yourrootdomain.com/robots.txt
http://subdomain.yourrootdomain.com/robots.txt
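A quick way to verify (assuming curl is available and you substitute your own hostnames) is to fetch both files from the command line and confirm each host returns the rules you expect:

    curl -s http://yourrootdomain.com/robots.txt
    curl -s http://subdomain.yourrootdomain.com/robots.txt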