
Disallow or Noindex on Subdomain with robots.txt

Tags:

robots.txt

I have two subdomains, dev.example.com and www.example.com. I want crawlers to drop all records of the dev subdomain but keep indexing www. I am using git to store the code for both sites, so ideally they would share the same robots.txt file.

Is it possible to use one robots.txt file and have it exclude crawlers from the dev subdomain?

asked Feb 05 '11 by Kirk Ouimet

People also ask

How do I block subdomains in robots.txt?

robots.txt blocks crawling rather than indexing. So I would recommend noindex markup on your pages (assuming they return a 200 status), then use the URL removal tool in Google Search Console to remove the entire subdomain from being visible in search (a minimal example follows this section).

Do subdomains need their own robots.txt?

Each subdomain is generally treated as a separate site and requires its own robots.txt file.

How do I block a subdomain?

Subdomains can be blocked using URL Filtering (which requires a license). To block multiple subdomains, add the sites to the Block List of a URL Filtering profile.
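
One way to apply that noindex advice without editing every page is an X-Robots-Tag response header. This is only a minimal sketch, assuming Apache with mod_headers enabled and that it is placed solely in the dev.example.com virtual host (the host name comes from the question, not from any original answer):

<IfModule mod_headers.c>
    # Ask crawlers not to index or follow anything served by this (dev) host
    Header set X-Robots-Tag "noindex, nofollow"
</IfModule>

Because the header is set only on the dev virtual host, www.example.com is left untouched.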


1 Answer

You could use Apache rewrite rules (mod_rewrite) to serve a different robots.txt on the development subdomain:

<IfModule mod_rewrite.c>
    RewriteEngine on
    # Only rewrite requests that arrive on the dev host
    RewriteCond %{HTTP_HOST} ^dev\.example\.com$
    # Serve robots-dev.txt in place of the shared robots.txt
    RewriteRule ^robots\.txt$ robots-dev.txt [L]
</IfModule>

And then create a separate robots-dev.txt:

User-agent: *
Disallow: /
answered Sep 19 '22 by Christian Davén
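
For contrast, the shared robots.txt that www.example.com keeps serving could simply allow everything; a minimal sketch (an assumption, since the asker's real file may contain other rules):

User-agent: *
Disallow:

Since both files are committed to the same git repository, each host picks up the right one through the rewrite rule above, with no branching of the code.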