I have www.domainname.com and origin.domainname.com pointing to the same codebase. Is there a way I can prevent all URLs under origin.domainname.com from getting indexed?
Is there some rule in robots.txt to do it? Both hosts point to the same folder. I also tried redirecting origin.domainname.com to www.domainname.com in the .htaccess file, but it doesn't seem to work.
If anyone has had a similar problem and can help, I shall be grateful.
Thanks
You can prevent a page or other resource from appearing in Google Search by including a noindex meta tag or header in the HTTP response. When Googlebot next crawls that page and sees the tag or header, Google will drop that page entirely from Google Search results, regardless of whether other sites link to it.
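Since both hostnames serve the same files, the header variant is the practical one here: you can send the noindex signal only when the request arrives on the origin hostname, so the www host stays indexable. A minimal .htaccess sketch, assuming Apache with mod_setenvif and mod_headers enabled (the env variable name NOINDEX_HOST is just an illustrative choice):

```apache
# Flag requests that arrived via the origin hostname (mod_setenvif)
SetEnvIfNoCase Host ^origin\.domainname\.com$ NOINDEX_HOST

# Tell crawlers not to index any response served under that flag (mod_headers)
Header set X-Robots-Tag "noindex, nofollow" env=NOINDEX_HOST
```

Unlike robots.txt, this does not block crawling; it lets Googlebot fetch the page and then drop it from the index.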
You can also block search engines with meta tags. The robots meta tag lets you set parameters for bots (search engine spiders); it can be used to keep bots from indexing and crawling an entire site or just parts of it.
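For instance, a single tag in a page's head is enough to opt that page out of indexing (standard HTML, no assumptions):

```html
<!-- Placed in the <head>: ask all crawlers not to index this page
     or follow its links -->
<meta name="robots" content="noindex, nofollow">
```

The catch in your setup is that the tag would have to be emitted only when the page is served via origin.domainname.com, since both hosts share one codebase.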
You can rewrite robots.txt to another file (let's name it 'robots_no.txt') containing:
User-agent: *
Disallow: /
(source: http://www.robotstxt.org/robotstxt.html)
The .htaccess file would look like this (note the escaped dots in the regex patterns):
RewriteEngine On
RewriteCond %{HTTP_HOST} !^www\.example\.com$ [NC]
RewriteRule ^robots\.txt$ robots_no.txt [L]
Use a customized robots.txt for each (sub)domain:
RewriteEngine On
RewriteCond %{HTTP_HOST} ^www\.example\.com$ [OR]
RewriteCond %{HTTP_HOST} ^sub\.example\.com$ [OR]
RewriteCond %{HTTP_HOST} ^example\.com$ [OR]
RewriteCond %{HTTP_HOST} ^www\.example\.org$ [OR]
RewriteCond %{HTTP_HOST} ^example\.org$
# Rewrites robots.txt for each of the above (sub)domains <domain> to robots_<domain>.txt
# example.org -> robots_example.org.txt
RewriteRule ^robots\.txt$ robots_%{HTTP_HOST}.txt [L]
# In all other cases, serve the default 'robots.txt' unchanged
RewriteRule ^robots\.txt$ - [L]
(Note: server variables in a RewriteRule substitution use the %{VARNAME} syntax; ${HTTP_HOST} is RewriteMap syntax and would not work here.)
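Since you mentioned the redirect itself didn't work: a common cause is a missing RewriteEngine On, or the redirect rule sitting below another rule that already matched with [L]. A minimal sketch that 301-redirects every origin.domainname.com URL to the www host, assuming mod_rewrite is enabled and this block sits at the top of the .htaccess:

```apache
RewriteEngine On
# Match only requests that arrived via the origin hostname
RewriteCond %{HTTP_HOST} ^origin\.domainname\.com$ [NC]
# Permanently redirect the same path (query string is carried over by default)
RewriteRule ^(.*)$ http://www.domainname.com/$1 [R=301,L]
```

A 301 also solves the indexing problem on its own: search engines drop the redirecting URLs and keep only the www ones.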
Instead of asking search engines to block all pages on hosts other than www.example.com, you can use <link rel="canonical"> too.
If http://example.com/page.html and http://example.org/~example/page.html both point to http://www.example.com/page.html, put the following tag in the <head>:
<link rel="canonical" href="http://www.example.com/page.html">
See also Google's article about rel="canonical".