How to stop search engines from crawling the whole website?

I want to stop search engines from crawling my whole website.

I have a web application for members of a company to use. This is hosted on a web server so that the employees of the company can access it. No one else (the public) would need it or find it useful.

So I want to add another layer of security (in theory) to help prevent unauthorized access by removing all search engine bot/crawler access to it. Having Google index our site to make it searchable is pointless from a business perspective, and it just gives an attacker another way to find the website in the first place.

I know that in robots.txt you can tell search engines not to crawl certain directories.

Is it possible to tell bots not to crawl the whole site without having to list all the directories not to crawl?

Is this best done with robots.txt, or is it better handled by .htaccess or something else?

asked Feb 01 '12 by Iain Simpson


People also ask

How do I stop Google crawlers?

You can block access in the following ways: to prevent your site from appearing in Google News, block access to Googlebot-News using a robots.txt file. To prevent your site from appearing in both Google News and Google Search, block access to Googlebot using a robots.txt file.
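
For example, here is a minimal robots.txt sketch of that advice, assuming you only want to block Google's crawlers (the answer below shows the wildcard rule that blocks all well-behaved bots):

# Keep the site out of Google News only
User-agent: Googlebot-News
Disallow: /

# Keep the site out of Google Search and Google News
User-agent: Googlebot
Disallow: /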

How do you stop robots from looking at things on a website?

To prevent specific articles on your site from being indexed by all robots, use the following meta tag: <meta name="robots" content="noindex, nofollow">. To prevent robots from crawling images on a specific article, use the following meta tag: <meta name="robots" content="noimageindex">.
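
The meta tag approach works one page at a time. A site-wide equivalent is to send the same directives as an HTTP response header; a minimal sketch, assuming Apache with mod_headers enabled:

# Tell crawlers not to index or follow anything served from this directory
Header set X-Robots-Tag "noindex, nofollow"

Note that a crawler only sees this header (or the meta tag) if it is allowed to fetch the page, so it complements rather than replaces robots.txt rules.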

Does Google crawl all websites?

Google's crawlers are also programmed such that they try not to crawl the site too fast to avoid overloading it. This mechanism is based on the responses of the site (for example, HTTP 500 errors mean "slow down") and settings in Search Console. However, Googlebot doesn't crawl all the pages it discovered.

What does it mean when Google crawls a website?

Crawling is the process of finding new or updated pages to add to Google (Google crawled my website). One of the Google crawling engines crawls (requests) the page. The terms "crawl" and "index" are often used interchangeably, although they are different (but closely related) actions.


1 Answer

This is best handled with a robots.txt file, though it only works for bots that respect the file.

To block the whole site, add this to the robots.txt in the root directory of your site:

User-agent: *
Disallow: /

To restrict access to your site for everyone else, .htaccess is better, but you would need to define access rules, for example by IP address.

Below are .htaccess rules that allow only requests from your company's IP address and deny everyone else:

Order deny,allow
# Deny everyone by default
Deny from all
# Enter your company's IP address here (it overrides the Deny above)
Allow from 255.1.1.1
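
The directives above use Apache 2.2 syntax. On Apache 2.4 and later, Order/Allow/Deny are deprecated in favour of Require; a minimal sketch of the equivalent rule, assuming mod_authz_host is enabled (it is by default) and using the same placeholder IP:

# Apache 2.4 syntax: only your company's IP address may access the site
Require ip 255.1.1.1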
answered Oct 05 '22 by Ulrich Palha