 

Is it possible to control the crawl speed with robots.txt?

We can tell bots to crawl or not to crawl our website in robots.txt. On the other hand, we can control the crawling speed in Google Webmaster Tools (how much Googlebot crawls the website). I wonder if it is possible to limit crawler activity with robots.txt.

I mean allowing bots to crawl pages, but limiting their presence by time, pages, or size!

asked Oct 16 '11 by Googlebot

People also ask

How can I change the crawl rate?

If your crawl rate is described as "calculated as optimal," the only way to reduce the crawl rate is by filing a special request. You cannot increase the crawl rate. Otherwise, select the option you want and then limit the crawl rate as desired.

Does robots.txt prevent crawling?

One use of robots.txt is to prevent duplicate content issues that occur when the same posts or pages appear under different URLs. Duplicates can negatively impact SEO. The solution is simple: identify duplicate content and disallow bots from crawling it.
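For illustration, a minimal robots.txt along those lines might look like this (the /print/ path is just a hypothetical duplicate-content directory):

User-agent: *
Disallow: /print/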

What does robots.txt tell crawlers?

A robots.txt file tells search engine crawlers which URLs the crawler can access on your site. This is used mainly to avoid overloading your site with requests; it is not a mechanism for keeping a web page out of Google.

What is crawl-delay in robots.txt?

A robots.txt file may specify a "crawl-delay" directive for one or more user agents, which tells a bot how quickly it can request pages from a website. For example, a crawl delay of 10 specifies that a crawler should not request a new page more than once every 10 seconds.
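As a sketch, a robots.txt applying that 10-second delay to all bots would look like this (keeping in mind that not every crawler honors the directive):

User-agent: *
Crawl-delay: 10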


1 Answer

There is one directive you can use in robots.txt: Crawl-delay.

User-agent: *
Crawl-delay: 5

This means robots should crawl no more than one page every 5 seconds. But as far as I know, this directive is not part of the official robots.txt standard.
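Crawl-delay can also be set per user agent. As a rough sketch (the values here are arbitrary), Bing and Yandex have historically honored the directive, while Googlebot ignores it and expects the crawl rate to be set in Webmaster Tools / Search Console instead:

# Slower rate for Bingbot, 5-second default for everyone else
User-agent: Bingbot
Crawl-delay: 10

User-agent: *
Crawl-delay: 5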

Also, some robots don't take the robots.txt file into account at all. So even if you have disallowed access to some pages, they may still be crawled by some robots (though not by the major ones like Google).

Baidu, for example, may ignore robots.txt, but that's not certain.

I've got no official source for this info, so you can just Google it.

answered Feb 09 '23 by ZurabWeb