
How to know if an HTTP request comes from a bot

I am looking for a full list of User-Agents of bots (crawlers, spiders, Twitter bots, etc.).

Do you know of one?

Thanks

Himberjack asked Jan 19 '11 10:01



1 Answer

Check this list: http://www.botsvsbrowsers.com/category/1/index.html

It contains a total of 4768 bot user agents.
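As a rough illustration of how such a list could be used, here is a minimal Python sketch of a blacklist check. The tokens in `KNOWN_BOT_TOKENS` and the function name `is_known_bot` are my own placeholders, not entries copied from the list above; in practice you would load the real entries from whatever source you choose.

```python
# Illustrative tokens only (an assumption for this sketch); a real check
# would load the full set of bot user agents from a maintained list.
KNOWN_BOT_TOKENS = ("googlebot", "bingbot", "twitterbot", "crawler", "spider")


def is_known_bot(user_agent):
    """Return True if the User-Agent header contains a known bot token."""
    if not user_agent:
        # Missing or empty header: a blacklist has nothing to match against.
        return False
    ua = user_agent.lower()
    return any(token in ua for token in KNOWN_BOT_TOKENS)


if __name__ == "__main__":
    print(is_known_bot("Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"))
    print(is_known_bot("Mozilla/5.0 (Windows NT 6.1; rv:2.0) Gecko/20100101 Firefox/4.0"))
```

Keep in mind that the User-Agent header is set by the client, so a check like this only catches bots that identify themselves honestly.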

Another way to detect bots is the reverse, whitelist approach: check whether the user agent matches a known non-bot (browser) user agent, and treat anything else as a bot. :-)

To compile a comprehensive list of non-bot user agents, you can use the lists at http://www.user-agents.org/ and http://www.botsvsbrowsers.com/.
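The whitelist idea could look roughly like the Python sketch below. The patterns in `BROWSER_PATTERNS` and the name `is_probably_bot` are assumptions made for illustration; they are not taken from either site, and a real whitelist would be built from the lists mentioned above.

```python
import re

# Hypothetical whitelist of browser signatures (assumed for this sketch).
BROWSER_PATTERNS = [
    re.compile(r"Firefox/\d+"),
    re.compile(r"Chrome/\d+"),
    re.compile(r"Version/\d+.*Safari/"),
    re.compile(r"MSIE \d+"),
    re.compile(r"Opera[/ ]\d+"),
]


def is_probably_bot(user_agent):
    """Treat any User-Agent that matches no whitelisted browser as a bot."""
    if not user_agent:
        # No User-Agent at all is not whitelisted, so it counts as a bot.
        return True
    return not any(p.search(user_agent) for p in BROWSER_PATTERNS)


if __name__ == "__main__":
    print(is_probably_bot("Mozilla/5.0 (Windows NT 6.1; rv:2.0) Gecko/20100101 Firefox/4.0"))
    print(is_probably_bot("curl/7.21.0"))
```

Some crawlers include browser tokens in their User-Agent, so a whitelist alone can let them through; combining it with a bot list is probably safer.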

Shinnok answered Oct 20 '22 04:10