Following on from my question on the Legalities of screen scraping, even if it's illegal people will still try, so:
What technical mechanisms can be employed to prevent or at least disincentivise screen scraping?
Oh and just for grins and to make life difficult, it may well be nice to retain access for search engines. I may well be playing devil's advocate here but there is a serious underlying point.
Implementing JavaScript to set a cookie value that is checked by the web server can be a simple technique and effective. CAPCHAS are also a good technique to prevent scraping; if an IP sends too many requests, CAPTCHAS can be presented to the end user to validate human interaction.
A good bot detection solution or anti-crawler protection solution will be able to identify visitor behavior that shows signs of web scraping in real time, and automatically block malicious bots before scraping attacks unravel while maintaining a smooth experience for real human users.
You can’t prevent it.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With