How does Google reCAPTCHA v2 work behind the scenes?

Tags:

captcha

This post refers to Google ReCaptcha v2 (not the latest version)

Recently Google introduced a simplified "captcha" verification system (video) that enables users to pass the "captcha" just by clicking on it.

But how can it differentiate a bot from a person just by a click?

As per this answer, (assuming a similar implementation), at first "recaptcha" generates a hidden key and attaches it to a hidden input element and also lazily renders a check box (not an actual check box input but a div) with the same key which when clicked, sends an asynchronous request (XHR) to the Google backend servers to mark it as a valid verification key (i.e. a key that has to be validated when the form is submitted).

But why can't bots automate that click (at least, browser-based bots)?

How might this work?

761

asked Dec 04 '14 04:12

everlasto

2 Answers

This is speculation, but based on Google's reference to the "risk analysis engine" they use (http://googleonlinesecurity.blogspot.com/2014/12/are-you-robot-introducing-no-captcha.html)

I would assume it looks at how you behaved prior to clicking, how your cursor moved on its way to the check (organic path/acceleration), which part of the checkbox was clicked (random places, or dead on center every time), browser fingerprint, Google cookies & contents, click location history tied to your fingerprint or account if it detects one etc.

It's fairly difficult to fake "organic" behavior in such a way that it would fool a continuously learning pattern detection engine. In the cases where it's not sure, it still prompts you to match an actual CAPTCHA string.

121

answered Oct 14 '22 12:10

AgmLauncher

A new paper has been released with several tests against reCAPTCHA:

https://www.blackhat.com/docs/asia-16/materials/asia-16-Sivakorn-Im-Not-a-Human-Breaking-the-Google-reCAPTCHA-wp.pdf

Some highlights:

By keeping a cookie active for +9 days (by browsing sites with Google resources), you can then pass reCAPTCHA by only clicking the checkbox;
There are no restrictions based on requests per IP;
The browser's user agent must be real, and Google run tests against your environment to ensure it matches the user agent;
Google tests if the browser can render a Canvas;
Screen resolution and mouse events don't affect the results;

Google has already fixed the cookie vulnerability and is probably restricting some behaviors based on IPs.

Another interesting finding is that Google runs a VM in JavaScript that obfuscates much of reCAPTCHA code and behavior. This VM is known as botguard and is used to protect other services besides reCAPTCHA:

https://github.com/neuroradiology/InsideReCaptcha

UPDATE 2017

A recent paper (from August) was published on WOOT 2017 achieving 85% accuracy in solving noCAPTCHA reCAPTCHA audio challenges:

http://uncaptcha.cs.umd.edu/papers/uncaptcha_woot17.pdf

UPDATE 2018

Google is introducing reCAPTCHA v3, which looks like a "human score prediction engine" that is calibrated per website. It can be installed into different pages of a website (working like a Google Analytics script) to help reCAPTCHA and the website owner to understand the behaviour of humans vs. bots before filling a reCAPTCHA.

https://www.google.com/recaptcha/intro/v3beta.html

answered Oct 14 '22 10:10

barbolo

Related questions
                            
                                How to use Python plugin reCaptcha client for validation?
                            
                                Most effective form of CAPTCHA?
                            
                                Will an English CAPTCHA be an issue for people in other countries?
                            
                                Client Server REST API captcha implementation
                            
                                How do I set up Scrapy to deal with a captcha
                            
                                What is the best/recommended CAPTCHA component for ASP.NET [closed]
                            
                                Qaptcha - is it effective?
                            
                                When the bots attack! [closed]
                            
                                Easy-to-use django captcha or registration app with captcha?
                            
                                Vkontakte API using OAuth does not work with Captcha
                            
                                HTTP Status Code for Captcha
                            
                                How can I bypass the Google CAPTCHA with Selenium and Python?
                            
                                Recaptcha creates iFrame on page, breaks styling
                            
                                How can I avoid google mail server asking me to log in via browser?
                            
                                Blocking comment spam without using captcha [closed]
                            
                                Recommendations for java captcha libraries [closed]
                            
                                ReCaptcha 2.0: enable Submit button on callback if recaptcha successful
                            
                                How do I show multiple recaptchas on a single page?
                            
                                ReCaptcha API v2 Styling
                            
                                Has reCaptcha been cracked / hacked / OCR'd / defeated / broken? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With