Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Recognizing checkboxes in azure form recognizer

The forms I want to use in form recognizer have several checkboxes. Currently, form recognizer's OCR engine doesn't seem to detect checkboxes. Is there a work-around for this, or is there a way to train checkbox recognition?

I've also tried using tesseract on windows and linux, but I couldn't solve it there either.

The checkboxes are your standard open squares. Sometimes they may have an "x" or checkmark on them.

If there is no solution, then I may ask the customer to change from checkboxes to filled in circles (radio-buttons). I haven't tested that yet, to see what OCR would make of them.

Edit: I read the form recognizer documentation and I saw that it explicitly says checkboxes and radio buttons are not supported. I wonder if anyone has a awork-around?

like image 918
rdodhia Avatar asked Apr 24 '26 01:04

rdodhia


2 Answers

An OCR is by definition made for "character recognition" (see here). A checkbox or a radio button is not a character, so it will not be recognized by an OCR.

You can still give a try to a custom vision detector to find those items, but it will be a bit complicated as this service will only find those items in your document and you will have to combine with an OCR call to get the text, then try to match the zones in the document to know which text is associated with which combo/radio button

like image 81
Nicolas R Avatar answered May 02 '26 14:05

Nicolas R


Support for checkboxes was added to Form Recognizer in version 2.1 (in public preview as of September 2020). From the announcement:

Checkbox / Selection Mark detection – Form Recognizer supports detection and extraction of selection marks such as check boxes and radio buttons. Selection Marks are extracted in Layout and you can now also label and train in Train Custom Model - Train with Labels to extract key value pairs for selection marks.

There is now a selectionMarks object in the Get Analyze Layout Result API response that lists detected selection marks and their state, either selected or unselected.

Support for labeling checkboxes and selection marks was also added to the sample labeling tool as of version 2.1.

like image 28
Adam D Avatar answered May 02 '26 13:05

Adam D



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!