Cloud Vision API poorly recognizes 7-segment numbers

The simplest example of what I'm trying to recognize:


I use DOCUMENT_TEXT_DETECTION, but the response contains hieroglyphic-looking characters rather than digits.

If I pass Eng as a language hint through the addAllLanguageHints method of ImageContext, the result is 111. Better, but still wrong.

Is there any way to indicate that only numbers should be recognized, or to otherwise improve the results?

Also, how is the setRepeatedField option in ImageContext used? I could not find any examples of its use.

Thanks in advance.

berek asked Feb 25 '18 16:02



1 Answer

Even if it doesn't work out of the box ... what you'd need is to classify images using custom labels when the default labels don't suffice. Cloud AutoML Vision (select Vision from that blue drop-down menu) lets you train custom models, which can be used to recognize that font. Since the number of possible shapes on a 7-segment display is quite limited, it shouldn't be too difficult to train. A calculator with a better display might also work better: the LCD above looks a little cheap, with those huge gaps and cut-off segment ends. Nevertheless, a model can be trained to read it.
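
To illustrate how small the set of shapes on a 7-segment display is, here is a minimal Java sketch of a lookup table from lit segments to digits. It is not part of Cloud Vision or AutoML, and it is not how a trained model works internally; the class name `SevenSegment`, the bit layout, and the helper methods are all assumptions made for this example. It also assumes some earlier step (not shown) has already worked out which segments are lit for each digit position.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper: decodes 7-segment bit masks into digits.
final class SevenSegment {
    // Assumed bit layout (bit set = segment lit):
    // bit 0 = a (top), 1 = b (upper right), 2 = c (lower right), 3 = d (bottom),
    // 4 = e (lower left), 5 = f (upper left), 6 = g (middle).
    private static final int[] DIGIT_MASKS = {
        0x3F, // 0: a b c d e f
        0x06, // 1: b c
        0x5B, // 2: a b d e g
        0x4F, // 3: a b c d g
        0x66, // 4: b c f g
        0x6D, // 5: a c d f g
        0x7D, // 6: a c d e f g
        0x07, // 7: a b c
        0x7F, // 8: all segments
        0x6F  // 9: a b c d f g
    };

    private static final Map<Integer, Integer> MASK_TO_DIGIT = new HashMap<>();

    static {
        for (int digit = 0; digit < DIGIT_MASKS.length; digit++) {
            MASK_TO_DIGIT.put(DIGIT_MASKS[digit], digit);
        }
    }

    private SevenSegment() {}

    // Returns the digit 0-9 for a known pattern, or -1 if the pattern is not a digit.
    static int decode(int mask) {
        return MASK_TO_DIGIT.getOrDefault(mask & 0x7F, -1);
    }

    // Decodes one mask per display position; unknown patterns become '?'.
    static String decodeAll(int[] masks) {
        StringBuilder out = new StringBuilder();
        for (int mask : masks) {
            int digit = decode(mask);
            out.append(digit < 0 ? '?' : (char) ('0' + digit));
        }
        return out.toString();
    }
}
```

For example, `SevenSegment.decodeAll(new int[] {0x06, 0x5B, 0x4F})` yields the string `123`. With only ten valid patterns out of 128 possible masks, a custom classifier has very few classes to separate, which is the point made above.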

Martin Zeitler answered Oct 17 '22 21:10
