Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

TensorFlow - Text recognition in image [closed]

I am new to TensorFlow and to Deep Learning. I am trying to recognize text in naturel scene images. I used to work with an OCR but I would like to use Deep Learning. The text has always the same format : ABC-DEF 88:88.

What I have done is recognize every character/digit. It means that I cropped the image around every character (so each picture gives me 10 characters) to build my training and test set and they build a two conv neural networks. So my training set was a set of characters pictures and the labels were just characters/digits.

But I want to go further. What I would like to do is just to give the full pictures and output the entire text (not one character such as in my previous model).

Thank you in advance for any help.

like image 824
A. Attia Avatar asked Feb 15 '17 04:02

A. Attia


People also ask

Can TensorFlow be used for OCR?

This reference app demos how to use TensorFlow Lite to do OCR. It uses a combination of text detection model and a text recognition model as an OCR pipeline to recognize text characters.

How does OCR work in deep learning?

Optical character recognition or OCR refers to a set of computer vision problems that require us to convert images of digital or hand-written text images to machine readable text in a form your computer can process, store and edit as a text file or as a part of a data entry and manipulation software.

Which algorithm is used to detect text in images?

Optical Character Recognition (OCR) is used to analyze text in images.

What is Mmocr?

We present MMOCR-an open-source toolbox which provides a comprehensive pipeline for text detection and recognition, as well as their downstream tasks such as named entity recognition and key information extraction.


1 Answers

The difficulty is that you don't know where the text is. The solution is, given an image, you need to use a sliding window to crop different part of the image, then use a classifier to decide if there are texts in the cropped area. If so, use your character/digit recognizer to tell which characters/digits they really are.

So you need to train another classifer: given a cropped image (the size of cropped images should be slightly larger than that of your text area), decide if there are texts inside.

Just construct training set (positive samples are text areas, negative samples are other areas randomly cropped from the big images) and train it~

like image 124
soloice Avatar answered Sep 28 '22 07:09

soloice