Google Cloud Vision - Numbers and Numerals OCR

Tags:

I've been trying to implement an OCR program with Python that reads numbers with a specific format, XXX-XXX. I used Google's Cloud Vision API Text Recognition, but the results were unreliable. Out of 30 high-contrast 1280 x 1024 bmp images, only a handful resulted in the correct output, or at least included the correct output in the results. The program tends to omit some numbers, output in non-English languages or sneak in a few special characters.

The goal is to at least output the correct numbers consecutively, doesn't matter if the results are sprinkled with other junk. Is there a way to help the program recognize numbers better, for example limit the results to a specific format, or to numbers only?

717

asked Sep 16 '16 22:09

NigelJL

1 Answers

I am unable to tell you why this works, perhaps it has to do with how the language is read, o vs 0, l vs 1, etc. But whenever I use OCR and I am specifically looking for numbers, I have read to set the detection language to "Korean". It works exceptionally well for me and has influenced the accuracy greatly.

answered Oct 31 '22 21:10

Jake Braden

Related questions
                            
                                How to include a template with relative path in Jinja2
                            
                                How to translate this Math Formula in Haskell or Python? (Was translated in PHP)
                            
                                How does Moose compare to Python's OO system? [closed]
                            
                                Hashing an immutable dictionary in Python
                            
                                Is it possible to create a numpy.ndarray that holds complex integers?
                            
                                Assign new values to slice from MultiIndex DataFrame
                            
                                pip freeze does not show all installed packages
                            
                                How to read a v7.3 mat file via h5py?
                            
                                Folder and files upload with Flask
                            
                                Most pythonic and/or performant way to assign a single value to a slice?
                            
                                Python ThreadPoolExecutor - is the callback guaranteed to run in the same thread as submitted func?
                            
                                How to upload small files to Amazon S3 efficiently in Python
                            
                                Will a UNICODE string just containing ASCII characters always be equal to the ASCII string?
                            
                                Problems obtaining most informative features with scikit learn?
                            
                                How to restrict Django Rest Framework browsable API interface to admin users
                            
                                How to print each line of a script as it is run only for the top-level script being run?
                            
                                Using Spyder IDE, how do you return from "goto definition"?
                            
                                Apache Spark throws NullPointerException when encountering missing feature
                            
                                Memory-efficient way to generate a large numpy array containing random boolean values
                            
                                pd.rolling_mean becoming deprecated - alternatives for ndarrays

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Google Cloud Vision - Numbers and Numerals OCR

Tags:

python

google-cloud-platform

ocr

google-cloud-vision

text-recognition

NigelJL

People also ask

1 Answers

Jake Braden

Recent Activity

Donate For Us