Image cleaning before OCR application

Tags: python, image-processing, ocr

I have been experimenting with PyTesser for the past couple of hours and it is a really nice tool. A couple of things I noticed about its accuracy:

1) File with icons, images and text: 5-10% accurate
2) File with only text (images and icons erased): 50-60% accurate
3) File with stretching (and this is the best part): stretching the file from 2) above along the x or y axis increased the accuracy by 10-20%
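The stretching experiment can be reproduced in code. Below is a minimal sketch, assuming the Pillow library is installed; the function names and the scale factors are hypothetical and are not taken from the original test, which does not say how much the file was stretched.

```python
def stretched_size(width, height, x_factor=1.0, y_factor=1.0):
    """Return the (width, height) after scaling each axis independently."""
    if x_factor <= 0 or y_factor <= 0:
        raise ValueError("scale factors must be positive")
    return max(1, round(width * x_factor)), max(1, round(height * y_factor))


def stretch_image(src_path, dst_path, x_factor=1.0, y_factor=1.0):
    """Stretch an image along x and/or y and save the result."""
    # Pillow is imported here so stretched_size() stays usable without it.
    from PIL import Image

    with Image.open(src_path) as img:
        new_size = stretched_size(img.width, img.height, x_factor, y_factor)
        # LANCZOS is a high-quality resampling filter; other filters may
        # suit a given font or scan better.
        img.resize(new_size, Image.LANCZOS).save(dst_path)
```

For example, stretch_image("page.png", "page_wide.png", x_factor=1.5) would widen a hypothetical page.png by 50% before it is handed to the OCR step. Which factor helps, if any, probably depends on the font size in the source image.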

So apparently PyTesser does not account for font dimensions or image stretching. Although there is a lot of theory to read about image processing and OCR, are there any standard image cleanup procedures (apart from erasing icons and images) that need to be done before applying PyTesser or other libraries, irrespective of the language?

...........

Wow, this post is quite old now. I started my research on OCR again these last couple of days. This time I chucked PyTesser and used the Tesseract Engine with ImageMagick instead. Coming straight to the point, this is what I found:

1) You can increase the resolution with ImageMagick (there are a bunch of simple shell commands you can use)
2) After increasing the resolution, the accuracy went up by 80-90%.
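The two steps above can be sketched from Python by shelling out to the command-line tools. This is only an illustration under stated assumptions: ImageMagick's convert and the tesseract binary are on the PATH, the 300% scale is an arbitrary choice (the post does not say which commands or factor were used), and the function names are hypothetical.

```python
import shutil
import subprocess


def build_upscale_command(src, dst, percent=300):
    """Build an ImageMagick command that enlarges src by the given percentage."""
    if percent <= 0:
        raise ValueError("percent must be positive")
    # Assumption: ImageMagick 6 style "convert"; ImageMagick 7 prefers "magick".
    return ["convert", src, "-resize", f"{percent}%", dst]


def build_ocr_command(image, output_base):
    """Build a Tesseract command; the text is written to output_base + '.txt'."""
    return ["tesseract", image, output_base]


def upscale_and_ocr(src, upscaled, output_base, percent=300):
    """Enlarge the image, run Tesseract on it, and return the recognized text."""
    for tool in ("convert", "tesseract"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")
    subprocess.run(build_upscale_command(src, upscaled, percent), check=True)
    subprocess.run(build_ocr_command(upscaled, output_base), check=True)
    with open(output_base + ".txt", encoding="utf-8") as fh:
        return fh.read()
```

Calling upscale_and_ocr("scan.png", "scan_big.png", "scan_text") would write scan_big.png and scan_text.txt and return the recognized text. Keeping the command builders separate makes it easy to try other percentages without touching the subprocess logic.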

So the Tesseract Engine is without doubt the best open source OCR engine on the market. No prior image cleaning was required here. The caveat is that it does not work on files with a lot of embedded images, and I couldn't figure out a way to train Tesseract to ignore them. Also, the text layout and formatting in the image make a big difference. It works great on images with just text. Hope this helped.

asked Oct 28 '13 16:10

zenCoder

1 Answer

Not sure if your intent is commercial use or not, but this works wonders if you're performing OCR on a bunch of similar images.

http://www.fmwconcepts.com/imagemagick/textcleaner/index.php

[Image: original]

[Image: after pre-processing with the given arguments]

answered Sep 21 '22 12:09

Milne
