Python OCR Module in Linux?

2 Answers

You can just wrap tesseract in a function:

import os import tempfile import subprocess  def ocr(path):     temp = tempfile.NamedTemporaryFile(delete=False)      process = subprocess.Popen(['tesseract', path, temp.name], stdout=subprocess.PIPE, stderr=subprocess.STDOUT)     process.communicate()      with open(temp.name + '.txt', 'r') as handle:         contents = handle.read()      os.remove(temp.name + '.txt')     os.remove(temp.name)      return contents

If you want document segmentation and more advanced features, try out OCRopus.

174

answered Oct 24 '22 05:10

Blender

In addition to Blender's answer, that just executs Tesseract executable, I would like to add that there exist other alternatives for OCR that can also be called as external process.

ABBYY comand line OCR utility: http://ocr4linux.com/en:start

It is not free, so worth to consider only if Tesseract accuracy is not good enough for your task, or you need more sophisticated layout analisys or you need to export PDF, Word and other files.

Update: here's comparison of ABBYY and tesseract accuracy: http://www.splitbrain.org/blog/2010-06/15-linux_ocr_software_comparison

Disclaimer: I work for ABBYY

answered Oct 24 '22 05:10

Tomato

Related questions
                            
                                Problem with MVC EditorFor named template
                            
                                Comparing design by contract to type systems
                            
                                Using layout_above in a RelativeLayout
                            
                                Why does 'fopen' return a NULL pointer?
                            
                                How to pattern match a class with multiple argument lists?
                            
                                Platform independent /dev/null in c++ [duplicate]
                            
                                .net DynamicObject implementation that returns null for missing properties rather than a RunTimeBinderException
                            
                                How to avoid the message of "server-start" while opening another Emacs session?
                            
                                How to fix libeay32.dll was not found error
                            
                                When is a database called as an Embedded database?
                            
                                Using IoC in extension methods
                            
                                Google Maps API v3 - Get map by address ?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python OCR Module in Linux?

Tags:

Felix Yan

People also ask

2 Answers

Blender

Tomato

Recent Activity

Donate For Us