Recognize images in Python

Tags:

I'm kinda new both to OCR recognition and Python.

What I'm trying to achieve is to run Tesseract from a Python script to 'recognize' some particular figures in a .tif.

I thought I could do some training for Tesseract but I didn't find any similar topic on Google and here at SO.

Basically I have some .tif that contains several images (like an 'arrow', a 'flower' and other icons), and I want the script to print as output the name of that icon. If it finds an arrow then print 'arrow'.

Is it feasible?

711

asked Feb 13 '12 10:02

Giorgio

1 Answers

This is by no means a complete answer, but if there are multiple images in the tif and if you know the size in advance, you can standardize the image samples prior to classifying them. You would cut up the image into all the possible rectangles in the tif.

So when you create a classifier (I don't mention the methods here), the end result would take a synthesis of classifying all of the smaller rectangles.

So if given a tif , the 'arrow' or 'flower' images are 16px by 16px , say, you can use Python PIL to create the samples.

Click to copy

from PIL import Image

image_samples = []

im = Image.open("input.tif")
sample_dimensions = (16,16)

for box in get_all_corner_combinations(im, sample_dimensions):

    image_samples.append(im.crop(box))


classifier = YourClassifier()

classifications = []

for sample in image_samples:
    classifications.append (classifier (sample))

label = fuse_classifications (classifications)

Again, I didn't talk about the learning step of actually writing YourClassifier. But hopefully this helps with laying out part of the problem.

There is a lot of research on the subject of learning to classify images as well as work in cleaning up noise in images before classifying them.

Consider browsing through this nice collection of existing Python machine learning libraries.

http://scipy-lectures.github.com/advanced/scikit-learn/index.html

There are many techniques that relate to images as well.

145

answered Oct 04 '22 23:10

HeyWatchThis

Related questions
                            
                                import sqlite3 with Python2.7 on Heroku
                            
                                Passing a valid path to Python from PHP
                            
                                Identifying important words and phrases in text
                            
                                Python SUDS return type other than XML
                            
                                Pyplot - rescaling y axis after limiting x axis
                            
                                Browser multiplayer network strategy - does this seem like a viable solution? [closed]
                            
                                autotools and python setup.py
                            
                                Is there pluggable online python console?
                            
                                Getting matplotlib plots to refresh on mouse focus
                            
                                ZMQ Pub-Sub Program Failure When Losing Network Connectivity
                            
                                Got Exception Error "Exception in thread Thread-1 (most likely raised during interpreter shutdown)" which using Paramiko
                            
                                Permanent gaierror 'Temporary failure in name resolution' after running for a few hours
                            
                                Automate Java applet with Python
                            
                                python, COM and multithreading issue
                            
                                How to make pydev/eclipse compile cython modules on a Windows platform
                            
                                Packaging multiple scripts in PyInstaller
                            
                                Image to numpy-array: JPG vs. PNG
                            
                                matplotlib.animation error - The system cannot find the file specified
                            
                                How to efficiently copy all files from one directory to another in an amazon S3 bucket with boto?
                            
                                What thing I will need for creating a front end for Python based on LLVM architecture?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Recognize images in Python

Tags:

python

image

image-processing

ocr

Giorgio

People also ask

1 Answers

HeyWatchThis

Recent Activity

Donate For Us