Using Ruby And Ubuntu With Optical Character Recognition

I am a university student and it's time to buy textbooks again. This quarter I need over 20 books for my classes. Normally this wouldn't be a big deal, as I would just copy and paste the ISBNs into Amazon. My school's book site, however, renders the ISBNs as images. All I want is to get each ISBN into a string so I don't have to type them by hand. I have used GOCR to convert the images into text, but I want to drive it from a Ruby script so I can automate the process and do the same for my classmates.

I can navigate to the site. How can I save each image to a file on my computer (running Ubuntu), convert the image with GOCR, and finally save the result to a file so I can access it again from my Ruby script?

asked Dec 09 '09 21:12

ryan

1 Answer

GOCR seems like a good choice at first, but from what I can tell from my own "research", its quality isn't quite sufficient for daily use. Depending on the input images, this could become a problem. If it doesn't work out for you, try the "new" feature of Google Docs, which allows you to upload images for OCR. You can then retrieve the results using a Google API client (there are tons out there; I'm using gdata-ruby-util, which requires some hacking, though).

You could also use tesseract-ocr for the OCR part. It is open source as well and in active development.
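As a rough sketch of the OCR step, the Ruby below shells out to either tool and pulls an ISBN out of the noisy text. It assumes `gocr -i FILE` and `tesseract FILE stdout` both print the recognized text to standard output on your install; the helper names (`ocr_command`, `run_ocr`, `extract_isbn`, `valid_isbn13?`) are made up for illustration. The checksum test is there because OCR misreads are likely, and a failed ISBN-13 check is a cheap way to flag them.

```ruby
require "open3"

# Build the command line for an OCR tool.
# Assumption: both tools write recognized text to standard output
# when invoked this way.
def ocr_command(image_path, engine: :tesseract)
  case engine
  when :tesseract then ["tesseract", image_path, "stdout"]
  when :gocr      then ["gocr", "-i", image_path]
  else raise ArgumentError, "unknown OCR engine: #{engine}"
  end
end

# Run the OCR tool and return its raw text output.
def run_ocr(image_path, engine: :tesseract)
  output, status = Open3.capture2(*ocr_command(image_path, engine: engine))
  raise "OCR failed for #{image_path}" unless status.success?
  output
end

# Pull the first ISBN-13 or ISBN-10 out of OCR text, ignoring an
# "ISBN" label, hyphens, and whitespace. Returns nil if none is found.
def extract_isbn(text)
  compact = text.gsub(/ISBN(?:-1[03])?/i, "").gsub(/[\s-]/, "")
  match = compact[/\d{13}|\d{9}[\dX]/i]
  match && match.upcase
end

# ISBN-13 checksum: digits weighted 1,3,1,3,... must sum to a multiple of 10.
def valid_isbn13?(isbn)
  return false unless isbn =~ /\A\d{13}\z/
  sum = isbn.chars.each_with_index.sum { |ch, i| ch.to_i * (i.even? ? 1 : 3) }
  (sum % 10).zero?
end
```

With an image saved as, say, `isbn.png`, something like `extract_isbn(run_ocr("isbn.png"))` would give you the string to paste into Amazon, and `valid_isbn13?` tells you whether a 13-digit result is worth trusting.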

For the retrieval part, I would stick with hpricot as well. It is super powerful and flexible.
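To show the shape of the retrieval step without any gem installed, here is a minimal sketch using only Ruby's standard library. Note that it swaps the HTML parser for a plain regex scan over `<img>` tags, which is far more fragile than hpricot on messy markup; treat it as a stand-in. The helper names (`image_urls`, `save_image`) and the example host are hypothetical.

```ruby
require "uri"
require "net/http"

# Collect absolute URLs for every <img> tag in an HTML string.
# The regex scan is a stdlib-only stand-in for a real HTML parser.
def image_urls(html, base_url)
  html.scan(/<img\b[^>]*\bsrc\s*=\s*["']([^"']+)["']/i).flatten.map do |src|
    URI.join(base_url, src).to_s
  end
end

# Download one URL and write it to a local file in binary mode.
# Redirects are not followed in this sketch.
def save_image(url, path)
  response = Net::HTTP.get_response(URI(url))
  raise "download failed: #{response.code}" unless response.is_a?(Net::HTTPSuccess)
  File.binwrite(path, response.body)
  path
end

# Example with a hypothetical page; relative paths are resolved
# against the page URL.
html = '<p><img src="/isbn/1.png"><IMG alt="x" src=\'img/2.gif\'></p>'
urls = image_urls(html, "http://books.example.edu/list/")
```

Each URL in `urls` can then be passed to `save_image` along with a local filename, and the saved file handed to whichever OCR tool you settle on.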

answered Oct 11 '22 15:10

moritz
