Is there an OCR library that outputs coordinates of words found within an image? [closed]

Tags:

ocr

In my experience, OCR libraries tend to merely output the text found within an image but not where the text was found. Is there an OCR library that outputs both the words found within an image as well as the coordinates (x, y, width, height) where those words were found?

730

asked Feb 18 '11 12:02

Adam Paynter

2 Answers

Most commercial OCR engines will return word and character coordinate positions, but you have to work with their SDKs to extract the information. Even Tesseract OCR will return position information, but it has not been easy to get at. Version 3.01 will make this easier, but a DLL interface is still being worked on.

Unfortunately, most free OCR programs use Tesseract OCR in its basic form and they only report the raw ASCII results.

www.transym.com - Transym OCR outputs coordinates.
www.rerecognition.com - KADMOS engine returns coordinates.

Caere OmniPage, Mitek, ABBYY, and Charactell also return character positions.

146

answered Sep 18 '22 17:09

Andrew Cash

I'm using TessNet (a Tesseract C# wrapper) and I'm getting word coordinates with the following code:

    // Namespaces needed: System.Collections.Generic, System.Drawing,
    // System.IO, System.Text, System.Windows.Forms (plus a tessnet2 reference).
    using (Bitmap image = new Bitmap(@"u:\user files\bwalker\2849257.tif"))
    {
        tessnet2.Tesseract ocr = new tessnet2.Tesseract();
        // Restrict recognition to this character set
        ocr.SetVariable("tessedit_char_whitelist", "0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz.,$-/#&=()\"':?");
        // To use the correct tessdata
        ocr.Init(@"C:\Users\bwalker\Documents\Visual Studio 2010\Projects\tessnetWinForms\tessnetWinForms\bin\Release\", "eng", false);

        List<tessnet2.Word> result = ocr.DoOCR(image, System.Drawing.Rectangle.Empty);

        // One line per word: confidence, text, then the bounding-box edges
        StringBuilder results = new StringBuilder();
        foreach (tessnet2.Word word in result)
        {
            results.Append(word.Confidence + ", " + word.Text + ", " + word.Top + ", " + word.Bottom + ", " + word.Left + ", " + word.Right + "\n");
        }

        // Append to the output file; the using block closes the writer
        using (StreamWriter writer = new StreamWriter(@"U:\user files\bwalker\ocrTesting2.txt", true))
        {
            writer.WriteLine(results.ToString());
        }
    }
    MessageBox.Show("Completed");
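The question asks for (x, y, width, height), while the code above reports each word's Top, Bottom, Left, and Right edges. Converting between the two is simple arithmetic; the following is only a minimal sketch. `WordEdges` is a hypothetical stand-in for `tessnet2.Word` so the example runs without the library, the sample values are made up, and it assumes Right and Bottom are exclusive edges (if they turn out to be inclusive pixel indices, add 1 to the width and height).

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for tessnet2.Word, holding only the fields used here.
public sealed class WordEdges
{
    public string Text = "";
    public int Left, Top, Right, Bottom;
}

public static class WordBoxes
{
    // Converts edge coordinates to (x, y, width, height).
    // Assumption: Right and Bottom are exclusive edges.
    public static (int X, int Y, int Width, int Height) ToBox(WordEdges w)
    {
        return (w.Left, w.Top, w.Right - w.Left, w.Bottom - w.Top);
    }

    public static void Main()
    {
        // Made-up sample data, not real OCR output.
        var words = new List<WordEdges>
        {
            new WordEdges { Text = "Invoice", Left = 40, Top = 12, Right = 130, Bottom = 36 },
        };

        foreach (var w in words)
        {
            var box = ToBox(w);
            Console.WriteLine($"{w.Text}: x={box.X}, y={box.Y}, width={box.Width}, height={box.Height}");
        }
    }
}
```

With real results, the same subtraction could be applied inside the `foreach` loop over the list returned by `DoOCR`.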

answered Sep 19 '22 17:09

Ben Walker
