Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Windows 7 OCR API

I have been reviewing replacements for the Office 2007 MODI OCR (OneNote's 2010 solution has lesser quality/results than 2007 :-( ). I notice that Windows 7 contains an OCR library once you install the optional tiff filter

The OCR component gets installed to

%programfiles%\Common Files\microsoft shared\OCR\7.0\xocr3.psp.dll 

but I don't see any API for it?

Does anyone see how this can be interfaced preferably in C#?

ANSWER: Found the soluation, once the optional tiff ifilter win7 feature is installed, i can then get a textoutput of a screenshot using the code/exe on http://www.codeproject.com/KB/cs/IFilter.aspx. Also if add the same [HKEY_CLASSES_ROOT.tiff\PersistentHandler] for .png and .jpg then OCR also works for jpg and png's.

like image 915
slyi Avatar asked May 23 '11 16:05

slyi


People also ask

Is Microsoft OCR free?

You can improve and customize it - it is open source The (a9t9) Free OCR Software converts scans or (smartphone) images of text documents into editable files by using Optical Character Recognition (OCR) technologies. It uses state-of-the-art modern OCR software.

Does Microsoft have OCR software?

Microsoft's OCR technologies support extracting printed text in several languages. Follow a quickstart to get started with the REST API or a client SDK. Or, try out the capabilities of OCR quickly and easily in your browser using Vision Studio.


1 Answers

  1. Tessnet OCR is a good solution, but pretty old (last release from 2009). There are couple of very good free OCR solutions available for .NET:
  2. Asprise C# OCR SDK. Very good and fast one.
  3. Microsoft Research Project Hawaii Web-based (cloud) OCR solution with full docs and samples (discontinued 2013)
  4. Bing OCR Web based (cloud) OCR replacement for above. (discontinued March 2014)
like image 185
Piotr Szmyd Avatar answered Sep 20 '22 15:09

Piotr Szmyd