So what I heard after research is that the only solid free OCR options are either Tesseract or CuneiForm. Now, the Tesseract docs are plain horrible, all they give you is a bunch of Visual Studio code (for me on Windows) and from there you are on your own in an ocean of their API. All you can do is use the exe that compiles then use it on a tiff image. I was expecting at least short documentation that tells you how to pull their API call to use OCR at least for a small example but no, there's nothing like that in their docs. CuneiForm: I downloaded it and "great" everything is in Russian. :( Is it really hard for those guys to pull a small example instead they supply us with bunch of irrelevant info that probably 90% of people won't reach, how can you reach there without starting on small things and they explain none of it! So I have bunch of API but how the hell am I supposed to use it if it's explained nowhere?... Maybe someone can offer me advice and a solution? I'm not asking for a miracle, just something small to show me how things work.

You might have given up, but there may be some other who are still trying. So here is what you need to start with tesseract: First of all you should read all the documentation about tesseract. You may find something useful is the wiki. To start using the API(v 3.0.1, currently in trunk, read also the README and ChangeLog from trunk) you should check out the <code>baseapi.h</code>. The documentation of how to use the api is right there, a comment above each function. For starters: <ul> <li>include <code>baseapi.h</code> & construct <code>TessBaseAPI</code> object</li> <li>call <code>Init()</code> </li> <li>Some optional like <ul> <li>change some params with the <code>SetVariable()</code> func. You can see all the params and their values if you print them in a file using <code>PrintVariables()</code> func.</li> <li>change the segmentation mode with <code>SetPageSegMode()</code>. Tell tesseract what the image you are about to OCR represents - block or line of text, word or character.</li> </ul> </li> <li><code>SetImage()</code></li> <li> <code>GetUTF8Text()</code> </li> </ul> (Again, that is just for starters.) You can check the tesseract's community for alredy answerd questions or ask your own here.

How can i use tesseract ocr(or any other free ocr) in small c++ project?

Tags:

c++

c

windows

image-processing

ocr

So what I heard after research is that the only solid free OCR options are either Tesseract or CuneiForm.

Now, the Tesseract docs are plain horrible, all they give you is a bunch of Visual Studio code (for me on Windows) and from there you are on your own in an ocean of their API. All you can do is use the exe that compiles then use it on a tiff image.

I was expecting at least short documentation that tells you how to pull their API call to use OCR at least for a small example but no, there's nothing like that in their docs.

CuneiForm: I downloaded it and "great" everything is in Russian. :(

Is it really hard for those guys to pull a small example instead they supply us with bunch of irrelevant info that probably 90% of people won't reach, how can you reach there without starting on small things and they explain none of it!

So I have bunch of API but how the hell am I supposed to use it if it's explained nowhere?... Maybe someone can offer me advice and a solution? I'm not asking for a miracle, just something small to show me how things work.

641

asked Feb 22 '11 14:02

Marko29

1 Answers

You might have given up, but there may be some other who are still trying. So here is what you need to start with tesseract:

First of all you should read all the documentation about tesseract. You may find something useful is the wiki.

To start using the API(v 3.0.1, currently in trunk, read also the README and ChangeLog from trunk) you should check out the baseapi.h. The documentation of how to use the api is right there, a comment above each function.

For starters:

include baseapi.h & construct TessBaseAPI object
call Init()
Some optional like
- change some params with the SetVariable() func. You can see all the params and their values if you print them in a file using PrintVariables() func.
- change the segmentation mode with SetPageSegMode(). Tell tesseract what the image you are about to OCR represents - block or line of text, word or character.
SetImage()
GetUTF8Text()

(Again, that is just for starters.)

You can check the tesseract's community for alredy answerd questions or ask your own here.

197

answered Sep 25 '22 02:09

zkunov

Related questions
                            
                                How can I execute a command line command from a C++ program
                            
                                Sort filenames naturally with Qt
                            
                                Should we prefer Boost or standard lib? [closed]
                            
                                Difference between inotify and epoll
                            
                                Are two function pointers to the same function always equal?
                            
                                Structs vs classes in C++ [duplicate]
                            
                                Why does C++ linking use virtually no CPU?
                            
                                C++ nested classes accessibility
                            
                                Default initialization of C++ Member arrays?
                            
                                best way to do variant visitation with lambdas
                            
                                Qt foreach loop ordering vs. for loop for QList
                            
                                why is std::lock_guard not movable?
                            
                                Qt - add a hyperlink to a dialog
                            
                                Why define operator + or += outside a class, and how to do it properly?
                            
                                Simple object detection using OpenCV and machine learning
                            
                                Creating new types in C++
                            
                                How do I invoke the MinGW cross-compiler on Linux?
                            
                                Using std::tie as a range for loop target
                            
                                What are _mm_prefetch() locality hints?
                            
                                How can you detect if two regular expressions overlap in the strings they can match?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With