How do I train tesseract 4 with image data instead of a font file?

1 Answer

Clone the tesstrain repo at https://github.com/tesseract-ocr/tesstrain.

You’ll also need to clone the tessdata_best repo, https://github.com/tesseract-ocr/tessdata_best. It provides the starting model for your training. Training from scratch takes hundreds of thousands of samples to reach good accuracy, so starting from an existing model lets you fine-tune with far less data (tens to hundreds of samples can be enough).

Add your training samples to the directory ./tesstrain/data/my-custom-model-ground-truth in the tesstrain repo. The directory name is your model name followed by -ground-truth.

Your training samples should be image/text file pairs that share the same base name but have different extensions. For example, an image file named 001.png that is a picture of the text foobar should be paired with a text file named 001.gt.txt that contains the text foobar.

Each image must show a single line of text, and its .gt.txt file must contain that same single line.
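A mismatched or multi-line pair is easy to miss when there are many samples, so a quick check before training can help. The following is only a sketch: check_ground_truth is a hypothetical helper name, and it assumes PNG images paired with NAME.gt.txt files as in the 001.png example.

```shell
# Hypothetical helper: report problems in a ground-truth directory.
# Assumes each transcription NAME.gt.txt sits next to an image NAME.png.
check_ground_truth() {
  local dir="$1" problems=0 gt img lines
  for gt in "$dir"/*.gt.txt; do
    # If the glob matched nothing, $gt is the literal pattern and does not exist.
    [ -e "$gt" ] || { echo "no .gt.txt files in $dir"; return 1; }
    img="${gt%.gt.txt}.png"
    if [ ! -f "$img" ]; then
      echo "missing image for $gt"
      problems=$((problems + 1))
    fi
    # Count non-empty lines; a transcription should hold exactly one.
    lines=$(grep -c . "$gt" || true)
    if [ "$lines" -ne 1 ]; then
      echo "$gt has $lines non-empty lines, expected 1"
      problems=$((problems + 1))
    fi
  done
  # Succeed only if no problems were found.
  [ "$problems" -eq 0 ]
}

# Example call from the tesstrain repo root:
#   check_ground_truth data/my-custom-model-ground-truth
```

The function prints one line per problem and returns a non-zero status if it found any, so it can gate the training step in a script.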

In the tesstrain repo, run this command:

make training MODEL_NAME=my-custom-model START_MODEL=eng TESSDATA=~/src/tessdata_best
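The TESSDATA directory needs to contain the .traineddata file for the chosen START_MODEL (eng.traineddata here), so it can be worth confirming that before starting a long run. A minimal sketch, where check_start_model is a hypothetical helper and the clone location under $HOME/src is an assumption taken from the command above:

```shell
# Hypothetical pre-flight check: confirm the start model exists before running make.
check_start_model() {
  tessdata_dir="$1"
  start_model="$2"
  if [ -f "$tessdata_dir/$start_model.traineddata" ]; then
    echo "found $tessdata_dir/$start_model.traineddata"
  else
    echo "missing $tessdata_dir/$start_model.traineddata" >&2
    return 1
  fi
}

# Example call (assumes tessdata_best was cloned into $HOME/src):
#   check_start_model "$HOME/src/tessdata_best" eng &&
#     make training MODEL_NAME=my-custom-model START_MODEL=eng TESSDATA="$HOME/src/tessdata_best"
```

The example uses $HOME rather than ~ because a tilde in the middle of an argument is not expanded by every shell.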

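Once the training is complete, there will be a new .traineddata file in tesstrain/data/. Its name should match the MODEL_NAME you passed to make, since that is the name you later give to tesseract -l. Copy that file to the directory Tesseract searches for models. On my machine, it was /usr/local/share/tessdata/.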
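The copy step can be sketched as a small function that fails loudly if either path is wrong. This is an illustration only: install_model is a hypothetical helper, the file name my-custom-model.traineddata assumes the output is named after MODEL_NAME, and the tessdata directory differs between systems.

```shell
# Hypothetical helper: copy a trained model into a tessdata directory.
install_model() {
  model_file="$1"
  tessdata_dir="$2"
  [ -f "$model_file" ] || { echo "not found: $model_file" >&2; return 1; }
  [ -d "$tessdata_dir" ] || { echo "not a directory: $tessdata_dir" >&2; return 1; }
  cp "$model_file" "$tessdata_dir/"
}

# Example call from the tesstrain repo root (paths are assumptions; sudo may be needed):
#   install_model data/my-custom-model.traineddata /usr/local/share/tessdata
# Afterwards, "tesseract --list-langs" should include the new model name.
```

If you would rather not copy anything into a system directory, tesseract also accepts a --tessdata-dir option that points it at a different directory of models.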

Then, you can run tesseract and use that model as a language.

tesseract -l my-custom-model foo.png -

answered Oct 17 '22 20:10

Eric Ihli
