How can I run tesseract with multiple languages one time?

Tags:

I have to analyzed a image which containing both English and Japanese texts. When I run tesseract by default (-l eng), some Japanese characters lost. Otherwise, if I run tesseract with japanese (-l jpn) some English characters lost (e.g. Email).

How can I run one process which recognize both English and Japanese characters?

954

asked Jun 24 '14 06:06

pars

2 Answers

Since tesseract 3.02 it is possible to specify multiple languages for the -l parameter.

-l lang The language to use. If none is specified, English is assumed. Multiple languages may be specified, separated by plus characters. Tesseract uses 3-character ISO 639-2 language codes.

An example:

tesseract myscan.png out -l deu+eng

144

answered Oct 24 '22 10:10

tobltobs

Try this:

custom_config = r'-l eng+jpn --psm 6' txt = pytesseract.image_to_string(img, config=custom_config)  from langdetect import detect_langs detect_langs(txt)

Note: you have to install langdetect by using:

 pip install langdetect

answered Oct 24 '22 09:10

rahul

Related questions
                            
                                Trim whitespace from the end of a StringBuilder without calling ToString().Trim() and back to a new SB
                            
                                Entity Framework - The foreign key component … is not a declared property on type
                            
                                ASP.NET Web API multiple RoutePrefix
                            
                                What is really the difference between underscore _.each and _.map?
                            
                                How can I mock private static method with PowerMockito?
                            
                                Unexpected result with += on NumPy arrays
                            
                                Can I get user email using Instagram API login?
                            
                                Matplotlib scatter plot with legend
                            
                                How to initialize QJsonObject from QString
                            
                                Cannot move out of borrowed content when trying to transfer ownership
                            
                                Using CSS and HTML5 to create navigation buttons using trapezoids
                            
                                How to see diff between working directory and staging index?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With