Chinese character recognition using Tesseract OCR

Tags: ios, iphone, ocr, tesseract

I have been using the Tesseract 3.0.2 OCR SDK to extract text from images. When I pass an image containing Chinese text through the OCR engine, Tesseract returns digits and English letters instead of the Chinese characters. I need the Chinese characters exactly as they appear in the image.

How can I achieve this? Is there a way to get Chinese characters in the output instead of other characters?

asked May 16 '13 07:05

Nishant Tyagi

1 Answer

You need to download the Chinese trained data (a file such as chi_sim.traineddata) and add it to your tessdata folder.

You can download the file from https://github.com/tesseract-ocr/tessdata/raw/master/chi_sim.traineddata

Then initialize Tesseract with that language:

Tesseract *tesseract = [[Tesseract alloc] initWithDataPath:@"tessdata" language:@"chi_sim"];
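For context, here is a minimal sketch of a full recognition call built around that initializer. It assumes the Objective-C wrapper in use is tesseract-ios, which exposes `setImage:`, `recognize`, and `recognizedText` next to `initWithDataPath:language:`. The answer does not name the wrapper, so treat the header name, the helper function `RecognizeChineseText`, and the image name as hypothetical.

```objc
#import <UIKit/UIKit.h>
#import "Tesseract.h" // assumed header name from the tesseract-ios wrapper

// Hypothetical helper: runs OCR on an image with the Simplified Chinese data.
// Assumes chi_sim.traineddata is already in the "tessdata" folder of the app.
static NSString *RecognizeChineseText(UIImage *image) {
    Tesseract *tesseract = [[Tesseract alloc] initWithDataPath:@"tessdata"
                                                      language:@"chi_sim"];
    [tesseract setImage:image];        // image to run OCR on
    [tesseract recognize];             // perform the recognition pass
    return [tesseract recognizedText]; // text recognized from the image
}
```

It could be called as `NSLog(@"%@", RecognizeChineseText([UIImage imageNamed:@"sample_chinese.png"]));`, where `sample_chinese.png` is a placeholder image name. If the output still contains only digits and Latin letters, a likely cause is that the language passed to the initializer does not match a traineddata file in the tessdata folder.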

If you run into any problems, you can download my Tesseract experiment (with Chinese language support) from https://github.com/aryansbtloe/ExperimentWithTesseract.git

I have tested this. I hope you find it useful.

answered Oct 02 '22 22:10

Alok Singh
