Open source OCR [closed]

Tags:

I'm looking for an open source OCR library that runs on Linux. I need this to work for PNGs and PDFs. Mostly I would like to interface this library from java or ruby. Any idea if there is anything available?

Regards.

694

asked Mar 01 '11 07:03

Chris

1 Answers

Tesseract is a very good OCR engine: https://github.com/tesseract-ocr/tesseract

The project has been launched by HP Labs and is now continued and sponsored by Google (for Google Books !). It is released under the Apache license, and it runs on Linux. It uses Tiff or PNGs files ; for PDFs, you will need to convert to one of these formats. I suppose that there is no binding so you should invoke this software as a subprogram...

118

answered Sep 22 '22 12:09

olivierlemasle

Related questions
                            
                                Numbers of source Raster bands and source color space components do not match when i read image [duplicate]
                            
                                Is the accumulator of reduce in Java 8 allowed to modify its arguments?
                            
                                ClassFormatError in java 8?
                            
                                Why does HK2 repackage everything?
                            
                                Java: Is volatile / final required for reference to synchronized object?
                            
                                Serialization without reflection in compiled classes
                            
                                Is it OK to add default implementations to methods of an interface which represents a listener?
                            
                                Is there a convenience method to create a Predicate that tests if a field equals a given value?
                            
                                Guarantees concerning Math.atan2
                            
                                How to deal with multiple database results from different servers for a request
                            
                                Android - How to use camera getSupportedPreviewSizes() for portrait orientation
                            
                                keycloak CORS filter spring boot
                            
                                Does Spring Transaction Management Work with Spring WebFlux?
                            
                                How can I Java webstart multiple, dependent, native libraries?
                            
                                Why is it recommended to avoid unidirectional one-to-many association on a foreign key? [duplicate]
                            
                                Render JavaScript and HTML in (any) Java Program (Access rendered DOM Tree)?
                            
                                Recognize numbers in images
                            
                                How to populate Java (web) application with initial data using Spring/JPA/Hibernate
                            
                                Writing a thread safe modular counter in Java
                            
                                Using @NotNull in a project where both IntelliJ and Eclipse developers are working

Open source OCR [closed]

Tags:

java

linux

ruby

pdf

ocr

Chris

People also ask

1 Answers

olivierlemasle

Recent Activity

Donate For Us