I'm writing an Android app to extract a Sudoku puzzle from a picture. For each cell in the 9x9 Sudoku grid, I need to determine whether it contains one of the digits 1 through 9 or is blank. I start off with a Sudoku like this: <img src="https://i.stack.imgur.com/u5Qev.jpg" alt="enter image description here"> I pre-process the Sudoku using OpenCV to extract black-and-white images of the individual digits and then put them through Tesseract. There are a couple of limitations to Tesseract, though: <ol> <li>Tesseract is large, contains lots of functionality I don't need (I.e. Full text recognition), and requires English-language training data in order to function, which I think has to go onto the device's SD card. At least I can tell it to only look for digits using <code>tesseract.setVariable("tessedit_char_whitelist", "123456789");</code> </li> <li>Tesseract often misinterprets a single digits as a string of digits, often containing newlines. It also sometimes just plain gets it wrong. Here are a few examples from the above Sudoku:</li> </ol> <img src="https://i.stack.imgur.com/imaAs.png" alt="enter image description here"> I have three questions: <ol> <li>Is there any way I can overcome the limitations of Tesseract?</li> <li>If not, what is a useful, accurate method to detect individual digits (not k-nearest neighbours) that would be feasible to implement on Android - this could be a free library or a DIY solution.</li> <li>How can I improve the pre-processing to target that method? One possibility I've considered is using a thinning algorithm, as suggested by this post, but I'm not going to bother implementing it unless it will make a difference.</li> </ol>

I took a class with one of the computer vision superstars who was/is at the top of the digit recognition algorithm rankings. He was really adamant that the best way to do digit recognition is... <pre class="prettyprint"><code>1. Get some hand-labeled training data. 2. Run Histogram of Oriented Gradients (HOG) on the training data, and produce one long, concatenated feature vector per image 3. Feed each image's HOG features and its label into an SVM 4. For test data (digits on a sudoku puzzle), run HOG on the digits, then ask the SVM classify the HOG features from the sudoku puzzle </code></pre> OpenCV has a <code>HOGDescriptor</code> object, which computes HOG features. Look at this paper for advice on how to tune your HOG feature parameters. Any SVM library should do the job...the <code>CvSVM</code> stuff that comes with OpenCV should be fine. For training data, I recommend using the MNIST handwritten digit database, which has thousands of pictures of digits with ground-truth data. A slightly harder problem is to draw a bounding box around digits that appear in nature. Fortunately, it looks like you've already found a strategy for doing bounding boxes. :)

Suggestions for digit recognition

Tags:

android

image-processing

opencv

ocr

tesseract

I'm writing an Android app to extract a Sudoku puzzle from a picture. For each cell in the 9x9 Sudoku grid, I need to determine whether it contains one of the digits 1 through 9 or is blank. I start off with a Sudoku like this:

enter image description here

I pre-process the Sudoku using OpenCV to extract black-and-white images of the individual digits and then put them through Tesseract. There are a couple of limitations to Tesseract, though:

Tesseract is large, contains lots of functionality I don't need (I.e. Full text recognition), and requires English-language training data in order to function, which I think has to go onto the device's SD card. At least I can tell it to only look for digits using tesseract.setVariable("tessedit_char_whitelist", "123456789");
Tesseract often misinterprets a single digits as a string of digits, often containing newlines. It also sometimes just plain gets it wrong. Here are a few examples from the above Sudoku:

enter image description here

I have three questions:

Is there any way I can overcome the limitations of Tesseract?
If not, what is a useful, accurate method to detect individual digits (not k-nearest neighbours) that would be feasible to implement on Android - this could be a free library or a DIY solution.
How can I improve the pre-processing to target that method? One possibility I've considered is using a thinning algorithm, as suggested by this post, but I'm not going to bother implementing it unless it will make a difference.

863

asked Nov 10 '12 05:11

1''

2 Answers

I took a class with one of the computer vision superstars who was/is at the top of the digit recognition algorithm rankings. He was really adamant that the best way to do digit recognition is...

Click to copy

1. Get some hand-labeled training data.
2. Run Histogram of Oriented Gradients (HOG) on the training data, and produce one
    long, concatenated feature vector per image
3. Feed each image's HOG features and its label into an SVM
4. For test data (digits on a sudoku puzzle), run HOG on the digits, then ask 
    the SVM classify the HOG features from the sudoku puzzle

OpenCV has a HOGDescriptor object, which computes HOG features. Look at this paper for advice on how to tune your HOG feature parameters. Any SVM library should do the job...the CvSVM stuff that comes with OpenCV should be fine.

For training data, I recommend using the MNIST handwritten digit database, which has thousands of pictures of digits with ground-truth data.

A slightly harder problem is to draw a bounding box around digits that appear in nature. Fortunately, it looks like you've already found a strategy for doing bounding boxes. :)

122

answered Sep 28 '22 10:09

solvingPuzzles

Easiest thing is to use Normalized Central Moments for digit recognition. If you have one font (or very similar fonts it works good).

See this solution: https://github.com/grzesiu/Sudoku-GUI

In core there are things responsible for digit recognition, extraction, moments training. First time application is run operator must provide information what number is seen. Then moments of image (extracted square roi) are assigned to number (operator input). Application base on comparing moments.

Here first youtube movie shows how application works: http://synergia.pwr.wroc.pl/2012/06/22/irb-komunikacja-pc/

answered Sep 28 '22 10:09

krzych

Related questions
                            
                                Butterknife 8.4.0 does not find views after re-running the app. It gets a NullPointerException
                            
                                Recyclerview view is not working with setNestedScrollingEnabled and its never recycle any views also
                            
                                React Native fetch() not working android
                            
                                Mockito 2 for Android Instrumentation test : Could not initialize plugin: interface org.mockito.plugins.MockMaker
                            
                                java.lang.ClassNotFoundException: Didn't find class ".MainActivity" on path: DexPathList react-native [duplicate]
                            
                                Room database onConflict = OnConflictStrategy.REPLACE not working
                            
                                Is there an elegant way to save and restore View state in Kotlin?
                            
                                Good way to cache data during Android application lifecycle?
                            
                                Good guidelines for developing an ecommerce application
                            
                                Testing for the masses with only one phone and emulator
                            
                                Android: how to increase heap size at runtime?
                            
                                How to Detect Lowmemory in android?
                            
                                What is the scope of LoaderManager?
                            
                                How to implement pull to refresh on a ListFragment
                            
                                Android USB Host Communication
                            
                                What are default values for connection and socket timeouts in DefaultHttpClient on Android?
                            
                                PopupWindow above Virtual keyboard
                            
                                Have horizontal RadioButtons wrap if too long for screen
                            
                                Compiling Android project from command line is slow
                            
                                Why would I ever want `setRetainInstance(false)`? - Or - The correct way to handle device rotation

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With