 

Scikit-learn SVM digit recognition

I want to make a program to recognize the digit in an image. I followed the tutorial on scikit-learn.

I can train and fit the SVM classifier as follows.

First, I import the libraries and the dataset.

from sklearn import datasets, svm, metrics

digits = datasets.load_digits()
n_samples = len(digits.images)
data = digits.images.reshape((n_samples, -1))

Second, I create the SVM model and train it with the dataset.

classifier = svm.SVC(gamma=0.001)
classifier.fit(data[:n_samples], digits.target[:n_samples])

Then I try to read my own image and use predict() to recognize the digit.

Here is my image: [image of my handwritten digit]

I read the image, resize it to (8, 8), keep a single colour channel, and later flatten it to a 1D array.

from scipy import misc

img = misc.imread("w1.jpg")        # read the image
img = misc.imresize(img, (8, 8))   # resize it to 8x8
img = img[:, :, 0]                 # keep only the first colour channel

Finally, when I print out the prediction, it returns [1]

predicted = classifier.predict(img.reshape((1, img.shape[0] * img.shape[1])))
print(predicted)

Whichever other images I use, it still returns [1].


When I print out the dataset's own image of the digit "9", it looks like: [array printout]

My image of the digit "9": [array printout]

You can see that the non-zero values are much larger in my image than in the dataset.
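For reference, the value ranges can be compared with a quick check like this (a small sketch reusing the variables from above):

print(digits.images[0].max())   # dataset pixels go from 0 to 16
print(img.max())                # my JPEG is read as uint8, so values can go up to 255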

I don't know why. I am looking for help to solve my problem. Thanks.

asked Jul 22 '16 by VICTOR




2 Answers

My best bet would be that there is a problem with your data types and array shapes.

It looks like you are training on numpy arrays of type np.float64 (or possibly np.float32 on 32-bit systems, I don't remember), where each image has the shape (64,).

Meanwhile your input image for prediction, after the resize and the reshape in your code, is of type uint8 and has shape (1, 64).
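A quick way to verify this is to print the dtypes and shapes on both sides (a small sketch reusing the variable names from your code):

print(data.dtype, data.shape)   # training data: float64, (n_samples, 64)
print(img.dtype, img.shape)     # your image after imresize and channel selection: uint8, (8, 8)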

I would first try changing the shape of your input image since dtype conversions often just work as you would expect. So change this line:

predicted = classifier.predict(img.reshape((1,img.shape[0]*img.shape[1] )))

to this:

predicted = classifier.predict(img.reshape(img.shape[0]*img.shape[1]))

If that doesn't fix it, you can always try recasting the data type as well with

img = img.astype(digits.images.dtype).

I hope that helps. Debugging by proxy is a lot harder than actually sitting in front of your computer :)

Edit: According to the scikit-learn documentation, the training data contains integer values from 0 to 16. The values in your input image should be scaled to fit the same interval. (http://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_digits.html#sklearn.datasets.load_digits)
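For example, assuming your resized image is a uint8 array with values in the 0-255 range, something along these lines should bring it onto the same scale:

img = img / 255.0 * 16                  # rescale 0-255 onto the 0-16 scale used by load_digits()
img = img.astype(digits.images.dtype)   # match the training data dtype (float64)
# if your digit is dark ink on a light background, you may also need to invert it
# first (img = 255 - img), since the dataset stores high values for the strokes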

answered Oct 06 '22 by Dr K


1) You need to create your own training set, based on data similar to what you will be making predictions on. The call to datasets.load_digits() in scikit-learn loads a preprocessed version of the UCI hand-written digits dataset, which, for all we know, could contain very different images from the ones you are trying to recognise.

2) You need to set the parameters of your classifier properly. The call to svm.SVC(gamma=0.001) just picks an arbitrary value for the gamma parameter of SVC, which may not be the best option. In addition, you are not setting the C parameter, which is quite important for SVMs. I'd bet that this is one of the reasons why your output is always [1].

3) Whatever final settings you choose for your model, you'll need to use a cross-validation scheme to make sure that the algorithm is effectively learning.

There's a lot of machine learning theory behind this, but, as a good start, I would really recommend having a look at SVM - scikit-learn for a more in-depth description of how the SVC implementation in scikit-learn works, and at GridSearchCV for a simple technique for parameter tuning.
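As a rough sketch of what parameter tuning with cross-validation could look like on the digits data (using sklearn.model_selection from recent scikit-learn versions; the grid values below are only illustrative starting points):

from sklearn import datasets, svm
from sklearn.model_selection import GridSearchCV

digits = datasets.load_digits()
X = digits.images.reshape((len(digits.images), -1))
y = digits.target

# search over C and gamma with 5-fold cross-validation
param_grid = {"C": [0.1, 1, 10, 100], "gamma": [0.0001, 0.001, 0.01]}
search = GridSearchCV(svm.SVC(), param_grid, cv=5)
search.fit(X, y)

print(search.best_params_)   # best parameter combination found
print(search.best_score_)    # mean cross-validated accuracy for that combination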

answered Oct 06 '22 by carrdelling