Pre-processing before digit recognition with KNN classifier

Tags:

Right now I'm trying to create digit recognition system using OpenCV. There are many articles and examples in WEB (and even on StackOverflow). I decided to use KNN classifier because this solution is the most popular in WEB. I found a database of handwritten digits with a training set of 60k examples and with error rate less than 5%.

I used this tutorial as an example of how to work with this database using OpenCV. I'm using exactly same technique and on test data (t10k-images.idx3-ubyte) I've got 4% error rate. But when I try to classify my own digits I've got much bigger error. For example:

is recognized as 7
and are recognized as 5
and are recognized as 1
is recognized as 8

And so on (I can upload all images if it's needed).

As you can see all digits have good quality and are easily-recognizable for human.

So I decided to do some pre-processing before classifying. From the table on MNIST database site I found that people are using deskewing, noise removal, blurring and pixel shift techniques. Unfortunately almost all links to the articles are broken. So I decided to do such pre-processing by myself, because I already know how to do that.

Right now, my algorithm is the following:

Erode image (I think that my original digits are too
rough).
Remove small contours.
Threshold and blur image.
Center digit (instead of shifting).

I think that deskewing is not needed in my situation because all digits are normally rotated. And also I have no idea how to find a right rotation angle. So after this I've got these images:

is also 1
is 3 (not 5 as it used to be)
is 5 (not 8)
is 7 (profit!)

So, such pre-processing helped me a bit, but I need better results, because in my opinion such digits should be recognized without problems.

Can anyone give me any advice with pre-processing? Thanks for any help.

P.S. I can upload my source (c++) code.

649

asked May 06 '13 15:05

ArtemStorozhuk

2 Answers

I realized my mistake - it wasn't connected with pre-processing at all (thanks to @DavidBrown and @John). I used handwritten dataset of digits instead of printed (capitalized). I didn't find such database in the web so I decided to create it by myself. I have uploaded my database to the Google Drive.

And here's how you can use it (train and classify):

int digitSize = 16;
//returns list of files in specific directory
static vector<string> getListFiles(const string& dirPath)
{
    vector<string> result;
    DIR *dir;
    struct dirent *ent;
    if ((dir = opendir(dirPath.c_str())) != NULL)
    {
        while ((ent = readdir (dir)) != NULL)
        {
            if (strcmp(ent->d_name, ".") != 0 && strcmp(ent->d_name, "..") != 0 )
            {
                result.push_back(ent->d_name);
            }
        }
        closedir(dir);
    }
    return result;
}

void DigitClassifier::train(const string& imagesPath)
{
    int num = 510;
    int size = digitSize * digitSize;
    Mat trainData = Mat(Size(size, num), CV_32FC1);
    Mat responces = Mat(Size(1, num), CV_32FC1);

    int counter = 0;
    for (int i=1; i<=9; i++)
    {
        char digit[2];
        sprintf(digit, "%d/", i);
        string digitPath(digit);
        digitPath = imagesPath + digitPath;
        vector<string> images = getListFiles(digitPath);
        for (int j=0; j<images.size(); j++)
        {
            Mat mat = imread(digitPath+images[j], 0);
            resize(mat, mat, Size(digitSize, digitSize));
            mat.convertTo(mat, CV_32FC1);
            mat = mat.reshape(1,1);
            for (int k=0; k<size; k++)
            {
                trainData.at<float>(counter*size+k) = mat.at<float>(k);
            }
            responces.at<float>(counter) = i;
            counter++;
        }
    }
    knn.train(trainData, responces);
}

int DigitClassifier::classify(const Mat& img) const
{
    Mat tmp = img.clone();

    resize(tmp, tmp, Size(digitSize, digitSize));

    tmp.convertTo(tmp, CV_32FC1);

    return knn.find_nearest(tmp.reshape(1, 1), 5);
}

answered Sep 19 '22 09:09

ArtemStorozhuk

5 & 6 , 1 & 7, 9 & 8 are recognized as the same because central points of classes are too similar. What about this ?

Apply connected component labeling method to digits for getting real boundaries of digits and crop images over these boundaries. So, you will work on more correct area and central points are normalized.
Then divide digits into two parts as horizontally. (For example you will have two circles after dividing "8")

As a result, "9" and "8" are more recognizable as well as "5" and "6". Upper parts will be same but lower parts are different.

answered Sep 20 '22 09:09

Fuat Coşkun

Related questions
                            
                                Confusion about constant expressions
                            
                                C++11: GCC 4.8 static thread_local std::unique_ptr undefined reference
                            
                                is_convertible for multiple arguments
                            
                                Run cling on Windows
                            
                                Building Boost for Android with error "cannot find -lrt"
                            
                                How to efficiently find coefficients of a polynomial from its roots? [duplicate]
                            
                                Whats the fastest way to find biggest sum of M adjacent elements in a matrix [duplicate]
                            
                                Problems with inheritance in the STL [closed]
                            
                                How to use etrace with a dynamic library for chronological traceing of function calls in C++?
                            
                                Call of overloaded static_cast is ambiguous
                            
                                How do you create a simple comment header template for all new classes in Visual C++ 2010?
                            
                                Is there Python Clang wrapper in the vein of pygccxml which wraps GCC-XML?
                            
                                Permutations of a List of Types Using boost::mpl
                            
                                Calling function template specialization using C calling conventions
                            
                                Render cairo surface directly to OpenGL texture
                            
                                Closest distance between two points(disjoint set)
                            
                                Is the Java Native Interface (JNI) affected by C++ ABI compatibility issues?
                            
                                Can I decode a C++ exception from a Windows SEH exception? (And if so, how?)
                            
                                Rules for lookup of operators in C++11
                            
                                Porting cross-platform C++ libraries to Windows Phone 8 platform

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pre-processing before digit recognition with KNN classifier

Tags:

c++

image-processing

opencv

image-recognition

knn

ArtemStorozhuk

People also ask

2 Answers

ArtemStorozhuk

Fuat Coşkun

Recent Activity

Donate For Us