Training feedforward neural network for OCR [closed]

Tags:

Currently I'm learning about neural networks and I'm trying to create an application that can be trained to recognize handwritten characters. For this problem I use a feed-forward neural network and it seems to work when I train it to recognize 1, 2 or 3 different characters. But when I try to make the network learn more than 3 characters it will stagnate at a error percentage around the 40 - 60%.

I tried with multiple layers and less/more neurons but I can't seem to get it right, now I'm wondering if a feedforward neural network is capable of recognizing that much information.

Some statistics:

Network type: Feed-forward neural network

Input neurons: 100 (a 10 * 10) grid is used to draw the characters

Output neurons: The amount of characters to regocnize

Does anyone know what's the possible flaw in my architecture is? Are there too much input neurons? Is the feedforward neural network not capable of character regocnition?

956

asked Mar 13 '12 12:03

Marnix v. R.

1 Answers

For handwritten character recognition you need

many training examples (maybe you should create distortions of your training set)
softmax activation function in the output layer
cross entropy error function
training with stochastic gradient descent
a bias in each layer

A good test problem is the handwritten digit data set MNIST. Here are papers that successfully applied neural networks on this data set:

Y. LeCun, L. Bottou, Y. Bengio and P. Haffner: Gradient-Based Learning Applied to Document Recognition, http://yann.lecun.com/exdb/publis/pdf/lecun-98.pdf

Dan Claudiu Ciresan, Ueli Meier, Luca Maria Gambardella, Juergen Schmidhuber: Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition, http://arxiv.org/abs/1003.0358

I trained an MLP with 784-200-50-10 architecture and got >96% accuracy on the test set.

answered Nov 07 '22 01:11

alfa

Related questions
                            
                                How to make softmax work with policy gradient?
                            
                                hidden markov model thresholding
                            
                                No. of hidden layers, units in hidden layers and epochs till Neural Network starts behaving acceptable on Training data
                            
                                Chess: Bug in Alpha-Beta
                            
                                Easy to use Perl modules for neural networks
                            
                                Why does decreasing K in K-nearest-neighbours increase complexity?
                            
                                Formulating Effect axiom
                            
                                Java Rule Engine for Game AI
                            
                                Simple example using BernoulliNB (naive bayes classifier) scikit-learn in python - cannot explain classification
                            
                                How to parse product titles (unstructured) into structured data?
                            
                                Anchor Boxes in YOLO : How are they decided
                            
                                Porting a piece of Lisp code to Clojure (PAIP)
                            
                                Face Recognition for classifying digital photos?
                            
                                Where Dropout should be inserted.? Fully Connected Layer.? Convolutional Layer.? or Both.? [closed]
                            
                                Tile based game theory
                            
                                C++ machine learning framework [closed]
                            
                                What are the uses of recurrent neural networks when using them with Reinforcement Learning?
                            
                                Time complexity of Uniform-cost search
                            
                                How to know when to use a particular kind of Similarity index? Euclidean Distance vs. Pearson Correlation
                            
                                Recommendations for using graphs theory in machine learning? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Training feedforward neural network for OCR [closed]

Tags:

artificial-intelligence

neural-network

backpropagation

ocr

feed-forward

Marnix v. R.

People also ask

1 Answers

alfa

Recent Activity

Donate For Us