Hopefully the last NN question you'll get from me this weekend, but here goes :) Is there a way to handle an input that you "don't always know"... so it doesn't affect the weightings somehow? Soo... if I ask someone if they are male or female and they would not like to answer, is there a way to disregard this input? Perhaps by placing it squarely in the centre? (assuming 1,0 inputs at 0.5?) Thanks

Neural networks are fairly resistant to noise - that's one of their big advantages. You may want to try putting inputs at (-1.0,1.0) instead, with 0 as the non-input input, though. That way the input to the weights from that neuron is 0.0, meaning that no learning will occur there. Probably the best book I've ever had the misfortune of not finishing (yet!) is Neural Networks and Learning Machines by Simon S. Haykin. In it, he talks about all kinds of issues, including the way you should distribute your inputs/training set for the best training, etc. It's a really great book!

Neural Network: Handling unavailable inputs (missing or incomplete data) [closed]

Tags:

machine-learning

neural-network

Hopefully the last NN question you'll get from me this weekend, but here goes :)

Is there a way to handle an input that you "don't always know"... so it doesn't affect the weightings somehow?

Soo... if I ask someone if they are male or female and they would not like to answer, is there a way to disregard this input? Perhaps by placing it squarely in the centre? (assuming 1,0 inputs at 0.5?)

Thanks

553

asked Apr 08 '10 23:04

Micheal

2 Answers

Neural networks are fairly resistant to noise - that's one of their big advantages. You may want to try putting inputs at (-1.0,1.0) instead, with 0 as the non-input input, though. That way the input to the weights from that neuron is 0.0, meaning that no learning will occur there.

Probably the best book I've ever had the misfortune of not finishing (yet!) is Neural Networks and Learning Machines by Simon S. Haykin. In it, he talks about all kinds of issues, including the way you should distribute your inputs/training set for the best training, etc. It's a really great book!

188

answered Dec 04 '22 20:12

Daniel G

You probably know this or suspect it, but there's no statistical basis for guessing or supplying the missing values by averaging over the range of possible values, etc.

For NN in particular, there are quite a few techniques avaialble. The technique i use--that i've coded--is one of the simpler techniques, but it has a solid statistical basis and it's still used today. The academic paper that describes it here.

The theory that underlies this technique is weighted integration over the incomlete data. In practice, no integrals are evaluated, instead they are approximated by closed-form solutions of Gaussian Basis Function networks. As you'll see in the paper (which is a step-by-step explanation, it's simple to implement in your backprop algorithm.

answered Dec 04 '22 21:12

doug

Related questions
                            
                                Subtract mean from image
                            
                                SkLearn Multinomial NB: Most Informative Features
                            
                                Tensorflow Object Detection API
                            
                                How to achieve stratified K fold splitting for arbitrary number of categorical variables?
                            
                                What does "shuffle" do in fit_generator in keras?
                            
                                Are there programs that iteratively write new programs?
                            
                                Simple example using BernoulliNB (naive bayes classifier) scikit-learn in python - cannot explain classification
                            
                                How to apply Machine Learning algorithm in PHP? [closed]
                            
                                Why do dilated convolutions preserve resolution?
                            
                                pytorch freeze weights and update param_groups
                            
                                Can I use arbitrary metrics to search KD-Trees?
                            
                                Pitch detection using neural networks [closed]
                            
                                document image processing
                            
                                Choosing Features to identify Twitter Questions as "Useful"
                            
                                How to determine the learning rate and the variance in a gradient descent algorithm？
                            
                                How to parse product titles (unstructured) into structured data?
                            
                                Affinity Propagation preferences initialization
                            
                                Using Reinforcement Learning for Classfication Problems [closed]
                            
                                Returning probabilities in a classification prediction in Keras?
                            
                                Can sklearn DecisionTreeClassifier truly work with categorical data?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With