Naive Bayes: the within-class variance in each feature of TRAINING must be positive

Tags:

When trying to fit Naive Bayes:

    training_data = sample; % 
    target_class = K8;
 # train model
 nb = NaiveBayes.fit(training_data, target_class);

 # prediction
 y = nb.predict(cluster3);

I get an error:

??? Error using ==> NaiveBayes.fit>gaussianFit at 535
The within-class variance in each feature of TRAINING
must be positive. The within-class variance in feature
2 5 6 in class normal. are not positive.

Error in ==> NaiveBayes.fit at 498
            obj = gaussianFit(obj, training, gindex);

Can anyone shed light on this and how to solve it? Note that I have read a similar post here but I am not sure what to do? It seems as if its trying to fit based on columns rather than rows, the class variance should be based on the probability of each row belonging to a specific class. If I delete those columns then it works but obviously this isnt what I want to do.

706

asked Nov 17 '12 04:11

G Gr

1 Answers

Assuming that there is no bug anywhere in your code (or NaiveBayes code from mathworks), and again assuming that your training_data is in the form of NxD where there are N observations and D features, then columns 2, 5, and 6 are completely zero for at least a single class. This can happen if you have relatively small training data and high number of classes, in which a single class may be represented by a few observations. Since NaiveBayes by default treats all features as part of a normal distribution, it cannot work with a column that has zero variance for all features related to a single class. In other words, there is no way for NaiveBayes to find the parameters of the probability distribution by fitting a normal distribution to the features of that specific class (note: the default for distribution is normal).

Take a look at the nature of your features. If they seem to not follow a normal distribution within each class, then normal is not the option you want to use. Maybe your data is closer to a multinomial model mn:

nb = NaiveBayes.fit(training_data, target_class, 'Distribution', 'mn');

122

answered Sep 23 '22 02:09

Bee

Related questions
                            
                                Normalized cuts with Matlab 2013a
                            
                                How to display legend in bottom right corner instead of top right?
                            
                                Matlab - Transpose a 3D matrix only in the third dimension
                            
                                How to zoom in/out in Matlab editor?
                            
                                Remove zeros column and rows from a matrix matlab
                            
                                determine if array contains specific integer in octave
                            
                                How to customize App Designer figures in more ways than officially documented?
                            
                                2-D line gradient color in Matlab
                            
                                -bash: matlab: command not found
                            
                                How to calculate a rotation matrix in n dimensions given the point to rotate, an angle of rotation and an axis of rotation (n-2 subspace)
                            
                                Reading text values into matlab variables from ASCII files
                            
                                How can I create a barseries plot using both grouped and stacked styles in MATLAB?
                            
                                Agglomerative Clustering in Matlab
                            
                                How can I query the number of physical cores from MATLAB?
                            
                                Using vector as range in for-loop In Matlab
                            
                                Replicate MATLAB's `conv2()` Using Fourier Domain Convolution
                            
                                Image interpolation from random pixels
                            
                                Multiply each column of a matrix by another matrix
                            
                                Assign value to the same field of every element of non-scalar struct
                            
                                unit of fft(DFT) x axis [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Naive Bayes: the within-class variance in each feature of TRAINING must be positive

Tags:

classification

naivebayes

matlab

bayesian

variance

G Gr

People also ask

1 Answers

Bee

Recent Activity

Donate For Us