I took the MATLAB code from this tutorial: Texture Segmentation Using Gabor Filters. To test clustering algorithms on the resulting multi-dimensional texture responses to the Gabor filters, I applied Gaussian Mixture Models and Fuzzy C-means instead of K-means and compared their results (number of clusters = 2 in all cases):
% K-means with 5 restarts (uses the k-means++ initialization by default)
L = kmeans(X, 2, 'Replicates', 5);

% Gaussian mixture model fitted by EM; hard labels from the posterior
options = statset('MaxIter', 1000);
gmm = fitgmdist(X, 2, 'Options', options);
L = cluster(gmm, X);

% Fuzzy C-means; hard labels taken as the maximum membership per point
[centers, U] = fcm(X, 2);
[~, indexes] = max(U);
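For completeness, this is roughly how X above is built from the Gabor responses (a minimal sketch; the variable names here are my own placeholders, not the tutorial's):

% Illustrative setup: gaborResponses holds the L-by-W-by-26 stack of
% Gabor magnitude responses (the name is mine, not the tutorial's).
[nRows, nCols, nFilters] = size(gaborResponses);
% One row per pixel, one column per filter: an (L*W)-by-26 matrix.
X = reshape(gaborResponses, nRows*nCols, nFilters);
% Standardizing each column keeps any single filter from dominating
% the Euclidean distances (implicit expansion needs R2016b or later).
X = (X - mean(X)) ./ std(X);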
What I find weird is that the K-means clusters are more accurate than those extracted using GMM and Fuzzy C-means. Can anyone explain whether the high dimensionality of the data given as input to GMM and Fuzzy C-means (L x W x 26, where 26 is the number of Gabor filters used) is what's causing the clustering to be less accurate? In other words, are GMM and Fuzzy C-means more sensitive to the dimensionality of the data than K-means is?
Glad the comment was useful; here are my observations in answer form.
Each of these methods is sensitive to initialization, but k-means is cheating by using 5 'Replicates' and a higher-quality initialization (k-means++). The other methods appear to be using a single random initialization.
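A sketch of how to level that playing field: fitgmdist has its own 'Replicates' option, while fcm doesn't, so for fcm I'd restart manually and keep the run with the lowest final objective (the restart loop is my own workaround, not part of the original code):

% Give the GMM the same number of restarts k-means got.
options = statset('MaxIter', 1000);
gmm = fitgmdist(X, 2, 'Options', options, 'Replicates', 5);
L_gmm = cluster(gmm, X);

% fcm has no 'Replicates' option, so restart it manually and keep
% the run with the lowest final objective-function value.
fcmOpts = [2.0, 100, 1e-5, 0];   % [exponent, maxIter, minImprove, display]
bestObj = Inf;
for r = 1:5
    [centers, U, objFcn] = fcm(X, 2, fcmOpts);
    if objFcn(end) < bestObj
        bestObj = objFcn(end);
        [~, L_fcm] = max(U);
    end
end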
k-means is GMM if you force spherical covariance, so in theory k-means shouldn't do much better (it might do slightly better if the true covariance really is spherical).
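fitgmdist has no strictly spherical covariance option, but a single diagonal covariance shared across components is the closest built-in restriction. Something like this (my sketch) lets you check how much of the gap covariance flexibility accounts for:

% Restrict the GMM toward the k-means regime: one diagonal covariance
% matrix shared by both components.
sphericalish = fitgmdist(X, 2, ...
    'CovarianceType', 'diagonal', ...
    'SharedCovariance', true, ...
    'Replicates', 5, ...
    'Options', statset('MaxIter', 1000));
L_restricted = cluster(sphericalish, X);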
I think most of the discrepancy comes down to initialization. You should be able to test this by using the k-means result as the initial conditions for the other algorithms. Or, as you tried, run several times with different random seeds and check whether there is more variation in GMM and Fuzzy C-means than there is in k-means.
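Seeding the GMM with the k-means partition is straightforward, since fitgmdist's 'Start' argument accepts a vector of initial component indices, one per row of X (as far as I know, 'Start' can't be combined with 'Replicates'). fcm has no comparable initialization hook in the releases I've used, so there the seed-variation test is the fallback. A minimal sketch:

% Seed EM with the k-means partition instead of a random start.
L_km = kmeans(X, 2, 'Replicates', 5);
gmm = fitgmdist(X, 2, 'Start', L_km, ...
    'Options', statset('MaxIter', 1000));
L_gmm = cluster(gmm, X);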