Fit gaussians (or other distributions) on my data using python

Question

I have a database of features, a 2D np.array (2000 samples and each sample contains 100 features, 2000 X 100). I want to fit gaussian distributions to my database using python. My code is the following:

data = load_my_data() # loads a np.array with size 2000x200
clf = mixture.GaussianMixture(n_components= 50, covariance_type='full')
clf.fit(data)

I am not sure about the parameters for example the covariance_type and how can I investigate whether the fit was occured succesfully or not.

EDIT: I debug the code to investigate what is happening with the clf.means_ and appartently it produced a matrix n_components X size_of_features 50 X 20). Is there a way that i can check that the fitting was successful, or to plot data? What are the alternatives to Gaussian mixtures (mixtures of exponential for example, I cannot find any available implementation)?

chenxingwei · Accepted Answer

I think you are using sklearn package.

Once you have fit, then type

print clf.means_

If it has output, then the data is fitted, if it raise errors, not fitted.

Hope this helps you.

Fit gaussians (or other distributions) on my data using python

Tags:

python

database

gaussian

Jose Ramon

1 Answers

chenxingwei

Recent Activity

Donate For Us

Fit gaussians (or other distributions) on my data using python

Tags:

python

database

gaussian

Jose Ramon

1 Answers

chenxingwei

Related questions

Recent Activity

Donate For Us