Spherical k-means implementation in Python

Tags: python, scipy, k-means

I've been using scipy's k-means for quite some time now, and I'm pretty happy with how it works in terms of usability and efficiency. However, I now want to explore different k-means variants; more specifically, I'd like to apply spherical k-means to some of my problems.

Do you know of any good Python implementation of spherical k-means (i.e., one similar to scipy's k-means)? If not, how hard would it be to modify scipy's source code to make its k-means algorithm spherical?

Thank you.

asked Oct 07 '13 14:10

Oriol Nieto

1 Answer

In spherical k-means, the cluster centers are constrained to lie on the unit sphere. You could therefore adjust the algorithm to use the cosine distance, and you should additionally normalize the centroids of the final result.

When using the Euclidean distance, I prefer to think of the algorithm as projecting the cluster centers onto the unit sphere in each iteration, i.e., the centers should be normalized after each maximization step (the centroid update).
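To make the "normalize after each update" idea concrete, here is a minimal NumPy sketch. It is not scipy's or any library's implementation; the function names `normalize_rows` and `spherical_kmeans`, the random initialisation, and the stopping rule are all assumptions chosen for illustration.

```python
import numpy as np


def normalize_rows(X, eps=1e-12):
    """Scale each row to unit Euclidean length (eps guards against zero rows)."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    return X / np.maximum(norms, eps)


def spherical_kmeans(X, k, n_iter=100, seed=0):
    """Illustrative spherical k-means; returns (unit-norm centers, labels)."""
    rng = np.random.default_rng(seed)
    X = normalize_rows(np.asarray(X, dtype=float))
    # Initialise the centers from k distinct data points (a simple, assumed choice).
    centers = X[rng.choice(len(X), size=k, replace=False)]
    labels = np.full(len(X), -1)
    for _ in range(n_iter):
        # Assignment step: for unit vectors, the largest dot product
        # corresponds to the smallest cosine distance.
        new_labels = np.argmax(X @ centers.T, axis=1)
        if np.array_equal(new_labels, labels):
            break  # assignments stopped changing
        labels = new_labels
        # Update step: sum each cluster's members, then project the
        # result back onto the unit sphere.
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old center if a cluster is empty
                centers[j] = members.sum(axis=0)
        centers = normalize_rows(centers)
    return centers, labels
```

Calling `spherical_kmeans(X, k=3)` on an `(n_samples, n_features)` array returns centers that all have unit length, together with one cluster label per row. A more careful version would add a better initialisation (for example k-means++) and several restarts.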

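Indeed, when the centers and data points are both normalized, there is a one-to-one (monotonic) relationship between the cosine distance and the Euclidean distance. Since |a|_2 = |b|_2 = 1, the squared Euclidean distance expands to

|a - b|_2^2 = |a|_2^2 + |b|_2^2 - 2 * <a, b> = 2 * (1 - cos(a,b))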
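For unit vectors, the squared Euclidean distance equals twice the cosine distance, which is why ranking centers by either measure gives the same assignment. A quick numerical check, using arbitrary random vectors as illustrative input:

```python
import numpy as np

rng = np.random.default_rng(0)

# Two random vectors, projected onto the unit sphere.
a = rng.normal(size=5)
b = rng.normal(size=5)
a /= np.linalg.norm(a)
b /= np.linalg.norm(b)

squared_euclidean = np.sum((a - b) ** 2)
cosine_distance = 1.0 - a @ b  # the dot product is cos(a, b) for unit vectors

# The two quantities agree up to floating-point error.
identity_holds = np.isclose(squared_euclidean, 2.0 * cosine_distance)
```

Note that the identity only holds after normalization; for vectors of arbitrary length the two distances can rank neighbours differently.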

The package jasonlaska/spherecluster modifies scikit-learn's k-means into spherical k-means and also provides another clustering algorithm for data on the sphere.

answered Oct 08 '22 20:10

Jaska
