DBSCAN with python and scikit-learn: What exactly are the integer labes returned by make_blobs?

Tags:

I'm trying to comprehend the example for the DBSCAN algorithm implemented by scikit (http://scikit-learn.org/0.13/auto_examples/cluster/plot_dbscan.html).

I changed the line

X, labels_true = make_blobs(n_samples=750, centers=centers, cluster_std=0.4)

with X = my_own_data, so I can use my own data for the DBSCAN.

now, the variable labels_true, which is the second returned argument of make_blobs is used to calculate some values of the results, like this:

print "Homogeneity: %0.3f" % metrics.homogeneity_score(labels_true, labels)
print "Completeness: %0.3f" % metrics.completeness_score(labels_true, labels)
print "V-measure: %0.3f" % metrics.v_measure_score(labels_true, labels)
print "Adjusted Rand Index: %0.3f" % \
    metrics.adjusted_rand_score(labels_true, labels)
print "Adjusted Mutual Information: %0.3f" % \
    metrics.adjusted_mutual_info_score(labels_true, labels)
print ("Silhouette Coefficient: %0.3f" %
       metrics.silhouette_score(D, labels, metric='precomputed'))

how can I calculate labels_true from my data X? what exactly do scikit mean with label on this case?

thanks for your help!

776

asked Apr 04 '13 18:04

otmezger

1 Answers

labels_true is the "true" assignment of points to labels: which cluster they should actually belong on. This is available because make_blobs knows which "blob" it generated the point from.

You can't get that for your own arbitrary data X, unless you have some kind of true labels for the points (in which case you wouldn't be doing clustering anyway). This just shows some measures of how well the clustering performed in a fake case where you know the true answer.

171

answered Sep 29 '22 11:09

Danica

Related questions
                            
                                How to get a file object from mkstemp()?
                            
                                Flask and WTForms - how to get wtforms to refresh select data
                            
                                python regular expression matching anything
                            
                                Use lxml to parse text file with bad header in Python
                            
                                Selenium WebDriver (2.25) Timeout Not Working
                            
                                How do I display and close an image with Python?
                            
                                Data type error with drawContours unless I pickle/unpickle first
                            
                                Dynamically change widget background color in Tkinter
                            
                                python compare datetimes with different timezones
                            
                                Python regex compile (with re.VERBOSE) not working
                            
                                Extract text with lxml.html
                            
                                Convert pyBarcode output to PIL Image file
                            
                                python: recurcive list processing changes original list
                            
                                win32gui get the current active application name
                            
                                Manipulating the numpy.random.exponential distribution in Python
                            
                                Bug in Python's documentation?
                            
                                python http/udp bittorrent tracker scrape library
                            
                                How to selectively import module in python?
                            
                                How do i test/refactor my tests?
                            
                                Using git submodule to import a python project

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

DBSCAN with python and scikit-learn: What exactly are the integer labes returned by make_blobs?

Tags:

python

scikit-learn

dbscan

otmezger

People also ask

1 Answers

Danica

Recent Activity

Donate For Us