kernel density score VS score_samples python scikit

I have been using scikit-learn with Python for a few days, and more specifically KernelDensity. Once the model is fitted, I would like to evaluate the probability of new points. The method score() seems to be made for this, but it apparently doesn't work: when I pass an array as input, the output is a single number. I use score_samples() instead, but it is very slow.

I think that score() is not working, but I don't have the skills to improve it. Please let me know if you have any ideas.

asked Jul 10 '14 16:07

Romain

2 Answers

score() uses score_samples() as follows:

return np.sum(self.score_samples(X))

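So score() returns the total log-likelihood of X as a single number. That's why you should use score_samples() in your case, which returns one value per point.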
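The relationship above can be checked directly. This is a minimal sketch, assuming synthetic Gaussian data; the names train and X_new and the bandwidth value are illustrative choices, not taken from the question.

```python
import numpy as np
from sklearn.neighbors import KernelDensity

rng = np.random.default_rng(0)
train = rng.normal(size=(200, 1))   # illustrative training data (assumption)
X_new = rng.normal(size=(5, 1))     # illustrative points to evaluate (assumption)

kde = KernelDensity(kernel="gaussian", bandwidth=0.5).fit(train)

per_point = kde.score_samples(X_new)  # array of shape (5,): one log-density per point
total = kde.score(X_new)              # single float: sum of the per-point values

print(per_point.shape)
print(np.isclose(total, per_point.sum()))
```

So a single number from score() is expected behaviour rather than a bug: it summarizes the whole array, while score_samples() keeps the per-point detail.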

answered Sep 28 '22 02:09

slava

It's a bit hard to tell without any code, but:

Assume the points you want to evaluate are stored in an array X and you have a fitted kernel density estimate kde. Then you call:

logprobX = kde.score_samples(X)

But be careful, these are log-densities! To get the density values themselves, you also need to do:

probX = np.exp(logprobX)

These values should match your histogram (if you compute one), provided the histogram is normalized to a density.
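As a rough sanity check of the exponentiated values, one can verify that they integrate to about 1 and evaluate them at the bin centers of a density-normalized histogram. This is only a sketch under assumptions: the data, the bandwidth, the grid range and the bin count are all illustrative.

```python
import numpy as np
from sklearn.neighbors import KernelDensity

rng = np.random.default_rng(0)
train = rng.normal(size=(1000, 1))          # illustrative sample (assumption)

kde = KernelDensity(kernel="gaussian", bandwidth=0.3).fit(train)

# Evaluate on an evenly spaced grid that comfortably covers the data.
grid = np.linspace(-6.0, 6.0, 1201).reshape(-1, 1)
log_dens = kde.score_samples(grid)          # log of the density
dens = np.exp(log_dens)                     # density values

# Riemann-sum approximation of the integral of the density over the grid.
dx = grid[1, 0] - grid[0, 0]
area = dens.sum() * dx
print("approximate integral:", area)

# A histogram is only comparable when normalized with density=True.
hist, edges = np.histogram(train[:, 0], bins=30, density=True)
centers = 0.5 * (edges[:-1] + edges[1:])
kde_at_centers = np.exp(kde.score_samples(centers.reshape(-1, 1)))
```

Note that the values are densities, not probabilities: a single value can exceed 1 when the data are tightly concentrated.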

The running time of these lines depends on the length of X. On my machine, evaluating 7500 points is quite fast.
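To see how long the evaluation takes on your own machine, you can time score_samples() directly. The sketch below also tries the rtol parameter of KernelDensity, which allows an approximate result; whether it actually runs faster depends on the data, the bandwidth and the scikit-learn version, so treat it as an experiment rather than a guaranteed speed-up. The data sizes, the bandwidth and the helper timed_scores are illustrative assumptions.

```python
import time

import numpy as np
from sklearn.neighbors import KernelDensity

rng = np.random.default_rng(0)
train = rng.normal(size=(2000, 1))   # illustrative training data (assumption)
X = rng.normal(size=(7500, 1))       # 7500 points to evaluate, as in the answer


def timed_scores(kde, X):
    """Hypothetical helper: return the log-densities and the elapsed seconds."""
    start = time.perf_counter()
    log_dens = kde.score_samples(X)
    return log_dens, time.perf_counter() - start


# rtol=0.0 is the default (exact result); rtol=1e-4 permits a small relative error.
exact_kde = KernelDensity(bandwidth=0.3, rtol=0.0).fit(train)
approx_kde = KernelDensity(bandwidth=0.3, rtol=1e-4).fit(train)

exact, t_exact = timed_scores(exact_kde, X)
approx, t_approx = timed_scores(approx_kde, X)

print(f"exact: {t_exact:.3f}s  approximate: {t_approx:.3f}s")
print("max abs difference in log-density:", np.max(np.abs(exact - approx)))
```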

answered Sep 28 '22 02:09

Ben Müller

Tags: python, scikit-learn, kernel-density
