weights option for seaborn distplot?

Tags:

I'd like to have a weights option in seaborn distplot, similar to that in numpy histogram. Without this option, the only alternative would be to apply the weighting to the input array, which could result in an impractical size (and time).

317

asked Jul 29 '15 14:07

nbecker

2 Answers

You can provide weights by passing them to the underlying matplotlib's histogram function using the hist_kws argument, as:

sns.distplot(..., hist_kws={'weights': your weights array}, ...)

Take note though, that the weights will be passed only to the underlying histogram; neither the kde, nor the fit functions of the distplot will be affected.

146

answered Oct 15 '22 16:10

vlasisva

As @vlasisla already mentioned in their answer, weights should be provided through the keyword argument hist_kws so they would be passed to mathpolotlib's hist function. Though, this will not make any effect unless kde (kernel density estimation) option is disabled at the same time. This code would actually have a desired effect:

sns.distplot(x, hist_kws={'weights': x_weights}, kde=False)

To understand why both weights and kde are not allowed, let's consider the following example, where x_weights is calculated as x_weights = np.ones_like(x) / len(x) so that all bins' heights sum to 1:

# generate 1000 samples from a normal distribution
np.random.seed(8362) 
x = np.random.normal(size=1000)

# calculate weights
x_weights = np.ones_like(x) / len(x)

# figure 1 - use weights
sns.distplot(x, hist_kws={'weights': x_weights}, kde=False)
# figure 2 - default plot with kde
sns.distplot(x)

Figure 1. Using dist with weights and not KDE Figure 2. Using dist with default parameters

In Figure 1 we provided dist function with weights, so in this figure all bins' heights sum to 1. In Figure 2 the default behaviour of dist is enabled, so the area under the KDE function sums to 1 and bins' heights are normalised correspondingly. It can be easily seen now, that plotting KDE when weights are provided indeed would not make much sense.

answered Oct 15 '22 16:10

myrs

Related questions
                            
                                How to set timeout detection on a RabbitMQ server?
                            
                                How to do Python's zip in C#?
                            
                                Is there a performance gain from defining routes in app.yaml versus one large mapping in a WSGIApplication in AppEngine?
                            
                                python logging alternatives [closed]
                            
                                Python/Scipy 2D Interpolation (Non-uniform Data)
                            
                                Django: why are Django model fields class attributes?
                            
                                What's your folder layout for a Flask app divided in modules?
                            
                                pickling error in python?
                            
                                mod_wsgi and multiple installations of python
                            
                                lxml not adding newlines when inserting a new element into existing xml
                            
                                RFCOMM without pairing using PyBluez on Debian?
                            
                                Multidimensional Scaling Fitting in Numpy, Pandas and Sklearn (ValueError)
                            
                                What part of speech does "s" stand for in WordNet synsets
                            
                                selenium.common.exceptions.WebDriverException: Message: 'Can not connect to GhostDriver'
                            
                                multiprocessing.Process.is_alive() returns True although process has finished, why?
                            
                                argparse argument dependency
                            
                                Multiprocessing of shared list
                            
                                How to Zoom with Axes3D in Matplotlib
                            
                                Why does python print version info to stderr?
                            
                                How to aggregate matching pairs into "connected components" in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

weights option for seaborn distplot?

Tags:

python

matplotlib

seaborn

histogram

nbecker

People also ask

2 Answers

vlasisva

myrs

Recent Activity

Donate For Us