Python Pandas Create New Bin/Bucket Variable with pd.qcut

Tags:

How do you create a new Bin/Bucket Variable using pd.qut in python?

This might seem elementary to experienced users but I was not super clear on this and it was surprisingly unintuitive to search for on stack overflow/google. Some thorough searching yielded this (Assignment of qcut as new column) but it didn't quite answer my question because it didn't take the last step and put everything into bins (i.e. 1,2,...).

413

asked Feb 10 '15 22:02

sfortney

1 Answers

In Pandas 0.15.0 or newer, pd.qcut will return a Series, not a Categorical if the input is a Series (as it is, in your case) or if labels=False. If you set labels=False, then qcut will return a Series with the integer indicators of the bins as values.

So to future-proof your code, you could use

data3['bins_spd'] = pd.qcut(data3['spd_pct'], 5, labels=False)

or, pass a NumPy array to pd.qcut so you get a Categorical as the return value. Note that the Categorical attribute labels is deprecated. Use codes instead:

data3['bins_spd'] = pd.qcut(data3['spd_pct'].values, 5).codes

161

answered Sep 23 '22 14:09

unutbu

Related questions
                            
                                Equivalent of "in" keyword or subquery in pandas
                            
                                Implementation of NoneType, Reasons and Details
                            
                                how do I redraw an image using python's matplotlib?
                            
                                How to apply hierarchy or multi-index to pandas columns
                            
                                Make a Pandas MultiIndex from a product of iterables?
                            
                                Fastest way to load numeric data into python/pandas/numpy array from MySQL
                            
                                Python: solving unicode hell with unidecode
                            
                                Pyplot: Shared axes and no space between subplots
                            
                                What is the point of `cursor` class in psycopg?
                            
                                Python coordinate transformation ECI to ECEF
                            
                                AttributeError: 'NoneType' object has no attribute 'split'
                            
                                Difference between `yield from foo()` and `for x in foo(): yield x`
                            
                                Mac - Python - import error: "No module named site"
                            
                                creating a boolean array which compares numpy elements to None
                            
                                How do I use re.search starting from a certain index in the string?
                            
                                Sklearn.KMeans() : Get class centroid labels and reference to a dataset
                            
                                Is asyncio's loop.run_in_executor thread-safe?
                            
                                I can't install 'pip' for python
                            
                                Load json file in python
                            
                                Popen waiting for child process even when the immediate child has terminated

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python Pandas Create New Bin/Bucket Variable with pd.qcut

Tags:

python

pandas

buckets

bins

sfortney

People also ask

1 Answers

unutbu

Recent Activity

Donate For Us