This code was working until I upgrade my python 2.x to 3.x. I have a df consisting of 3 columns ipk1, ipk2, ipk3. ipk1, ipk2, ipk3 consisting of float numbers 0 - 4.0, I would like to bin them into string. The data looks something like this: <pre class="prettyprint"><code> ipk1 ipk2 ipk3 ipk4 ipk5 jk 0 3.25 3.31 3.31 3.31 3.34 P 1 3.37 3.33 3.36 3.33 3.41 P 2 3.41 3.47 3.59 3.55 3.60 P 3 3.23 3.10 3.05 2.98 2.97 L 4 3.24 3.40 3.22 3.23 3.25 L </code></pre> on python 2.x this code works but after I upgrade it into python 3 it isn't. Is there any other way to bin it into string ? I have tried using while it also not help anything. <pre class="prettyprint"><code>train1.loc[train1['ipk1'] > 3.6, 'ipk1'] = 'A', train1.loc[(train1['ipk1']>3.2) & (train1['ipk1']<=3.6),'ipk1']='B', train1.loc[(train1['ipk1']>2.8) & (train1['ipk1']<=3.2),'ipk1']='C', train1.loc[(train1['ipk1']>2.4) & (train1['ipk1']<=2.8),'ipk1']='D', train1.loc[(train1['ipk1']>2.0) & (train1['ipk1']<=2.4),'ipk1']='E', train1.loc[(train1['ipk1']>1.6) & (train1['ipk1']<=2.0),'ipk1']='F', train1.loc[(train1['ipk1']>1.2) & (train1['ipk1']<=1.6),'ipk1']='G', train1.loc[train1['ipk1'] <= 1.2, 'ipk1'] = 'H' </code></pre> The error I receive: <pre class="prettyprint"><code>TypeError: '>' not supported between instances of 'str' and 'float' </code></pre> My expected output: <pre class="prettyprint"><code> ipk1 ipk2 ipk3 ipk4 ipk5 jk 0 B 3.31 3.31 3.31 3.34 P 1 B 3.33 3.36 3.33 3.41 P 2 B 3.47 3.59 3.55 3.60 P 3 B 3.10 3.05 2.98 2.97 L 4 B 3.40 3.22 3.23 3.25 L </code></pre>

This is a good use case for <code>pandas.cut</code>: <pre class="prettyprint"><code>bins = [-np.inf, 1.2, 1.6, 2.0, 2.4, 2.8, 3.2, 3.6, np.inf] labels = ['H', 'G', 'F', 'E', 'D', 'C', 'B', 'A'] df['ipk1'] = pd.cut(df['ipk1'], bins=bins, labels=labels) </code></pre>

How to bin column of floats with pandas

Tags:

python

pandas

dataframe

binning

This code was working until I upgrade my python 2.x to 3.x. I have a df consisting of 3 columns ipk1, ipk2, ipk3. ipk1, ipk2, ipk3 consisting of float numbers 0 - 4.0, I would like to bin them into string.

The data looks something like this:

Click to copy

    ipk1    ipk2    ipk3    ipk4    ipk5    jk
0   3.25    3.31    3.31    3.31    3.34    P
1   3.37    3.33    3.36    3.33    3.41    P
2   3.41    3.47    3.59    3.55    3.60    P
3   3.23    3.10    3.05    2.98    2.97    L
4   3.24    3.40    3.22    3.23    3.25    L

on python 2.x this code works but after I upgrade it into python 3 it isn't. Is there any other way to bin it into string ? I have tried using while it also not help anything.

Click to copy

train1.loc[train1['ipk1'] > 3.6, 'ipk1'] = 'A',
train1.loc[(train1['ipk1']>3.2) & (train1['ipk1']<=3.6),'ipk1']='B',
train1.loc[(train1['ipk1']>2.8) & (train1['ipk1']<=3.2),'ipk1']='C',
train1.loc[(train1['ipk1']>2.4) & (train1['ipk1']<=2.8),'ipk1']='D',
train1.loc[(train1['ipk1']>2.0) & (train1['ipk1']<=2.4),'ipk1']='E',
train1.loc[(train1['ipk1']>1.6) & (train1['ipk1']<=2.0),'ipk1']='F',
train1.loc[(train1['ipk1']>1.2) & (train1['ipk1']<=1.6),'ipk1']='G',
train1.loc[train1['ipk1'] <= 1.2, 'ipk1'] = 'H'

The error I receive:

Click to copy

TypeError: '>' not supported between instances of 'str' and 'float'

My expected output:

Click to copy

    ipk1    ipk2    ipk3    ipk4    ipk5    jk
0   B       3.31    3.31    3.31    3.34    P
1   B       3.33    3.36    3.33    3.41    P
2   B       3.47    3.59    3.55    3.60    P
3   B       3.10    3.05    2.98    2.97    L
4   B       3.40    3.22    3.23    3.25    L

985

asked May 24 '19 16:05

yuliansen

1 Answers

This is a good use case for pandas.cut:

Click to copy

bins = [-np.inf, 1.2, 1.6, 2.0, 2.4, 2.8, 3.2, 3.6, np.inf]
labels = ['H', 'G', 'F', 'E', 'D', 'C', 'B', 'A']

df['ipk1'] = pd.cut(df['ipk1'], bins=bins, labels=labels)

answered Sep 19 '22 22:09

cs95

Related questions
                            
                                PySpark replace value in several column at once
                            
                                How to filter out list elements that contain invalid characters in Python 3.x?
                            
                                When I save the output of displacy.render(doc, style="dep") to a svg file, there is a TypeError: write() argument must be str, not None
                            
                                Keras 2: Using lambda function in "Merge" layers
                            
                                YAML - Dumping a nested object without types/tags
                            
                                How to elevate to root within a python script?
                            
                                Python ExcelWriter formatting 'all borders'
                            
                                'ServiceAccountCredentials.from_json_keyfile_name' equivalent for remote json
                            
                                Can I pass arguments to the entrypoint of a SageMaker estimator?
                            
                                ValueError: seek of closed file Working on PyPDF2 and getting this error
                            
                                __repr__ method appears can't be invoked automatically for Exception class
                            
                                Plotting multiple bars with matplotlib using ax.bar()
                            
                                Pandas Mask on multiple Conditions
                            
                                python's json: AttributeError: 'str' object has no attribute 'keys'
                            
                                Pandas replace part of string with values from dictionary
                            
                                running the python script in command line does not print any output
                            
                                Unable to build GUI from the code from PyQt Designer
                            
                                How to get the optimal threshold from ROC curve in Python? [duplicate]
                            
                                When will/won't Python suspend execution of a coroutine?
                            
                                Issue with plyer library of python when creating a executable using pyinstaller

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to bin column of floats with pandas

Tags:

python

pandas

dataframe

binning

yuliansen

People also ask

1 Answers

cs95

Recent Activity

Donate For Us