I have the following df: <pre class="prettyprint"><code> TAN.SK SHA.LO A 0.05 0.01 S 0.04 0.44 D 0.08 -0.18 </code></pre> I would like the new df to be like: <pre class="prettyprint"><code> TAN SHA A 0.05 0.01 S 0.04 0.44 D 0.08 -0.18 </code></pre> Basically remove from the column names <code>.SK</code> and <code>.LO</code> This is what I have tried: <pre class="prettyprint"><code>df.rename(columns=lambda x: x.split('.')[0]) df.columns=df.split('.')[0] </code></pre> This second case works perfectly in case of <code>df.index</code>

I think faster, if many columns, is use vectorized solution with <code>str.split</code> and then select first <code>lists</code> by <code>str[0]</code>: <pre class="prettyprint"><code>print (df.columns.str.split('.')) Index([['TAN', 'SK'], ['SHA', 'LO']], dtype='object') df.columns = df.columns.str.split('.').str[0] print (df) TAN SHA A 0.05 0.01 S 0.04 0.44 D 0.08 -0.18 </code></pre>

Why I can't rename the columns?

Tags:

python

pandas

I have the following df:

          TAN.SK    SHA.LO
A         0.05      0.01   
S         0.04      0.44
D         0.08     -0.18

I would like the new df to be like:

          TAN        SHA
A         0.05      0.01   
S         0.04      0.44
D         0.08     -0.18

Basically remove from the column names .SK and .LO

This is what I have tried:

df.rename(columns=lambda x: x.split('.')[0])

df.columns=df.split('.')[0]

This second case works perfectly in case of df.index

965

asked May 07 '17 15:05

JamesHudson81

2 Answers

DataFrame.rename() does NOT change the DataFrame in place (per default), so you have to assign it back:

In [134]: df = df.rename(columns=lambda x: x.split('.')[0])

In [135]: df
Out[135]:
    TAN   SHA
A  0.05  0.01
S  0.04  0.44
D  0.08 -0.18

In [139]: df.rename(columns=lambda x: x.split('.')[0], inplace=True)

In [140]: df
Out[140]:
    TAN   SHA
A  0.05  0.01
S  0.04  0.44
D  0.08 -0.18

153

answered Sep 19 '22 12:09

MaxU - stop WAR against UA

I think faster, if many columns, is use vectorized solution with str.split and then select first lists by str[0]:

print (df.columns.str.split('.'))
Index([['TAN', 'SK'], ['SHA', 'LO']], dtype='object')

df.columns = df.columns.str.split('.').str[0]
print (df)
    TAN   SHA
A  0.05  0.01
S  0.04  0.44
D  0.08 -0.18

answered Sep 21 '22 12:09

jezrael

Related questions
                            
                                Control individual linewidths in seaborn heatmap
                            
                                Using a string to define Numpy array slice
                            
                                color percentage in image python opencv using histogram
                            
                                Make an input optional in Python [duplicate]
                            
                                Get Data JSON in Flask
                            
                                Sort each column of an numpy.ndarray using the output of numpy.argsort
                            
                                Measure of image similarity for feature matching?
                            
                                Django fails to find psycopg2 module
                            
                                Pandas analogue of SQL's "NOT IN" operator
                            
                                Python: Cannot find and install python module 'video'
                            
                                Quoting parameter in pandas read_csv()
                            
                                Pyspark: cast array with nested struct to string
                            
                                Replicating YEARFRAC() function from Excel in Python
                            
                                python ntlk donwload gives parser eror
                            
                                ValueError: not enough values to unpack (expected 3, got 2)
                            
                                Which decision_function_shape for sklearn.svm.SVC when using OneVsRestClassifier?
                            
                                Find Max from each column in data frame
                            
                                Keras custom metric iteration
                            
                                AttributeError: module 'numpy' has no attribute 'core'
                            
                                Inheriting the class dictionary with metaclasses

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With