After using transpose on a dataframe there is always an extra row as a remainder from the initial dataframe's index for example: <pre class="prettyprint"><code>import pandas as pd df = pd.DataFrame({'fruit':['apple','banana'],'number':[3,5]}) df fruit number 0 apple 3 1 banana 5 df.transpose() 0 1 fruit apple banana number 3 5 </code></pre> Even when i have no index: <pre class="prettyprint"><code>df.reset_index(drop = True, inplace = True) df fruit number 0 apple 3 1 banana 5 df.transpose() 0 1 fruit apple banana number 3 5 </code></pre> The problem is that when I save the dataframe to a csv file by: <pre class="prettyprint"><code>df.to_csv(f) </code></pre> this extra row stays at the top and I have to remove it manually every time. Also this doesn't work: <pre class="prettyprint"><code> df.to_csv(f, index = None) </code></pre> because the old index is no longer considered an index (just another row...). It also happened when I transposed the other way around and I got an extra column which i could not remove. Any tips?

Instead of removing the extra index, why don't try setting the new index that you want and then use slicing ? step 1: Set the new index you want: <code>df.columns = df.iloc[0]</code> step 2: Create a new dataframe removing extra row. <code>df_new = df[1:]</code>

How to remove the extra row (or column) after transpose() in Pandas

Tags:

python

pandas

csv

transpose

After using transpose on a dataframe there is always an extra row as a remainder from the initial dataframe's index for example:

import pandas as pd

df = pd.DataFrame({'fruit':['apple','banana'],'number':[3,5]})
df
    fruit  number
0   apple       3
1  banana       5
df.transpose()
        0       1
fruit   apple  banana
number      3       5

Even when i have no index:

df.reset_index(drop = True, inplace = True)
df
    fruit  number
0   apple       3
1  banana       5

df.transpose()
        0       1
fruit   apple  banana
number      3       5

The problem is that when I save the dataframe to a csv file by:

df.to_csv(f)

this extra row stays at the top and I have to remove it manually every time.

Also this doesn't work:

 df.to_csv(f, index = None)

because the old index is no longer considered an index (just another row...).

It also happened when I transposed the other way around and I got an extra column which i could not remove.

Any tips?

336

asked Jul 01 '16 15:07

Helena K

2 Answers

I had the same problem, I solved it by reseting index before doing the transpose. I mean df.set_index('fruit').transpose():

import pandas as pd

df = pd.DataFrame({'fruit':['apple','banana'],'number':[3,5]})
df
    fruit   number
0   apple   3
1   banana  5

And df.set_index('fruit').transpose() gives:

fruit   apple   banana
number  3       5

194

answered Oct 15 '22 23:10

user1742571

Instead of removing the extra index, why don't try setting the new index that you want and then use slicing ?

step 1: Set the new index you want:
df.columns = df.iloc[0]
step 2: Create a new dataframe removing extra row.
df_new = df[1:]

answered Oct 15 '22 22:10

Radhika Nair

Related questions
                            
                                python requests on Google App Engine not working for HTTPS
                            
                                Flask unit testing: Getting the response's redirect location
                            
                                Accessing argument values for argparse in Python
                            
                                Why is super used so much in PySide/PyQt?
                            
                                What are __signature__ and __text_signature__ used for in Python 3.4
                            
                                Writing hex data into a file
                            
                                Python imports relative path
                            
                                How can I display an image using Pillow?
                            
                                Python 3 exception deletes variable in enclosing scope for unknown reason [duplicate]
                            
                                How to create ternary contour plot in Python?
                            
                                How can I keep test data after Django tests complete?
                            
                                Memory efficient sort of massive numpy array in Python
                            
                                What is the difference between skew and kurtosis functions in pandas vs. scipy?
                            
                                ValueError: setting an array element with a sequence. for Pandas
                            
                                Reorder levels of MultiIndex in a pandas DataFrame
                            
                                How to replace all values in a Pandas Dataframe not in a list? [duplicate]
                            
                                Using Boto3 to interact with amazon Aurora on RDS
                            
                                Average of a numpy array returns NaN
                            
                                overcome Graphdef cannot be larger than 2GB in tensorflow
                            
                                interpolate missing values 2d python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With