I have a multi-level pandas dataframe which im trying to level. I use reset_index but its giving me error that the name already exist. I dont want to use <code>reset_index(drop=True)</code> because i want to keep one of the column names still. <img src="https://i.stack.imgur.com/GSXkb.png" alt="enter image description here"> i want as my new dataframe: country,listing_neighborhood,count right now, <code>df.columns</code> only gives <code>count</code>. my code: <pre class="prettyprint"><code>df.columns = ['count'] df.reset_index() -> gives error that `ValueError: cannot insert country, already exists` </code></pre> I also tried: <code>df.columns.droplevel(0)</code> -> gives error that <code>'Index' object has no attribute 'droplevel'</code>

You can change the existing name so that it would not be duplicated anymore: <blockquote> df.reset_index(name="new_name") </blockquote> Hope this help

Pandas unable to reset index because name exist

Tags:

python

pandas

I have a multi-level pandas dataframe which im trying to level. I use reset_index but its giving me error that the name already exist.

I dont want to use reset_index(drop=True) because i want to keep one of the column names still.

enter image description here

i want as my new dataframe:

country,listing_neighborhood,count

right now,

df.columns only gives count.

my code:

df.columns = ['count']
df.reset_index() -> gives error that `ValueError: cannot insert country, already exists`

I also tried:

df.columns.droplevel(0) -> gives error that 'Index' object has no attribute 'droplevel'

451

asked Feb 13 '18 07:02

jxn

2 Answers

You need remove first duplicated level:

df = pd.DataFrame({
        'A':list('abcdef'),
         'B':[4,5,4,5,5,4],
         'C':[7,8,9,4,2,3],
         'F':list('aaabbb')
})

df = (df.set_index(['A','F','C'])
        .rename_axis(['country','country','listing_neighborhood'])
        .rename(columns={'B':'count'}))

print (df)
                                      count
country country listing_neighborhood       
a       a       7                         4
b       a       8                         5
c       a       9                         4
d       b       4                         5
e       b       2                         5
f       b       3                         4

df = df.reset_index(level=0, drop=True).reset_index()
print (df)
  country  listing_neighborhood  count
0       a                     7      4
1       a                     8      5
2       a                     9      4
3       b                     4      5
4       b                     2      5
5       b                     3      4

Or:

df = df.droplevel(0).reset_index()

answered Sep 23 '22 03:09

jezrael

You can change the existing name so that it would not be duplicated anymore:

df.reset_index(name="new_name")

Hope this help

answered Sep 23 '22 03:09

Catbuilts

Related questions
                            
                                sys.argv[1], IndexError: list index out of range [duplicate]
                            
                                difference between locals() and globals() and dir() in python
                            
                                After installing Anaconda, I get constant "KeyError: 'PYTHONPATH'" messages
                            
                                How to change SparkContext properties in Interactive PySpark session
                            
                                Checking if a variable belongs to a class in python
                            
                                Can't import cv2; "DLL load failed"
                            
                                How to pass a constant value to Python UDF?
                            
                                Create a Legend on a Folium map
                            
                                Why does scipy.norm.pdf sometimes give PDF > 1? How to correct it?
                            
                                How do I install modules on qpython3 (Android port of python)
                            
                                Where does next_batch in the TensorFlow tutorial batch_xs, batch_ys = mnist.train.next_batch(100) come from?
                            
                                Create tuple of multiple items n Times in Python
                            
                                How can I change a specific row label in a Pandas dataframe?
                            
                                How to find out the accuracy?
                            
                                SSL failure on Windows using python requests
                            
                                Wrapping C++ code with python (manually)
                            
                                [Django rest framework]: Serialize a list of strings
                            
                                Appending pandas Data Frame to Google spreadsheet
                            
                                Access standardized residuals, cook's values, hatvalues (leverage) etc. easily in Python?
                            
                                Issue in using win32com to access Excel file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With