How to move pandas data from index to column after multiple groupby

Tags: python, pandas, pandas-groupby, multi-index

I have the following pandas dataframe:

           token  year  uses  books
    386  xanthos  1830     3      3
    387  xanthos  1840     1      1
    388  xanthos  1840     2      2
    389  xanthos  1868     2      2
    390  xanthos  1875     1      1

I aggregate the rows that share the same token and year like so:

    import numpy as np

    dfalph = dfalph[['token','year','uses','books']].groupby(['token', 'year']).agg([np.sum])
    dfalph.columns = dfalph.columns.droplevel(1)

                  uses  books
    token   year
    xanthos 1830     3      3
            1840     3      3
            1868     2      2
            1875     1      1

Instead of having the 'token' and 'year' fields in the index, I would like to return them to columns and have an integer index.

990

asked Feb 13 '14 23:02

prooffreader

1 Answer

Method #1: reset_index()

    >>> g
                  uses  books
                   sum    sum
    token   year
    xanthos 1830     3      3
            1840     3      3
            1868     2      2
            1875     1      1

    [4 rows x 2 columns]
    >>> g = g.reset_index()
    >>> g
         token  year  uses  books
                       sum    sum
    0  xanthos  1830     3      3
    1  xanthos  1840     3      3
    2  xanthos  1868     2      2
    3  xanthos  1875     1      1

    [4 rows x 4 columns]
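For a self-contained version of Method #1, here is a minimal sketch. It rebuilds the five sample rows from the question by hand (the variable names `df`, `grouped` and `flat` are illustrative, not from the original post) and calls `.sum()` directly instead of `.agg([np.sum])`, so the result has flat column labels and no `droplevel` step is needed.

```python
import pandas as pd

# Sample rows copied from the question; `df` is an illustrative name.
df = pd.DataFrame(
    {
        "token": ["xanthos"] * 5,
        "year": [1830, 1840, 1840, 1868, 1875],
        "uses": [3, 1, 2, 2, 1],
        "books": [3, 1, 2, 2, 1],
    },
    index=range(386, 391),
)

# Grouping by two keys puts 'token' and 'year' into a MultiIndex.
grouped = df.groupby(["token", "year"])[["uses", "books"]].sum()

# reset_index() moves every index level back into ordinary columns
# and replaces the index with a default integer range starting at 0.
flat = grouped.reset_index()
print(flat)
```

If only one level should become a column, `reset_index` also accepts a `level` argument, for example `grouped.reset_index(level="year")`.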

Method #2: don't make the index in the first place, using as_index=False

    >>> g = dfalph[['token', 'year', 'uses', 'books']].groupby(['token', 'year'], as_index=False).sum()
    >>> g
         token  year  uses  books
    0  xanthos  1830     3      3
    1  xanthos  1840     3      3
    2  xanthos  1868     2      2
    3  xanthos  1875     1      1

    [4 rows x 4 columns]
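Method #2 can be tried the same way. The sketch below is a runnable version under the same assumptions: the data is typed in from the question's sample, and `df` and `flat` are illustrative names.

```python
import pandas as pd

# Sample rows copied from the question; `df` is an illustrative name.
df = pd.DataFrame(
    {
        "token": ["xanthos"] * 5,
        "year": [1830, 1840, 1840, 1868, 1875],
        "uses": [3, 1, 2, 2, 1],
        "books": [3, 1, 2, 2, 1],
    },
    index=range(386, 391),
)

# as_index=False keeps the grouping keys as regular columns, so the
# aggregated result already has a default integer index.
flat = df.groupby(["token", "year"], as_index=False)[["uses", "books"]].sum()
print(flat)
```

Which method to use is mostly a matter of taste: `as_index=False` avoids building the MultiIndex at all, while `reset_index()` is handy when the grouped frame already exists.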

107

answered Oct 02 '22 11:10

DSM
