I created a <code>Series</code> from a <code>DataFrame</code>, when I resampled some data with a count like so: where <code>H2</code> is a <code>DataFrame</code>: <pre class="prettyprint"><code>H3=H2[['SOLD_PRICE']] H5=H3.resample('Q',how='count') H6=pd.rolling_mean(H5,4) </code></pre> This yielded a series that looks like this: <pre class="prettyprint"><code>1999-03-31 SOLD_PRICE NaN 1999-06-30 SOLD_PRICE NaN 1999-09-30 SOLD_PRICE NaN 1999-12-31 SOLD_PRICE 3.00 2000-03-31 SOLD_PRICE 3.00 </code></pre> with an index that looks like: <pre class="prettyprint"><code>MultiIndex [(1999-03-31 00:00:00, u'SOLD_PRICE'), (1999-06-30 00:00:00, u'SOLD_PRICE'), (1999-09-30 00:00:00, u'SOLD_PRICE'), (1999-12-31 00:00:00, u'SOLD_PRICE'),..... </code></pre> I don't want the second column as an index. Ideally I'd have a <code>DataFrame</code> with column 1 as "Date" and column 2 as "Sales" (dropping the second level of the index). I don't quite see how to reconfigure the index.

Just call <code>reset_index()</code>: <pre class="prettyprint"><code>In [130]: s Out[130]: 0 1 1999-03-31 SOLD_PRICE NaN 1999-06-30 SOLD_PRICE NaN 1999-09-30 SOLD_PRICE NaN 1999-12-31 SOLD_PRICE 3 2000-03-31 SOLD_PRICE 3 Name: 2, dtype: float64 In [131]: s.reset_index() Out[131]: 0 1 2 0 1999-03-31 SOLD_PRICE NaN 1 1999-06-30 SOLD_PRICE NaN 2 1999-09-30 SOLD_PRICE NaN 3 1999-12-31 SOLD_PRICE 3 4 2000-03-31 SOLD_PRICE 3 </code></pre> There are many ways to drop columns: Call <code>reset_index()</code> twice and specify a column: <pre class="prettyprint"><code>In [136]: s.reset_index(0).reset_index(drop=True) Out[136]: 0 2 0 1999-03-31 NaN 1 1999-06-30 NaN 2 1999-09-30 NaN 3 1999-12-31 3 4 2000-03-31 3 </code></pre> Delete the column after resetting the index: <pre class="prettyprint"><code>In [137]: df = s.reset_index() In [138]: df Out[138]: 0 1 2 0 1999-03-31 SOLD_PRICE NaN 1 1999-06-30 SOLD_PRICE NaN 2 1999-09-30 SOLD_PRICE NaN 3 1999-12-31 SOLD_PRICE 3 4 2000-03-31 SOLD_PRICE 3 In [139]: del df[1] In [140]: df Out[140]: 0 2 0 1999-03-31 NaN 1 1999-06-30 NaN 2 1999-09-30 NaN 3 1999-12-31 3 4 2000-03-31 3 </code></pre> Call <code>drop()</code> after resetting: <pre class="prettyprint"><code>In [144]: s.reset_index().drop(1, axis=1) Out[144]: 0 2 0 1999-03-31 NaN 1 1999-06-30 NaN 2 1999-09-30 NaN 3 1999-12-31 3 4 2000-03-31 3 </code></pre> Then, after you've reset your index, just rename the columns <pre class="prettyprint"><code>In [146]: df.columns = ['Date', 'Sales'] In [147]: df Out[147]: Date Sales 0 1999-03-31 NaN 1 1999-06-30 NaN 2 1999-09-30 NaN 3 1999-12-31 3 4 2000-03-31 3 </code></pre>

Pandas reset index on series to remove multiindex

Tags:

python

pandas

I created a Series from a DataFrame, when I resampled some data with a count like so: where H2 is a DataFrame:

H3=H2[['SOLD_PRICE']] H5=H3.resample('Q',how='count') H6=pd.rolling_mean(H5,4)

This yielded a series that looks like this:

1999-03-31  SOLD_PRICE     NaN 1999-06-30  SOLD_PRICE     NaN 1999-09-30  SOLD_PRICE     NaN 1999-12-31  SOLD_PRICE    3.00 2000-03-31  SOLD_PRICE    3.00

with an index that looks like:

MultiIndex [(1999-03-31 00:00:00, u'SOLD_PRICE'), (1999-06-30 00:00:00, u'SOLD_PRICE'), (1999-09-30 00:00:00, u'SOLD_PRICE'), (1999-12-31 00:00:00, u'SOLD_PRICE'),.....

I don't want the second column as an index. Ideally I'd have a DataFrame with column 1 as "Date" and column 2 as "Sales" (dropping the second level of the index). I don't quite see how to reconfigure the index.

621

asked Sep 04 '13 21:09

dartdog

1 Answers

Just call reset_index():

In [130]: s Out[130]: 0           1 1999-03-31  SOLD_PRICE   NaN 1999-06-30  SOLD_PRICE   NaN 1999-09-30  SOLD_PRICE   NaN 1999-12-31  SOLD_PRICE     3 2000-03-31  SOLD_PRICE     3 Name: 2, dtype: float64  In [131]: s.reset_index() Out[131]:             0           1   2 0  1999-03-31  SOLD_PRICE NaN 1  1999-06-30  SOLD_PRICE NaN 2  1999-09-30  SOLD_PRICE NaN 3  1999-12-31  SOLD_PRICE   3 4  2000-03-31  SOLD_PRICE   3

There are many ways to drop columns:

Call reset_index() twice and specify a column:

In [136]: s.reset_index(0).reset_index(drop=True) Out[136]:             0   2 0  1999-03-31 NaN 1  1999-06-30 NaN 2  1999-09-30 NaN 3  1999-12-31   3 4  2000-03-31   3

Delete the column after resetting the index:

In [137]: df = s.reset_index()  In [138]: df Out[138]:             0           1   2 0  1999-03-31  SOLD_PRICE NaN 1  1999-06-30  SOLD_PRICE NaN 2  1999-09-30  SOLD_PRICE NaN 3  1999-12-31  SOLD_PRICE   3 4  2000-03-31  SOLD_PRICE   3  In [139]: del df[1]  In [140]: df Out[140]:             0   2 0  1999-03-31 NaN 1  1999-06-30 NaN 2  1999-09-30 NaN 3  1999-12-31   3 4  2000-03-31   3

Call drop() after resetting:

In [144]: s.reset_index().drop(1, axis=1) Out[144]:             0   2 0  1999-03-31 NaN 1  1999-06-30 NaN 2  1999-09-30 NaN 3  1999-12-31   3 4  2000-03-31   3

Then, after you've reset your index, just rename the columns

In [146]: df.columns = ['Date', 'Sales']  In [147]: df Out[147]:          Date  Sales 0  1999-03-31    NaN 1  1999-06-30    NaN 2  1999-09-30    NaN 3  1999-12-31      3 4  2000-03-31      3

178

answered Sep 28 '22 05:09

Phillip Cloud

Related questions
                            
                                tkinter gui layout using frames and grid
                            
                                pyspark: ValueError: Some of types cannot be determined after inferring
                            
                                Is there an Object spread syntax in python 2.7x like in Javascript?
                            
                                Plotly: How to set the range of the y axis?
                            
                                Python: How do you insert into a list by slicing?
                            
                                Python: Sharing global variables between modules and classes therein
                            
                                Python: try-except as an Expression?
                            
                                Python socket bind to any IP?
                            
                                ImportError: Could not import settings
                            
                                Compressing `x if x else y` statement in Python
                            
                                Filter with Array column with Postgres and SQLAlchemy
                            
                                Why did Django 1.9 replace tuples () with lists [] in settings and URLs?
                            
                                Using a regular expression to replace upper case repeated letters in python with a single lowercase letter
                            
                                Is there a gi.repository documentation for python?
                            
                                Python 2.7 or Python 3 (for speed)? [closed]
                            
                                Negative form of isinstance() in Python
                            
                                User groups and permissions
                            
                                How can I create a Python timestamp with millisecond granularity?
                            
                                Inserting a row at a specific location in a 2d array in numpy?
                            
                                How to put items into priority queues?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With