How to remove multilevel index in pandas pivot table

Tags:

python

pandas

pivot

pivot-table

I have a dataframe as given:

import pandas as pd

df = {'TYPE' : pd.Series(['Advisory','Advisory1','Advisory2','Advisory3']),
 'CNTRY' : pd.Series(['IND','FRN','IND','FRN']),
 'VALUE' : pd.Series([1., 2., 3., 4.])}
df = pd.DataFrame(df)
df = pd.pivot_table(df,index=["CNTRY"],columns=["TYPE"]).reset_index()

After pivoting, how can I remove the multilevel column index (the VALUE level) so that df has columns like the below?

Type|CNTRY|Advisory|Advisory1|Advisory2|Advisory3
0     FRN     NaN      2.0      NaN     4.0 
1     IND     1.0      NaN      3.0     NaN


asked Jun 13 '17 06:06

Shivpe_R

2 Answers

You can add the values parameter:

df = pd.pivot_table(df,index="CNTRY",columns="TYPE", values='VALUE').reset_index()
print (df)
TYPE CNTRY  Advisory  Advisory1  Advisory2  Advisory3
0      FRN       NaN        2.0        NaN        4.0
1      IND       1.0        NaN        3.0        NaN

To remove the columns name (TYPE), use rename_axis:

df = pd.pivot_table(df,index="CNTRY",columns="TYPE", values='VALUE') \
       .reset_index().rename_axis(None, axis=1)
print (df)
  CNTRY  Advisory  Advisory1  Advisory2  Advisory3
0   FRN       NaN        2.0        NaN        4.0
1   IND       1.0        NaN        3.0        NaN
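
To see where the extra level comes from, here is a minimal sketch on the question's data. The names wide_multi, wide_flat and flat are mine, used only for illustration. Without values, pivot_table keeps the name of the pivoted column as an outer column level; with values passed as a single string, the columns have one level.

```python
import pandas as pd

df = pd.DataFrame({
    'TYPE': ['Advisory', 'Advisory1', 'Advisory2', 'Advisory3'],
    'CNTRY': ['IND', 'FRN', 'IND', 'FRN'],
    'VALUE': [1., 2., 3., 4.],
})

# No `values`: the remaining column name ('VALUE') is kept as an outer
# column level, so the columns are a two-level MultiIndex.
wide_multi = pd.pivot_table(df, index='CNTRY', columns='TYPE')
print(wide_multi.columns.nlevels)  # 2

# `values` given as a single string: only the TYPE labels remain.
wide_flat = pd.pivot_table(df, index='CNTRY', columns='TYPE', values='VALUE')
print(wide_flat.columns.nlevels)  # 1

# reset_index turns CNTRY back into a column; rename_axis drops the
# leftover 'TYPE' label that names the columns axis.
flat = wide_flat.reset_index().rename_axis(None, axis=1)
print(list(flat.columns))
```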

But maybe only pivot is needed:

df = df.pivot(index="CNTRY",columns="TYPE", values='VALUE') \
       .reset_index().rename_axis(None, axis=1)
print (df)
  CNTRY  Advisory  Advisory1  Advisory2  Advisory3
0   FRN       NaN        2.0        NaN        4.0
1   IND       1.0        NaN        3.0        NaN

because pivot_table aggregates duplicates with its default aggregate function, mean:

df = {'TYPE' : pd.Series(['Advisory','Advisory1','Advisory2','Advisory1']),
 'CNTRY' : pd.Series(['IND','FRN','IND','FRN']),
 'VALUE' : pd.Series([1., 1., 3., 4.])}
df = pd.DataFrame(df)
print (df)
  CNTRY       TYPE  VALUE
0   IND   Advisory    1.0
1   FRN  Advisory1    1.0 <-same FRN and Advisory1 
2   IND  Advisory2    3.0
3   FRN  Advisory1    4.0 <-same FRN and Advisory1 

df = df.pivot_table(index="CNTRY",columns="TYPE", values='VALUE') \
       .reset_index().rename_axis(None, axis=1)
print (df)
  CNTRY  Advisory  Advisory1  Advisory2
0   FRN       NaN        2.5        NaN
1   IND       1.0        NaN        3.0

Alternative with groupby, aggregate function and unstack:

# df here is the long-format frame with duplicates, not a pivoted result
df = df.groupby(["CNTRY","TYPE"])['VALUE'].mean().unstack(fill_value=0) \
       .reset_index().rename_axis(None, axis=1)
print (df)
  CNTRY  Advisory  Advisory1  Advisory2
0   FRN       0.0        2.5        0.0
1   IND       1.0        0.0        3.0
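
The difference between pivot and pivot_table on duplicated pairs can be checked directly. This is a small sketch; the names dup, pivot_failed, by_mean and by_sum are illustrative and not part of the answer above. pivot only reshapes and raises a ValueError on duplicates, while pivot_table aggregates them with whatever aggfunc is passed.

```python
import pandas as pd

# FRN / Advisory1 appears twice, with values 1.0 and 4.0.
dup = pd.DataFrame({
    'TYPE': ['Advisory', 'Advisory1', 'Advisory2', 'Advisory1'],
    'CNTRY': ['IND', 'FRN', 'IND', 'FRN'],
    'VALUE': [1., 1., 3., 4.],
})

# pivot does no aggregation, so duplicated (CNTRY, TYPE) pairs are an error.
try:
    dup.pivot(index='CNTRY', columns='TYPE', values='VALUE')
    pivot_failed = False
except ValueError as exc:
    pivot_failed = True
    print('pivot raised:', exc)

# pivot_table aggregates the duplicates; the default aggfunc is the mean.
by_mean = dup.pivot_table(index='CNTRY', columns='TYPE', values='VALUE')
by_sum = dup.pivot_table(index='CNTRY', columns='TYPE', values='VALUE',
                         aggfunc='sum')
print(by_mean.loc['FRN', 'Advisory1'])  # 2.5
print(by_sum.loc['FRN', 'Advisory1'])   # 5.0
```

So if silent averaging is not what you want, either use pivot (and let it fail loudly) or pass the aggfunc explicitly.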


answered Sep 22 '22 08:09

jezrael

You can use set_index with unstack:

df.set_index(['CNTRY', 'TYPE']).VALUE.unstack().reset_index()

TYPE CNTRY  Advisory  Advisory1  Advisory2  Advisory3
0      FRN       NaN        2.0        NaN        4.0
1      IND       1.0        NaN        3.0        NaN
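
If the leftover TYPE label on the columns axis should go as well, rename_axis can be chained here too. A minimal sketch on the question's original long-format data (the name out is mine):

```python
import pandas as pd

df = pd.DataFrame({
    'TYPE': ['Advisory', 'Advisory1', 'Advisory2', 'Advisory3'],
    'CNTRY': ['IND', 'FRN', 'IND', 'FRN'],
    'VALUE': [1., 2., 3., 4.],
})

out = (
    df.set_index(['CNTRY', 'TYPE'])['VALUE']  # Series indexed by (CNTRY, TYPE)
      .unstack()                              # TYPE labels become the columns
      .reset_index()                          # CNTRY back to a regular column
      .rename_axis(None, axis=1)              # drop the 'TYPE' columns name
)
print(out.columns.name)
print(list(out.columns))
```

Like pivot, unstack does not aggregate, so it raises a ValueError when a (CNTRY, TYPE) pair occurs more than once.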

answered Sep 18 '22 08:09

piRSquared
