Pandas: Optimal way to MultiIndex columns

Tags:

pandas

I start with the following DataFrame:

df_1 = DataFrame({
        "Cat1" : ["a", "b"],
        "Vals1" : [1,2] ,
        "Vals2" : [3,4]
    })
df

enter image description here

I want to get it to look like this:

enter image description here

And I can do it, with this code:

df_2 = (
    pd.melt(df_1, id_vars=["Cat1"])
    .T
)
df_2.columns = (
    pd.MultiIndex
        .from_tuples(
            list(zip(df_2.loc["Cat1", :] , df_2.loc["variable", :])) ,
            names=["Cat1", None]
        )
)
df_2 = (
    df_2
    .loc[["value"], :]
    .reset_index(drop=True)
    .sortlevel(0, axis=1)
)
df_2

But there are so many steps here that I feel code smell, or at least something vaguely not pandas-idiomatic, as if I'm missing the point of something in the API. Doing the equivalent for row-based indexes is just one step, for example, via set_index(). (Note that I am aware that the columns equivalent of set_index() is still an open issue). Is there a better, more official way to do this?

913

asked Apr 04 '17 14:04

sparc_spread

2 Answers

You can use stack(), to_frame(), and T for transpose.

df_1.set_index('Cat1').stack().to_frame().T


Cat1     a           b      
     Vals1 Vals2 Vals1 Vals2
0        1     3     2     4

165

answered Oct 27 '22 21:10

Scott Boston

Think about it as a transposed dataframe. Here you go:

df.set_index('Cat1').unstack().swaplevel().sort_index().to_frame().T
Out[46]: 
Cat1     a           b      
     Vals1 Vals2 Vals1 Vals2
0        1     3     2     4

answered Oct 27 '22 21:10

Zeugma

Related questions
                            
                                Weighted bins in a distribution hist plot
                            
                                Detect a changed password in Django
                            
                                using best params from gridsearchcv
                            
                                sudo and pip not on the same path
                            
                                Python selenium not work with WebDriverWait
                            
                                Considerations for using ReLU as activation function
                            
                                How to rearrange one list based on a second list of indices [duplicate]
                            
                                python & postgresql: reliably check for updates in a specific table
                            
                                Adding global attribute using xarray
                            
                                Difference between Tensorflow convolution and numpy convolution
                            
                                Escape analysis
                            
                                Pandas - Counting quantity of commas in character field
                            
                                I deleted my dict, but my dict_keys don't mind, why is that?
                            
                                Get the inverse function of a polyfit in numpy
                            
                                error using gmail api tuto using python 3 "except errors.HttpError, error:"
                            
                                Nested merges in pandas with suffixes
                            
                                How to get round the HTTP Error 403: Forbidden with urllib.request using Python 3
                            
                                DynamoDB - How to query a nested attribute boto3
                            
                                pandas return columns in dataframe that are not in other dataframe
                            
                                How to make tkinter button widget take up full width of grid

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With