How to pivot one column containing strings in a dataframe? [duplicate]

Tags:

I am trying to reshape a pandas dataframe, by turning one of the columns in the data, into rows (by pivoting or unstacking).

I am new to this, so likely that I'm missing something obvious. I've searched extensively, but have not been able to successfully apply any solutions that I've come across.

df
    Location    Month       Metric       Value
0   Texas       January     Temperature  10
1   New York    January     Temperature  20
2   California  January     Temperature  30
3   Alaska      January     Temperature  40
4   Texas       January     Color        Red
5   New York    January     Color        Blue
6   California  January     Color        Green
7   Alaska      January     Color        Yellow
8   Texas       February    Temperature  15
9   New York    February    Temperature  25
10  California  February    Temperature  35
11  Alaska      February    Temperature  NaN
12  Texas       February    Color        NaN
13  New York    February    Color        Purple
14  California  February    Color        Orange
15  Alaska      February    Color        Brown

I am trying to "pivot" the Metric values into columns. End goal is a result like this:

Location    Month     Temperature   Color
Texas       January   10            Red
New York    January   20            Blue
California  January   30            Green
Alaska      January   40            Yellow
Texas       February  15    
New York    February  25            Purple
California  February  35            Orange
Alaska      February                Brown

I have tried using pivot, pivot_table, as well as unstack methods, but I'm sure I'm missing something. Many of the complications seem to come because I am mixing strings with numbers, and have some missing values in the data as well.

This is the closest I have been able to get so far, but I don't want extra rows for the month column, resulting in more blank values:

df.set_index(['Location','Month','Metric'], append=True, inplace=True)
df.unstack()

    Value
    Metric              Color   Temperature
    Location    Month       
0   Texas       January None    10
1   New York    January None    20
2   California  January None    30
3   Alaska      January None    40
4   Texas       January Red     None
5   New York    January Blue    None
6   California  January Green   None
7   Alaska      January Yellow  None

Any help here would be greatly appreciated. This seems like something that most likely has a simple solution available.

736

asked Feb 28 '18 11:02

brendxn

1 Answers

A pivot solution to what you need. The output is semantics to what you want -

Metric                Color Temperature
Location   Month                       
Alaska     February   Brown         NaN
           January   Yellow          40
California February  Orange          35
           January    Green          30
New York   February  Purple          25
           January     Blue          20
Texas      February     NaN          15
           January      Red          10

Code -

df_p = df.pivot_table(index=['Location', 'Month'], columns=['Metric'], values='Value', aggfunc=np.sum)

123

answered Oct 30 '22 23:10

Vivek Kalyanarangan

Related questions
                            
                                Open chrome with default user profile using python and selenium on mac
                            
                                Comparable types with mypy
                            
                                Where is `_softmax_cross_entropy_with_logits` defined in tensorflow?
                            
                                Pandas split CSV into multiple CSV's (or DataFrames) by a column
                            
                                OpenCV: apply Rotation matrix from Rodrigues() to a point
                            
                                What is the run time of the set difference function in Python?
                            
                                How to implement the derivative of Leaky Relu in python?
                            
                                Group rows by overlapping ranges
                            
                                Three different types of output when reading an image with three different libraries in Python
                            
                                Subtract each row of matrix A from every row of matrix B without loops
                            
                                basemap ImportError: No module named 'mpl_toolkits.basemap'
                            
                                Flask validate_on_submit always False
                            
                                What are the "parts" in a multipart email?
                            
                                Detect when multiprocessing queue is empty and closed
                            
                                Python: Raise square matrix to negative half power
                            
                                Can I get the shape of a numpy save file without reading the entire contents (e.g. memmap)
                            
                                Move mouse cursor to second monitor using pyautogui
                            
                                Hide command prompt in Selenium ChromeDriver
                            
                                Mock method which returns same value passed as argument
                            
                                Difference between slash operator and comma separator in pathlib Path

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to pivot one column containing strings in a dataframe? [duplicate]

Tags:

python

python-3.x

pandas

pivot

pivot-table

brendxn

People also ask

1 Answers

Vivek Kalyanarangan

Recent Activity

Donate For Us