Reshaping Pandas groupby data row values into column headers

Tags:

I am trying to extract grouped row data from a pandas groupby object so that the primary group data ('course' in the example below) act as a row index, the secondary grouped row values act as column headers ('student') and the aggregate values as the corresponding row data ('score').

So, for example, I would like to transform:

import pandas as pd
import numpy as np

data = {'course_id':[101,101,101,101,102,102,102,102] ,
    'student_id':[1,1,2,2,1,1,2,2],
    'score':[80,85,70,60,90,65,95,80]}

df = pd.DataFrame(data, columns=['course_id', 'student_id','score'])

Which I have grouped by course_id and student_id:

group = df.groupby(['course_id', 'student_id']).aggregate(np.mean)
g = pd.DataFrame(group)

Into something like this:

data = {'course':[101,102],'1':[82.5,77.5],'2':[65.0,87.5]}
g3 = pd.DataFrame(data, columns=['course', '1', '2'])

I have spent some time looking through the groupby documentation and I have trawled stack overflow and the like but I'm still not sure how to approach the problem. I would be very grateful if anyone would suggest a sensible way of achieving this for a largish dataset.

Many thanks!

Edited: to fix g3 example typo

405

asked Aug 12 '15 20:08

MrGraeme

1 Answers

>>> g.reset_index().pivot('course_id', 'student_id', 'score')
student_id     1     2
course_id             
101         82.5  65.0
102         77.5  87.5

119

answered Sep 29 '22 16:09

BrenBarn

Related questions
                            
                                parsing xml containing default namespace to get an element value using lxml
                            
                                Error 400 with python-amazon-simple-product-api via pythonanywhere
                            
                                Split by suffix with Python regular expression
                            
                                Passing args in scipy optimize.minimize objective function ( getting error on number of arguments)
                            
                                wxPython: Dragging a file into window to get file path
                            
                                pydub accessing the sampling rate(Hz) and the audio signal from an mp3 file
                            
                                TypeError: <lambda>() takes exactly 1 argument (3 given)
                            
                                How can I install netcdf4-python to ubuntu14.04?
                            
                                DrawContours() not working opencv python
                            
                                memory usage @on_trait_change vs _foo_changed()
                            
                                Django Transactions ATOMIC_REQUESTS
                            
                                Count Number of Rows Between Two Dates BY ID in a Pandas GroupBy Dataframe
                            
                                iPython notebook can't connect to kernel on google-compute-engine
                            
                                'unicode' object has no attribute 'get'
                            
                                Why does my text file keep overwriting the data on it?
                            
                                How to show Matrix in Sphinx Docs?
                            
                                scipy.sparse.hstack(([1], [2])) -> "ValueError: blocks must be 2-D". Why?
                            
                                "unbound method textFile() must be called with SparkContext instance as first argument (got str instance instead)"
                            
                                Nginx WebSocket proxying keep getting HTTP 301 redirects
                            
                                Can't access price of a Product in Django-Oscar?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Reshaping Pandas groupby data row values into column headers

Tags:

python

pandas

MrGraeme

People also ask

1 Answers

BrenBarn

Recent Activity

Donate For Us