Grouping and pivoting DataFrame with additional column for ratio of counts

I have a dataframe that looks like this:

id       status      year 
1        yes         2014
3        no          2013
2        yes         2014
4        no          2014
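
For anyone who wants to reproduce this, the sample above can be built roughly like so (a minimal sketch; I'm assuming status is stored as strings and year as integers, and the variable name df is just what I use below):

```python
import pandas as pd

# Sample rows copied from the table above.
# Assumption: status is a string column and year is an integer column.
df = pd.DataFrame({
    "id": [1, 3, 2, 4],
    "status": ["yes", "no", "yes", "no"],
    "year": [2014, 2013, 2014, 2014],
})
print(df)
```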

The actual dataframe is very large, with many ids and years. I am trying to make a new dataframe that has the percentages of 'yes' and 'no' statuses grouped by year.

I was thinking of grouping the dataframe by year, collecting each year's statuses into a list, and then counting the 'yes' and 'no' values that way, but is there a more Pythonic way to do this?

I would like for the end dataframe to look like this:

year      yes_count     no_count     ratio_yes_to_total
2013       0             1             0%
2014       2             1             67%

asked Dec 19 '18 16:12

Priya

1 Answer

I'd suggest grouping by year and status, counting, pivoting, and then creating an additional column of the ratio:

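# Count rows per (year, status), then pivot status into columns; missing pairs become 0
df2 = df.groupby(['year', 'status']).count().pivot_table(index="year", columns=["status"]).fillna(0)
# Drop the outer 'id' column level so the columns are just 'no' and 'yes'
df2.columns = df2.columns.get_level_values(1)
# Share of 'yes' among all rows for each year
df2['ratio'] = df2['yes'] / (df2['yes'] + df2['no'])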

Output

status   no  yes     ratio
year                      
2013    1.0  0.0  0.000000
2014    1.0  2.0  0.666667
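
If you want integer counts and the column names from the question, something along these lines with pd.crosstab might also work. This is only a sketch: the names counts and result are illustrative, the sample frame is rebuilt from the question, and it assumes status only ever contains 'yes' and 'no'.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "id": [1, 3, 2, 4],
    "status": ["yes", "no", "yes", "no"],
    "year": [2014, 2013, 2014, 2014],
})

# Count rows for every (year, status) pair; pairs that never occur become 0.
counts = pd.crosstab(df["year"], df["status"])

# Assumption: only 'yes' and 'no' matter. Reindexing guarantees both
# columns exist even if one status is absent from the data.
counts = counts.reindex(columns=["yes", "no"], fill_value=0)

# Rename to the column names requested in the question and add the ratio.
result = counts.rename(columns={"yes": "yes_count", "no": "no_count"})
result["ratio_yes_to_total"] = result["yes_count"] / (
    result["yes_count"] + result["no_count"]
)

# Turn 'year' back into an ordinary column and drop the leftover axis label.
result = result.reset_index()
result.columns.name = None
print(result)
```

To show the ratio as a percentage string like in the question, result["ratio_yes_to_total"].map("{:.0%}".format) should do it, at the cost of turning the column into text.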

answered Oct 26 '22 23:10

Tim

Tags: python, pandas, group-by, pandas-groupby, counting