Python Pandas max value in a group as a new column

I am trying to calculate a new column which contains maximum values for each of several groups. I'm coming from a Stata background so I know the Stata code would be something like this:

by group, sort: egen max = max(odds)

For example:

data = {'group': ['A', 'A', 'B', 'B'], 'odds': [85, 75, 60, 65]}

Then I would like it to look like:

    group    odds    max
    A        85      85
    A        75      85
    B        60      65
    B        65      65

Eventually I am trying to form a column that takes 1/(max-min) * odds where max and min are for each group.

asked Feb 25 '16 23:02

Vicki

1 Answer

Use groupby + transform:

df['max'] = df.groupby('group')['odds'].transform('max')

This is equivalent to the verbose:

maxima = df.groupby('group')['odds'].max()
df['max'] = df['group'].map(maxima)

The transform method returns a result with the same index as the original DataFrame, repeating each group's maximum on every row of that group, so no explicit mapping is required.
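The same pattern should extend to the 1/(max-min) * odds column mentioned in the question. The following is a minimal sketch, assuming the sample data is loaded into a DataFrame named df; the column names min and scaled are illustrative and do not come from the original post.

```python
import pandas as pd

# Sample data from the question, loaded into a DataFrame
df = pd.DataFrame({'group': ['A', 'A', 'B', 'B'],
                   'odds': [85, 75, 60, 65]})

# Compute per-group statistics; each result has one value per original row
grouped = df.groupby('group')['odds']
group_max = grouped.transform('max')
group_min = grouped.transform('min')

df['max'] = group_max
df['min'] = group_min  # illustrative column name

# 1/(max-min) * odds, evaluated row by row.
# Assumption: every group has max != min; otherwise this divides by zero.
df['scaled'] = df['odds'] / (df['max'] - df['min'])  # illustrative column name

print(df)
```

For the sample data, group A has a range of 10 and group B a range of 5, so the scaled column comes out as 8.5, 7.5, 12.0 and 13.0. A group with a single row, or with identical odds, would need separate handling because its range is zero.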

answered Sep 30 '22 14:09

jpp
