
Pandas GroupBy and select rows with the minimum value in a specific column


I am grouping my dataset by column A and then would like to take the minimum value in column B and the corresponding value in column C.

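    import pandas as pd

    # Constructor values match the six rows displayed below
    data = pd.DataFrame({'A': [1, 1, 1, 2, 2, 2],
                         'B': [4, 5, 2, 7, 4, 6],
                         'C': [3, 4, 10, 2, 4, 6]})
    data

       A  B   C
    0  1  4   3
    1  1  5   4
    2  1  2  10
    3  2  7   2
    4  2  4   4
    5  2  6   6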

and I would like to get:

       A  B   C
    0  1  2  10
    1  2  4   4

For the moment I am grouping by A and building a key that tells me which rows to keep in my dataset:

    a = data.groupby('A').min()
    a['A'] = a.index
    to_keep = [str(x[0]) + str(x[1]) for x in a[['A', 'B']].values]
    data['id'] = data['A'].astype(str) + data['B'].astype('str')
    data[data['id'].isin(to_keep)]

I am sure that there is a much more straightforward way to do this. I have seen many answers here that use multi-indexing, but I would like to do this without adding a multi-index to my dataframe. Thank you for your help.
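The keep-the-matching-rows idea described above can also be sketched without string keys, by comparing B against its per-group minimum. This is only an illustration on the sample data; the names `group_min` and `result` are assumptions and do not appear in the original post. Note that a mask like this keeps every row that ties for the group minimum.

```python
import pandas as pd

data = pd.DataFrame({'A': [1, 1, 1, 2, 2, 2],
                     'B': [4, 5, 2, 7, 4, 6],
                     'C': [3, 4, 10, 2, 4, 6]})

# Broadcast each group's minimum of B back onto the original rows.
group_min = data.groupby('A')['B'].transform('min')

# Keep the rows whose B equals the minimum of their group (all ties are kept).
result = data[data['B'] == group_min]
print(result)
```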


asked Jan 31 '19 23:01

Wendy


2 Answers

I feel like you're overthinking this. Just use groupby and idxmin:

    df.loc[df.groupby('A').B.idxmin()]

       A  B   C
    2  1  2  10
    4  2  4   4

    df.loc[df.groupby('A').B.idxmin()].reset_index(drop=True)

       A  B   C
    0  1  2  10
    1  2  4   4
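For reference, here is a self-contained sketch of the same approach, broken into steps on the question's sample data. It assumes `df` is the frame the question calls `data`; the intermediate names `min_labels`, `rows` and `rows_renumbered` are introduced here purely for illustration.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 1, 1, 2, 2, 2],
                   'B': [4, 5, 2, 7, 4, 6],
                   'C': [3, 4, 10, 2, 4, 6]})

# For each group in A, idxmin gives the index label of the row with the smallest B.
min_labels = df.groupby('A')['B'].idxmin()

# .loc selects those rows by label, so the matching C value comes along.
rows = df.loc[min_labels]

# Optionally discard the original labels and renumber from 0.
rows_renumbered = rows.reset_index(drop=True)
print(rows_renumbered)
```

If several rows in a group share the minimum, `idxmin` reports only the first of them, so this returns exactly one row per group.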


answered Oct 08 '22 18:10

cs95

I had a similar situation, but with a column name containing a space (e.g. "B val"). Attribute access such as .B does not work for a name like that, so the column has to be selected with brackets:

    df.loc[df.groupby('A')['B val'].idxmin()]
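A minimal runnable sketch of this case, assuming the question's sample values with column B renamed to "B val" (the data and the name `result` are hypothetical and used only for illustration):

```python
import pandas as pd

# Hypothetical frame: the question's values, with column B renamed to 'B val'.
df = pd.DataFrame({'A': [1, 1, 1, 2, 2, 2],
                   'B val': [4, 5, 2, 7, 4, 6],
                   'C': [3, 4, 10, 2, 4, 6]})

# Attribute access cannot express a name with a space, so use bracket selection.
result = df.loc[df.groupby('A')['B val'].idxmin()]
print(result)
```

Bracket selection also works for ordinary names like 'B', so it is a reasonable default when column names are not known in advance.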

answered Oct 08 '22 18:10

Juho
