Can one perform a left join in pandas that selects only the first match on the right?

Can one perform a left join in pandas that selects only the first match on the right? Example:

import pandas as pd

left = pd.DataFrame()
left['age'] = [11, 12]
right = pd.DataFrame()
right['age'] = [10, 11, 11]
right['salary'] = [100, 150, 200]
left.merge(right, how='left', on='age')

Returns

   age  salary
0   11     150
1   11     200
2   12     NaN

But what I would like is to preserve the number of rows of left, by merely taking the first match. That is:

   age  salary
0   11     150
2   12     NaN

So I've been using

left.merge( right.drop_duplicates(['age']), how='left', on='age')

but I believe this makes a full copy of right. And it smells funny.
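For reference, here is a self-contained sketch of that workaround. The `right_first` and `result` names are only illustrative, and the `validate="many_to_one"` argument is an addition not in the line above: it asks pandas to raise an error if the right-hand side still has duplicate keys, so the row count of left cannot grow silently.

```python
import pandas as pd

left = pd.DataFrame({"age": [11, 12]})
right = pd.DataFrame({"age": [10, 11, 11], "salary": [100, 150, 200]})

# Keep only the first row of `right` for each age (keep="first" is the default).
right_first = right.drop_duplicates(subset="age", keep="first")

# validate="many_to_one" raises MergeError if `right_first` has duplicate ages,
# which guarantees the result has exactly one row per row of `left`.
result = left.merge(right_first, how="left", on="age", validate="many_to_one")
print(result)
```

With the sample data this keeps both rows of left: age 11 picks up the first matching salary, and age 12 gets NaN.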

Is there a more elegant way?

asked Oct 08 '14 14:10

Quant

1 Answer

Yes, you can use groupby to remove the duplicate rows. Define left and right exactly as you did, then build a new DataFrame from the result of your merge:

left2 = left.merge(right, how='left', on='age')
df = left2.groupby(['age'])['salary'].first().reset_index()
df

At first I used a .min(), which will give you the minimum salary at each age, as such:

df= left2.groupby(['age'])['salary'].min().reset_index()

But you asked specifically about the first match, and for that you use .first(). Note: the .reset_index() at the end just turns the output of the groupby back into a DataFrame.
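Putting the pieces together, a minimal runnable sketch with the sample data from the question might look like this. The names `merged`, `first_match` and `min_match` are only illustrative. Two things to keep in mind: groupby returns one row per distinct age, sorted by age, so this only preserves the rows of left when its ages are unique; and .first() skips missing values by default, so it returns the first non-null salary in each group.

```python
import pandas as pd

left = pd.DataFrame({"age": [11, 12]})
right = pd.DataFrame({"age": [10, 11, 11], "salary": [100, 150, 200]})

# Plain left join: age 11 matches twice, so this has 3 rows.
merged = left.merge(right, how="left", on="age")

# Collapse back to one row per age, keeping the first salary seen.
first_match = merged.groupby("age")["salary"].first().reset_index()

# Same shape, but keeping the smallest salary per age instead.
min_match = merged.groupby("age")["salary"].min().reset_index()

print(first_match)
```

Here both variants give a salary of 150 for age 11 and NaN for age 12, because 150 happens to be both the first and the smallest match.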

answered Oct 18 '22 00:10

samus
