
Grouping by multiple columns to find duplicate rows pandas


I have a DataFrame:

id    val1    val2
1     1.1     2.2
1     1.1     2.2
2     2.1     5.5
3     8.8     6.2
4     1.1     2.2
5     8.8     6.2

I want to group by val1 and val2 and get a similar DataFrame containing only the rows where the same val1/val2 combination occurs more than once.

Final df:

id    val1    val2
1     1.1     2.2
4     1.1     2.2
3     8.8     6.2
5     8.8     6.2
Shubham R asked Oct 09 '17


People also ask

How do you find duplicate rows in pandas based on multiple columns?

To find and select duplicate rows based on all columns, call DataFrame.duplicated() without any subset argument. It will return a Boolean Series with True at the position of each duplicated row except its first occurrence (the default value of the keep argument is 'first').

How do I find duplicate rows in pandas?

The pandas.DataFrame.duplicated() method is used to find duplicate rows in a DataFrame. It returns a Boolean Series which identifies whether each row is duplicated or unique.
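As a quick illustration of the default behavior described above, here is a minimal sketch using made-up data modeled on the question (column names are illustrative):

```python
import pandas as pd

# Hypothetical frame with repeated (val1, val2) pairs.
df = pd.DataFrame({
    "val1": [1.1, 1.1, 2.1, 8.8, 1.1, 8.8],
    "val2": [2.2, 2.2, 5.5, 6.2, 2.2, 6.2],
})

# keep='first' (the default) flags every repeat except the first occurrence.
mask = df.duplicated()
print(mask.tolist())
# [False, True, False, False, True, True]
```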

Can you group by multiple columns in pandas?

Pandas comes with a whole host of SQL-like aggregation functions you can apply when grouping on one or more columns. This is Python's closest equivalent to dplyr's group_by + summarise logic.
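For instance, grouping on two columns and counting the rows per combination can be sketched like this (data modeled on the question; names are illustrative):

```python
import pandas as pd

df = pd.DataFrame({
    "val1": [1.1, 1.1, 2.1, 8.8, 1.1, 8.8],
    "val2": [2.2, 2.2, 5.5, 6.2, 2.2, 6.2],
})

# Group on both columns at once; size() counts rows per (val1, val2) pair.
counts = df.groupby(["val1", "val2"]).size()
print(counts)
```

A count greater than 1 marks a combination that appears in multiple rows, which is exactly the condition the question asks about.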


1 Answer

You need duplicated with the subset parameter to specify the columns to check, and keep=False to mark all duplicates; then filter with the resulting mask by boolean indexing:

df = df[df.duplicated(subset=['val1','val2'], keep=False)]
print (df)
   id  val1  val2
0   1   1.1   2.2
1   1   1.1   2.2
3   3   8.8   6.2
4   4   1.1   2.2
5   5   8.8   6.2

Detail:

print (df.duplicated(subset=['val1','val2'], keep=False))
0     True
1     True
2    False
3     True
4     True
5     True
dtype: bool
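The same result can also be reached with groupby plus transform, which counts the rows sharing each (val1, val2) pair and aligns those counts back to the original index; a sketch with the question's data:

```python
import pandas as pd

df = pd.DataFrame({
    "id":   [1, 1, 2, 3, 4, 5],
    "val1": [1.1, 1.1, 2.1, 8.8, 1.1, 8.8],
    "val2": [2.2, 2.2, 5.5, 6.2, 2.2, 6.2],
})

# transform('size') broadcasts each group's row count back to every
# member row, so the result has the same length as df.
sizes = df.groupby(["val1", "val2"])["val1"].transform("size")

# Keep only rows whose (val1, val2) combination occurs more than once.
out = df[sizes > 1]
print(out)
```

This is an alternative, not the answer's method; duplicated(..., keep=False) is shorter and avoids materializing the counts.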
jezrael answered Oct 10 '22