I have a dataframe like this:
df = pd.DataFrame(
    [['101', 'a', 'in', '10'],
     ['101', 'a', 'out', '10'], 
     ['102', 'b', 'in', '20'], 
     ['103', 'c', 'in', '30'],
     ['103', 'c', 'out', '40']], 
    columns=['col1', 'col2', 'col3', 'col4']
)
I want to group by col1 and find paired records that have the same value in col2 and col4, but one has 'in' in col3 one has 'out' in col3. The expected outcome is:
df_out = pd.DataFrame(
    [['101', 'a', 'in', '10'],
     ['101', 'a', 'out', '10']],
    columns=['col1', 'col2', 'col3', 'col4']
)
Thank you for the help.
Let us try transform with nunique
out = df[df.groupby(['col1','col2','col4'])['col3'].transform('nunique')==2]
Out[187]: 
  col1 col2 col3 col4
0  101    a   in   10
1  101    a  out   10
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With