I have two pandas df x and y, both with the same 3 columns A B C (not nullable). I need to create a new df z, obtained by "subtracting from x the rows which are entirely identical to the rows of y", i.e. a <pre class="prettyprint"><code>x left join y on x.A=y.A and x.B=y.B and x.C=y.C where y.A is null </code></pre> How would I do that? Got stuck with indexes, concat, merge, join, ... Example: <pre class="prettyprint"><code>dataframe x A B C q1 q2 q3 q4 q2 q3 q7 q2 q9 dataframe y A B C q4 q2 q3 dataframe z A B C q1 q2 q3 q7 q2 q9 </code></pre>

I think need <code>merge</code> with indicator and filter only rows from <code>left</code> <code>DataFrame</code>: <pre class="prettyprint"><code>df = x.merge(y, indicator='i', how='outer').query('i == "left_only"').drop('i', 1) print (df) A B C 0 q1 q2 q3 2 q7 q2 q93 </code></pre>

pandas left join where right is null on multiple columns

Tags:

python-3.x

pandas

I have two pandas df x and y, both with the same 3 columns A B C (not nullable). I need to create a new df z, obtained by "subtracting from x the rows which are entirely identical to the rows of y", i.e. a

x left join y on x.A=y.A and x.B=y.B and x.C=y.C
where y.A is null

How would I do that? Got stuck with indexes, concat, merge, join, ...

Example:

dataframe x
A    B    C
q1   q2   q3
q4   q2   q3
q7   q2   q9

dataframe y
A    B    C
q4   q2   q3

dataframe z
A    B    C
q1   q2   q3
q7   q2   q9

916

asked Mar 26 '18 08:03

edoedoedo

1 Answers

I think need merge with indicator and filter only rows from left DataFrame:

df = x.merge(y, indicator='i', how='outer').query('i == "left_only"').drop('i', 1)
print (df)
    A   B    C
0  q1  q2   q3
2  q7  q2  q93

149

answered Sep 27 '22 19:09

jezrael

Related questions
                            
                                How to get virtualenv to run Python 3 instead of Python 2.7?
                            
                                DeprecationWarning: Call to deprecated function get_sheet_by_name (Use wb[sheetname])
                            
                                Regex No Character Should Repeat
                            
                                Pytest running very slow for project
                            
                                Python cryptography: create a certificate signed by an existing CA, and export
                            
                                Error: ValueError: The last dimension of the inputs to `Dense` should be defined. Found `None`
                            
                                How to check what version of Virtual Env is installed
                            
                                TypeError: Only valid with DatetimeIndex, TimedeltaIndex or PeriodIndex, but got an instance of 'RangeIndex' and I can't figure out why
                            
                                Python dataclass, what's a pythonic way to validate initialization arguments?
                            
                                Using "try"+"finally" without "except" never generates any error [duplicate]
                            
                                How to Install python3.8 on debian 10?
                            
                                PANDAS & glob - Excel file format cannot be determined, you must specify an engine manually
                            
                                Python - Virtualenv , python 3?
                            
                                Python Minidom - how to iterate through attributes, and get their name and value
                            
                                Django 1.5: UserCreationForm & Custom Auth Model
                            
                                Verifying HTTPS certificates with urllib.request
                            
                                How to check if elements in list 'a' meet conditions in list 'b'?
                            
                                How to copy a directory to google cloud storage using google cloud Python API?
                            
                                Renaming a Collection Using Pymongo
                            
                                Unable to use cv_bridge with ROS Kinetic and Python3

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With