list of columns in common in two pandas dataframes

Tags:

I'm considering merge operations on dataframes each with a large number of columns. Don't want the result to have two columns with the same name. Am trying to view a list of column names in common between the two frames:

import pandas as pd

a = [{'A': 3, 'B': 5, 'C': 3, 'D': 2},{'A': 2,  'B': 4, 'C': 3, 'D': 9}]
df1 = pd.DataFrame(a)
b = [{'F': 0,  'M': 4,  'B': 2,  'C': 8 },{'F': 2,  'M': 4, 'B': 3, 'C': 9}]
df2 = pd.DataFrame(b)

df1.columns
>> Index(['A', 'B', 'C', 'D'], dtype='object')
df2.columns
>> Index(['B', 'C', 'F', 'M'], dtype='object')

(df2.columns).isin(df1.columns)
>> array([ True,  True, False, False])

How do I operate that NumPy boolean array on the Index object so it just gives back a list of the columns in common?

402

asked Jan 31 '18 09:01

cardamom

1 Answers

Use numpy.intersect1d or intersection:

a = np.intersect1d(df2.columns, df1.columns)
print (a)
['B' 'C']

a = df2.columns.intersection(df1.columns)
print (a)
Index(['B', 'C'], dtype='object')

Alternative syntax for the latter option:

df1.columns & df2.columns

159

answered Oct 17 '22 06:10

jezrael

Related questions
                            
                                Django DecimalField generating "quantize result has too many digits for current context" error on save
                            
                                Keep finite entries only in Pandas
                            
                                Read Space-separated Data with Pandas [duplicate]
                            
                                In pandas/python, reading array stored as string
                            
                                Django - Filter queryset by CharField value length
                            
                                Saving nltk drawn parse tree to image file
                            
                                How to install pygments on Ubuntu?
                            
                                Numpy sort ndarray on multiple columns
                            
                                Concatenate python string from list entries [duplicate]
                            
                                Change Flask logs from INFO to DEBUG
                            
                                How to get table names using sqlite3 through python? [duplicate]
                            
                                PyMongo create_index only if it does not exist
                            
                                Bulk saving complex objects SQLAlchemy
                            
                                How to GET data in Flask from AJAX post
                            
                                Write to a file with sudo privileges in Python
                            
                                How do I get a regex pattern type for MyPy
                            
                                Open a csv.gz file in Python and print first 100 rows
                            
                                Plotting a time series?
                            
                                Python json.loads changes the order of the object
                            
                                Is pd.get_dummies one-hot encoding?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

list of columns in common in two pandas dataframes

Tags:

python

python-3.x

pandas

cardamom

People also ask

1 Answers

jezrael

Recent Activity

Donate For Us