python pandas dataframe join two dataframes [duplicate]

Tags: python, merge, join, pandas

I am trying to join two data frames. They look like this:

DF1 = ID     COUNTRY     YEAR     V1     V2     V3    V4
      12     USA         2012     x      y      z      a
      13     USA         2013     x      y      z      a
      14     RUSSIA      2012     x      y      z      a

DF2 = ID     COUNTRY     YEAR     TRACT
      9      USA         2000       A
      13     USA         2013       B

The desired end goal is:

DF3 = ID     COUNTRY     YEAR     V1     V2     V3    V4    TRACT    
      9      USA         2000                                 A
      12     USA         2012     x      y      z      a
      13     USA         2013     x      y      z      a      B
      14     RUSSIA      2012     x      y      z      a

I've been trying to use pd.merge and the .join function with the how='outer' setting, without success:

df3 = pd.merge(df1,df2,how='outer',left_on=['ID','Country','Year'],right_on=['ID',"Country","Year"])

773

asked Feb 21 '15 04:02

bjurstrs

2 Answers

Try this:

df1.merge(df2, how='outer', left_on=['ID', 'COUNTRY', 'YEAR'], right_on=['ID', 'COUNTRY', 'YEAR'])

(The column names must be in upper case to match your input tables; 'Country' and 'Year' do not exist in either frame.)
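For reference, here is a minimal sketch of that outer merge on frames rebuilt from the tables in the question. The placeholder values ('x', 'y', 'z', 'a') are copied from the question. Using `on=` in place of identical `left_on`/`right_on` lists, and sorting by ID afterwards, are choices made here for illustration and are not part of the answer above.

```python
import pandas as pd

# Frames rebuilt from the tables in the question (placeholder values as given).
df1 = pd.DataFrame({
    "ID": [12, 13, 14],
    "COUNTRY": ["USA", "USA", "RUSSIA"],
    "YEAR": [2012, 2013, 2012],
    "V1": ["x", "x", "x"],
    "V2": ["y", "y", "y"],
    "V3": ["z", "z", "z"],
    "V4": ["a", "a", "a"],
})
df2 = pd.DataFrame({
    "ID": [9, 13],
    "COUNTRY": ["USA", "USA"],
    "YEAR": [2000, 2013],
    "TRACT": ["A", "B"],
})

# The key names must match the actual column labels exactly (upper case here).
keys = ["ID", "COUNTRY", "YEAR"]

# how="outer" keeps rows that appear in either frame; cells with no match become NaN.
df3 = (
    df1.merge(df2, how="outer", on=keys)
       .sort_values("ID")
       .reset_index(drop=True)
)

print(df3)
```

Rows found in only one frame (ID 9, 12 and 14) get NaN in the columns that come from the other frame, which corresponds to the blank cells in the desired DF3.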

186

answered Oct 05 '22 13:10

JAB

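Have you tried .join? It aligns on the index rather than on columns, so set the key columns as the index on both frames first:

df1.set_index(['ID', 'COUNTRY', 'YEAR']).join(df2.set_index(['ID', 'COUNTRY', 'YEAR']), how='outer')

A bare df1.join(df2) does not work here, because both frames share the column names ID, COUNTRY and YEAR and no suffix is given.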
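As a runnable sketch of the `.join` route: `DataFrame.join` matches rows on the index, and with overlapping column names and no suffix a plain `df1.join(df2)` raises a `ValueError`. The version below moves the three key columns into the index before joining. The frames are rebuilt from the question's tables, and the final `reset_index` and sort are additions for readability only.

```python
import pandas as pd

# Frames rebuilt from the tables in the question (placeholder values as given).
df1 = pd.DataFrame({
    "ID": [12, 13, 14],
    "COUNTRY": ["USA", "USA", "RUSSIA"],
    "YEAR": [2012, 2013, 2012],
    "V1": ["x", "x", "x"],
    "V2": ["y", "y", "y"],
    "V3": ["z", "z", "z"],
    "V4": ["a", "a", "a"],
})
df2 = pd.DataFrame({
    "ID": [9, 13],
    "COUNTRY": ["USA", "USA"],
    "YEAR": [2000, 2013],
    "TRACT": ["A", "B"],
})

keys = ["ID", "COUNTRY", "YEAR"]

# Move the key columns into the index so that join can align on them.
left = df1.set_index(keys)
right = df2.set_index(keys)

# Outer join on the shared index, then turn the keys back into ordinary columns.
joined = (
    left.join(right, how="outer")
        .reset_index()
        .sort_values("ID")
        .reset_index(drop=True)
)

print(joined)
```

This gives the same rows as the outer `merge` on the three key columns; which form to use is mostly a matter of whether the keys already live in the index.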

answered Oct 05 '22 14:10

Harvey

