Join two DataFrames on one key column / ERROR: 'columns overlap but no suffix specified'

Tags:

python

sql

join

pandas

syntax-error

I have two tables, a sales table and a product table, that share a 'PART NUMBER' column. 'PART NUMBER' is not unique in the sales table, but it is unique in the product table (see the snapshots of both tables below).

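[Images: snapshots of the sales table and the product table: https://i.stack.imgur.com/S1t7w.png, https://i.stack.imgur.com/qb9lL.png]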

I was trying to add the matching 'Description' for each 'PART NUMBER' to the sales table, following the examples on the pandas website. My code:

sales.join(part_table, on='PART NUMBER')

But I got this error:

ValueError: columns overlap but no suffix specified: Index([u'PART NUMBER'], dtype='object')

Can someone explain what this error means and how to solve it?

Many thanks!

549

asked Sep 24 '14 22:09

Yumi

1 Answer

I think you want to do a merge rather than a join:

sales.merge(part_table)

Here's an example with two small DataFrames:

In [11]: dfa = pd.DataFrame([[1, 2], [3, 4]], columns=['A', 'B'])

In [12]: dfb = pd.DataFrame([[1, 'a'], [3, 'b'], [3, 'c']], columns=['A', 'C'])

In [13]: dfa.join(dfb, on=['A'])
ValueError: columns overlap but no suffix specified: Index([u'A'], dtype='object')

In [14]: dfa.merge(dfb)
Out[14]:
   A  B  C
0  1  2  a
1  3  4  b
2  3  4  c
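Applied to the tables in the question, the same idea might look like the sketch below. The data is made up for illustration (the real tables are only shown as images), and the QTY column is hypothetical. Naming the key with on and using how='left' keeps every sales row; validate='many_to_one' is an optional check that 'PART NUMBER' really is unique in part_table.

```python
import pandas as pd

# Hypothetical stand-ins for the question's tables (values are made up).
sales = pd.DataFrame({
    "PART NUMBER": ["P1", "P2", "P1", "P3"],
    "QTY": [5, 2, 7, 1],
})
part_table = pd.DataFrame({
    "PART NUMBER": ["P1", "P2", "P3"],
    "Description": ["Bolt", "Nut", "Washer"],
})

# Left merge keeps every sales row and looks up its description.
# validate raises pandas.errors.MergeError if "PART NUMBER" is
# not unique in part_table.
result = sales.merge(
    part_table, on="PART NUMBER", how="left", validate="many_to_one"
)
print(result)
```

Without on, merge falls back to the columns the two frames have in common, which is why the bare sales.merge(part_table) above also works when 'PART NUMBER' is the only shared column.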

It's unclear from the docs whether this is intentional (I thought that on would be used as the column), but following the exception's message, if you add suffixes we can see what's going on:

In [21]: dfb.join(dfa, on=['A'], lsuffix='_a', rsuffix='_b')
Out[21]:
   A_a  C  A_b   B
0    1  a    3   4
1    3  b  NaN NaN
2    3  c  NaN NaN

In [22]: dfb.join(dfa, lsuffix='_a', rsuffix='_b')
Out[22]:
   A_a  C  A_b   B
0    1  a    1   2
1    3  b    3   4
2    3  c  NaN NaN

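So join is not matching column A against column A. With on, it looks up the calling frame's A values in the other frame's index (In [21]: A=1 finds index label 1, and there is no index label 3, hence the NaNs). Without on, it simply joins index on index (In [22]).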
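If join is preferred over merge, one way to get a column-to-column match is to move the key into the other frame's index first, since join compares against that index. A minimal sketch, reusing the dfa and dfb frames from the example above:

```python
import pandas as pd

dfa = pd.DataFrame([[1, 2], [3, 4]], columns=["A", "B"])
dfb = pd.DataFrame([[1, "a"], [3, "b"], [3, "c"]], columns=["A", "C"])

# Put A into dfa's index so dfb's column A has something to match against.
# dfa.set_index("A") has only column B left, so no column names overlap
# and no suffixes are needed.
joined = dfb.join(dfa.set_index("A"), on="A")
print(joined)
```

For the question's tables the equivalent call would be sales.join(part_table.set_index('PART NUMBER'), on='PART NUMBER').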

131

answered Nov 15 '22 00:11

Andy Hayden
