How to do pandas equivalence of SQL outer join without a key

Tags:

In SQL, you can join two tables without a key so that all records of both tables merge with each other. If pandas.concat() or pandas.merge() or some other pandas syntax supported this, it could help me with one step of a problem I am trying to solve. I found an outer join option on the help documentation, but I could not find an exact syntax to do what I wanted (join all records without a key).

To explain this a little better:

import pandas as pd

lunchmenupairs2 = [["pizza", "italian"],["lasagna", "italian"],["orange", "fruit"]]
teamcuisinepreferences2 = [["ian", "*"]]

lunchLabels = ["Food", "Type"]
teamLabels = ["Person", "Type"]

df1 = pd.DataFrame.from_records(lunchmenupairs2, columns=lunchLabels)
df2 = pd.DataFrame.from_records(teamcuisinepreferences2, columns=teamLabels)

print(df1)
print(df2)

Outputs these tables:

      Food     Type
0    pizza  italian
1  lasagna  italian
2   orange    fruit

  Person     Type
0    ian        *

I want the final result of the merge to be:

  Person     Type Food     Type
0  ian        *   pizza     italian
1  ian        *   lasagna   italian
2  ian        *   orange    fruit

Then I can easily drop the columns I don't want and move to the next step in the code I am working on. This doesn't work:

merged_data = pd.merge(left=df2,right=df1, how='outer')

Is there a way to do this type of DataFrame merging?

567

asked May 26 '17 12:05

TMWP

2 Answers

You can add a column to both dfs with a constant value,

>>>df1['joincol'] = 1
>>>df2['joincol'] = 1
>>>pd.merge(left=df2,right=df1, on='joincol', how='outer')
  Person Type_x  joincol     Food   Type_y
0    ian      *        1    pizza  italian
1    ian      *        1  lasagna  italian
2    ian      *        1   orange    fruit

then delete it afterward when you remove your other undesired columns.

143

answered Oct 20 '22 08:10

EFT

This is possible with cross-join, introduced in Pandas 1.2.0. Simply run:

df1.merge(df2, how='cross')

answered Oct 20 '22 08:10

Ran Feldesh

Related questions
                            
                                Pandas map string to int based on value in a column
                            
                                ImportError: No module named 'matplotlib' -- Using Anaconda tensorflow environment
                            
                                How to do batching in Tensorflow Serving?
                            
                                How to specify text mode in Python's tempfile.TemporaryFile()?
                            
                                Defining a default argument as a global variable
                            
                                ValueError: Argument must be a dense tensor - Python and TensorFlow
                            
                                How to convert numbers represented as characters for short into numeric in Python
                            
                                Python pandas series: convert float to string, preserving nulls
                            
                                pandas equivalent for grep
                            
                                How to remove automated chart titles generated by Pandas
                            
                                which coefficients go to which class in multiclass logistic regression in scikit learn?
                            
                                How to test if \ symbol (backslash) is in a string?
                            
                                Is Session.run(fetches) guaranteed to execute its "fetches" arguments in-order?
                            
                                python function annotation in class return type is the class raise undefined
                            
                                Sympy allows definition of integer symbols but does not account for their behavior
                            
                                Select only rows that occur at specific time
                            
                                mypy error - incompatible type despite using 'Union'
                            
                                python3 dynamoDB Update_item do not work
                            
                                The accessing time of a numpy array is impacted much more by the last index compared to the second last
                            
                                understanding toolz use cases

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to do pandas equivalence of SQL outer join without a key

Tags:

python

merge

join

dataframe

TMWP

People also ask

2 Answers

EFT

Ran Feldesh

Recent Activity

Donate For Us