pandas join DataFrame force suffix?

Tags:

How can I force a suffix on a merge or join. I understand it's possible to provide one if there is a collision but in my case I'm merging df1 with df2 which doesn't cause any collision but then merging again on df2 which uses the suffixes but I would prefer for each merge to have a suffix because it gets confusing if I do different combinations as you could imagine.

316

asked Feb 05 '14 21:02

stgtscc

2 Answers

You could force a suffix on the actual DataFrame:

In [11]: df_a = pd.DataFrame([[1], [2]], columns=['A'])  In [12]: df_b = pd.DataFrame([[3], [4]], columns=['B'])  In [13]: df_a.join(df_b) Out[13]:     A  B 0  1  3 1  2  4

By appending to it's column's names:

In [14]: df_a.columns = df_a.columns.map(lambda x: str(x) + '_a')  In [15]: df_a Out[15]:     A_a 0    1 1    2

Now joins won't need the suffix correction, whether they collide or not:

In [16]: df_b.columns = df_b.columns.map(lambda x: str(x) + '_b')  In [17]: df_a.join(df_b) Out[17]:     A_a  B_b 0    1    3 1    2    4

answered Sep 19 '22 02:09

Andy Hayden

As of pandas version 0.24.2 you can add a suffix to column names on a DataFrame using the add_suffix method.

This makes a one-liner merge command with force-suffix more bearable, for example:

 df_merged = df1.merge(df2.add_suffix('_2'))

answered Sep 21 '22 02:09

Renier Botha

Related questions
                            
                                What is the necessity of plt.figure() in matplotlib?
                            
                                Pandas Apply Key Error
                            
                                How do I parse a yaml string with python?
                            
                                pandas pd.options.display.max_rows not working as expected
                            
                                C++ GDB Python Pretty Printing Tutorial?
                            
                                getting the opposite diagonal of a numpy array
                            
                                How to convert a string to an image?
                            
                                Python multiple repeat Error
                            
                                Finding the Values of the Arrow Keys in Python: Why are they triples?
                            
                                Why does numpy.linalg.solve() offer more precise matrix inversions than numpy.linalg.inv()?
                            
                                Using Boolean Flags in Python Click Library (command line arguments)
                            
                                Turtle module - Saving an image
                            
                                In Python argparse, is it possible to have paired --no-something/--something arguments?
                            
                                Why does right-clicking create an orange dot in the center of the circle?
                            
                                Celery - How to send task from remote machine?
                            
                                Django populate() isn't reentrant
                            
                                Installing iPython: "ImportError cannot import name path"?
                            
                                How To Plot Multiple Histograms On Same Plot With Seaborn
                            
                                "System error: new style getargs format but argument is not a tuple" when using cv2.blur
                            
                                Numpy: change max in each row to 1, all other numbers to 0

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pandas join DataFrame force suffix?

Tags:

python

pandas

stgtscc

People also ask

2 Answers

Andy Hayden

Renier Botha

Recent Activity

Donate For Us