How to merge/combine columns in pandas?

Tags:

I have a (example-) dataframe with 4 columns:

data = {'A': ['a', 'b', 'c', 'd', 'e', 'f'],
    'B': [42, 52, np.nan, np.nan, np.nan, np.nan],  
    'C': [np.nan, np.nan, 31, 2, np.nan, np.nan],
    'D': [np.nan, np.nan, np.nan, np.nan, 62, 70]}
df = pd.DataFrame(data, columns = ['A', 'B', 'C', 'D'])

    A   B       C       D
0   a   42.0    NaN     NaN
1   b   52.0    NaN     NaN
2   c   NaN     31.0    NaN
3   d   NaN     2.0     NaN
4   e   NaN     NaN     62.0
5   f   NaN     NaN     70.0

I would now like to merge/combine columns B, C, and D to a new column E like in this example:

data2 = {'A': ['a', 'b', 'c', 'd', 'e', 'f'],
    'E': [42, 52, 31, 2, 62, 70]}
df2 = pd.DataFrame(data2, columns = ['A', 'E'])

    A   E
0   a   42
1   b   52
2   c   31
3   d   2
4   e   62
5   f   70

I found a quite similar question here but this adds the merged colums B, C, and D at the end of column A:

0      a
1      b
2      c
3      d
4      e
5      f
6     42
7     52
8     31
9      2
10    62
11    70
dtype: object

Thanks for help.

332

asked Oct 04 '17 11:10

mati

1 Answers

Option 1
Using assign and drop

In [644]: cols = ['B', 'C', 'D']

In [645]: df.assign(E=df[cols].sum(1)).drop(cols, 1)
Out[645]:
   A     E
0  a  42.0
1  b  52.0
2  c  31.0
3  d   2.0
4  e  62.0
5  f  70.0

Option 2
Using assignment and drop

In [648]: df['E'] = df[cols].sum(1)

In [649]: df = df.drop(cols, 1)

In [650]: df
Out[650]:
   A     E
0  a  42.0
1  b  52.0
2  c  31.0
3  d   2.0
4  e  62.0
5  f  70.0

Option 3 Lately, I like the 3rd option.
Using groupby

In [660]: df.groupby(np.where(df.columns == 'A', 'A', 'E'), axis=1).first() #or sum max min
Out[660]:
   A     E
0  a  42.0
1  b  52.0
2  c  31.0
3  d   2.0
4  e  62.0
5  f  70.0

In [661]: df.columns == 'A'
Out[661]: array([ True, False, False, False], dtype=bool)

In [662]: np.where(df.columns == 'A', 'A', 'E')
Out[662]:
array(['A', 'E', 'E', 'E'],
      dtype='|S1')

122

answered Nov 15 '22 20:11

Zero

Related questions
                            
                                How do I escape forward slashes in python, so that open() sees my file as a filename to write, instead of a filepath to read?
                            
                                Is there a way to change the filemode for a logger object that is not configured using basicConfig?
                            
                                Python "bad interpreter" ERROR
                            
                                new column with coordinates using geopy pandas
                            
                                iPython - set up magic commands in configuration file
                            
                                How to change the number of axis ticks in seaborn plots
                            
                                numpy.core.multiarray failed to import
                            
                                Time Series Analysis - unevenly spaced measures - pandas + statsmodels
                            
                                When bulding a CNN, I am getting complaints from Keras that do not make sense to me.
                            
                                pandas read_csv column dtype is set to decimal but converts to string
                            
                                Split nested array values from Pandas Dataframe cell over multiple rows
                            
                                Pandas: get multiindex level as series
                            
                                Using tf.unpack() when first dimension of Variable is None
                            
                                Exclude unwanted tag on Beautifulsoup Python
                            
                                How to use paho mqtt client in django?
                            
                                What does `layer.get_weights()` return?
                            
                                Flier colors in boxplot with matplotlib
                            
                                python pandas sum by hour of day
                            
                                Copying MultiIndex dataframes with pd.read_clipboard?
                            
                                Django custom for complex Func (sql function)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to merge/combine columns in pandas?

Tags:

python

merge

pandas

dataframe

multiple-columns

mati

People also ask

1 Answers

Zero

Recent Activity

Donate For Us