Pandas: sum up multiple columns into one column without last column

Tags:

If I have a dataframe similar to this one

Apples   Bananas   Grapes   Kiwis
2        3         nan      1
1        3         7        nan
nan      nan       2        3

I would like to add a column like this

Apples   Bananas   Grapes   Kiwis   Fruit Total
2        3         nan      1        6
1        3         7        nan      11
nan      nan       2        3        5

I guess you could use df['Apples'] + df['Bananas'] and so on, but my actual dataframe is much larger than this. I was hoping a formula like df['Fruit Total']=df[-4:-1].sum could do the trick in one line of code. That didn't work however. Is there any way to do it without explicitly summing up all columns?

660

asked Feb 06 '17 08:02

Tuutsrednas

3 Answers

You can first select by iloc and then sum:

df['Fruit Total']= df.iloc[:, -4:-1].sum(axis=1)
print (df)
   Apples  Bananas  Grapes  Kiwis  Fruit Total
0     2.0      3.0     NaN    1.0          5.0
1     1.0      3.0     7.0    NaN         11.0
2     NaN      NaN     2.0    3.0          2.0

For sum all columns use:

df['Fruit Total']= df.sum(axis=1)

173

answered Oct 19 '22 10:10

jezrael

This may be helpful for beginners, so for the sake of completeness, if you know the column names (e.g. they are in a list), you can use:

column_names = ['Apples', 'Bananas', 'Grapes', 'Kiwis']
df['Fruit Total']= df[column_names].sum(axis=1)

This gives you flexibility about which columns you use as you simply have to manipulate the list column_names and you can do things like pick only columns with the letter 'a' in their name. Another benefit of this is that it's easier for humans to understand what they are doing through column names. Combine this with list(df.columns) to get the column names in a list format. Thus, if you want to drop the last column, all you have to do is:

column_names = list(df.columns)
df['Fruit Total']= df[column_names[:-1]].sum(axis=1)

answered Oct 19 '22 09:10

kelkka

It is possible to do it without knowing the number of columns and even without iloc:

print(df)
   Apples  Bananas  Grapes  Kiwis
0     2.0      3.0     NaN    1.0
1     1.0      3.0     7.0    NaN
2     NaN      NaN     2.0    3.0

cols_to_sum = df.columns[ : df.shape[1]-1]

df['Fruit Total'] = df[cols_to_sum].sum(axis=1)

print(df)
   Apples   Bananas Grapes  Kiwis   Fruit Total
0  2.0      3.0     NaN     1.0     5.0
1  1.0      3.0     7.0     NaN     11.0
2  NaN      NaN     2.0     3.0     5.0

answered Oct 19 '22 10:10

Ramon

Related questions
                            
                                Memory errors and list limits?
                            
                                assigning class variable as default value to class method argument
                            
                                Shortest way to get first item of `OrderedDict` in Python 3
                            
                                Importing installed package from script raises "AttributeError: module has no attribute" or "ImportError: cannot import name"
                            
                                What's the difference between "virtualenv" and "-m venv" in creating Virtual environments(Python)
                            
                                Python: finding uid/gid for a given username/groupname (for os.chown)
                            
                                Difference between the built-in pow() and math.pow() for floats, in Python?
                            
                                Slice indices must be integers or None or have __index__ method
                            
                                Unable log in to the django admin page with a valid username and password
                            
                                f-strings vs str.format()
                            
                                Visual Studio Code: Intellisense not working
                            
                                Parsing files (ics/ icalendar) using Python
                            
                                Best practice for setting the default value of a parameter that's supposed to be a list in Python?
                            
                                matching any character including newlines in a Python regex subexpression, not globally
                            
                                How are exceptions implemented under the hood? [closed]
                            
                                Using moviepy, scipy and numpy in amazon lambda
                            
                                how to merge two data frames based on particular column in pandas python?
                            
                                How to write an inline-comment in Python
                            
                                PIP Install Numpy throws an error "ascii codec can't decode byte 0xe2"
                            
                                Why is range(0) == range(2, 2, 2) True in Python 3?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas: sum up multiple columns into one column without last column

Tags:

python

pandas

sum