Pandas : vectorized operations on maximum values per row

Tags:

I have the following pandas dataframe df:

index        A    B    C
    1        1    2    3
    2        9    5    4
    3        7    12   8
    ...      ...  ...  ...

I want the maximum value of each row to remain unchanged, and all the other values to become -1. The output would thus look like this :

index        A    B    C
    1       -1   -1    3
    2        9   -1   -1
    3       -1    12  -1
    ...      ...  ...  ...

By using df.max(axis = 1), I get a pandas Series with the maximum values per row. However, I'm not sure how to use these maximums optimally to create the result I need. I'm looking for a vectorized, fast implementation.

569

asked Mar 06 '16 21:03

S Leon

1 Answers

Consider using where:

>>> df.where(df.eq(df.max(1), 0), -1)
       A   B  C
index          
1     -1  -1  3
2      9  -1 -1
3     -1  12 -1

Here df.eq(df.max(1), 0) is a boolean DataFrame marking the row maximums; True values (the maximums) are left untouched whereas False values become -1. You can also use a Series or another DataFrame instead of a scalar if you like.

The operation can also be done inplace (by passing inplace=True).

answered Nov 14 '22 22:11

Alex Riley

Related questions
                            
                                Merge multiple backups of the same table schema into 1 master table
                            
                                How many instances of app Gunicorn creates
                            
                                Adding Flask support to an existing Pycharm project
                            
                                Setting x axis label to bottom in openpyxl
                            
                                Django order by highest number of likes
                            
                                no module named fuzzywuzzy
                            
                                Pygame, set transparency on an image imported using convert_alpha()
                            
                                Change order of columns in Flask-Admin list view
                            
                                Numpy Dot Product of two 2-d arrays in numpy to get 3-d array
                            
                                Python: Selenium WebDriver find_elements_by_class_name
                            
                                Print dict with custom class as values wont call their string method?
                            
                                How to avoid Pandas Groupby key error when a GroupBy object might not contain a certain key
                            
                                Python Twitter Bot w/ Heroku Error: R10 Boot Timeout
                            
                                Why is statsmodels throwing an IndedxError when I try to fit a linear mixed-effect model?
                            
                                What is the name of the driver to connect to Azure SQL Database from pyodbc in Azure ML?
                            
                                How to use python multiprocessing module in django view
                            
                                Conditional field requirement with DjangoRestFramework serializer
                            
                                Save XLSX file to a specified location using OpenPyXL
                            
                                How to properly update requests in Ubuntu 14.04
                            
                                How can SciKit-Learn Random Forest sub sample size may be equal to original training data size?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas : vectorized operations on maximum values per row

Tags:

python

pandas

dataframe

vectorization

max

S Leon

People also ask

1 Answers

Alex Riley

Recent Activity

Donate For Us