Pandas: add a new column counting how often each person reached the day's highest score

I have a pandas DataFrame that contains every person's score for each day. I want to add one extra column that counts how many times each person had the highest score of the day. More than one person can share the top score on a given day, and some values are NaN.

import pandas as pd
import numpy as np

data = np.array([['','day1','day2','day3','day4','day5'],
                ['larry',1,4,7,3,5],
                ['niko',2,-1,3,6,4],
                ['tin',np.nan,5,5, 6,7]])
                
df = pd.DataFrame(data=data[1:,1:],
                  index=data[1:,0],
                  columns=data[0,1:])
print(df)

Output:

      day1 day2 day3 day4 day5
larry    1    4    7    3    5
niko     2   -1    3    6    4
tin    nan    5    5    6    7

The expected result is (larry: 1 time, niko: 2 times, tin: 3 times):

      times_of_top day1 day2 day3 day4 day5
larry            1    1    4    7    3    5
niko             2    2   -1    3    6    4
tin              3  nan    5    5    6    7

larry has the highest score on day3, so his times_of_top is 1.
niko has the highest score on day1 and day4 (tied with tin), so his times_of_top is 2.
tin has the highest score on day2, day4 (tied with niko) and day5, so his times_of_top is 3.

asked Feb 06 '21 10:02

Larry Cai

1 Answer

One way using pandas.DataFrame.stack and count:

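# The sample data are of object dtype (strings), so convert to float first
df = df.astype(float)

# Keep only the cells equal to the day's maximum, stack them into a Series
# indexed by (person, day), then count the entries per person.
# Series.count(level=0) was removed in pandas 2.0, so group by the index level instead.
df["times_of_top"] = df[df == df.max()].stack().groupby(level=0).count()
print(df)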

Output:

       day1  day2  day3  day4  day5  times_of_top
larry   1.0   4.0   7.0   3.0   5.0             1
niko    2.0  -1.0   3.0   6.0   4.0             2
tin     NaN   5.0   5.0   6.0   7.0             3
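
A possible alternative, not part of the original answer: the same count can be sketched without stack by comparing each column to its maximum and summing the resulting booleans row by row. The sample frame is rebuilt here directly as floats (an assumption made so the snippet is self-contained), and the names is_top and times_of_top are chosen only for illustration.

```python
import numpy as np
import pandas as pd

# Same sample scores as in the question, built directly as floats
df = pd.DataFrame(
    {
        "day1": [1, 2, np.nan],
        "day2": [4, -1, 5],
        "day3": [7, 3, 5],
        "day4": [3, 6, 6],
        "day5": [5, 4, 7],
    },
    index=["larry", "niko", "tin"],
    dtype=float,
)

# df.max() returns the top score per day (NaN is skipped by default);
# eq() marks every cell that equals its column's top score
is_top = df.eq(df.max())

# Summing the booleans across each row counts the days a person was on top
times_of_top = is_top.sum(axis=1)

# Put the new column first, as in the expected result of the question
df.insert(0, "times_of_top", times_of_top)
print(df)
```

One difference from the stack approach: a person who never reaches a top score gets 0 here, because every row is always present in the boolean frame.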

answered Oct 18 '22 14:10

Chris

Tags: python, pandas, dataframe, numpy
