how to find the most frequent value of each row of a dataframe? For example:
In [14]: df
Out[14]:
a b c
0 2 3 3
1 1 1 2
2 7 7 8
return: [3,1,7]
value_counts() function can be used to obtain counts, and then the idxmax() function can be used to obtain the value with the most counts.
mean(axis = 1) , it will return the mean of all the indices or rows. So, the number of values return by the function will be equal to the number of rows/indices. df. mean(axis = 1) will return a Pandas series with mean of all the rows.
We can find the number of occurrences of elements using the value_counts() method. From that the most frequent element can be accessed by using the mode() method.
try .mode() method:
In [88]: df
Out[88]:
a b c
0 2 3 3
1 1 1 2
2 7 7 8
In [89]: df.mode(axis=1)
Out[89]:
0
0 3
1 1
2 7
From docs:
Gets the mode(s) of each element along the axis selected. Adds a row for each mode per label, fills in gaps with nan.
Note that there could be multiple values returned for the selected axis (when more than one item share the maximum frequency), which is the reason why a dataframe is returned. If you want to impute missing values with the mode in a dataframe df, you can just do this: df.fillna(df.mode().iloc[0])
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With