How to fillna by groupby outputs in pandas?

1 Answer

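df['D'].fillna(df.groupby(['A','B','C'])['D'].transform('mean')) fills each NaN in D with the mean of its (A, B, C) group, and is faster than doing the same thing with groupby(...).apply(...). transform('mean') returns a Series aligned to the original index, so fillna can use it directly.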

In [2400]: df
Out[2400]:
   A  B  C    D
0  1  1  1  1.0
1  1  1  1  NaN
2  1  1  1  3.0
3  3  3  3  5.0

In [2401]: df['D'].fillna(df.groupby(['A','B','C'])['D'].transform('mean'))
Out[2401]:
0    1.0
1    2.0
2    3.0
3    5.0
Name: D, dtype: float64

In [2402]: df['D'] = df['D'].fillna(df.groupby(['A','B','C'])['D'].transform('mean'))

In [2403]: df
Out[2403]:
   A  B  C    D
0  1  1  1  1.0
1  1  1  1  2.0
2  1  1  1  3.0
3  3  3  3  5.0
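
For anyone who wants to run the example outside IPython, here is a minimal self-contained sketch of the same steps. The frame is rebuilt from the output shown above; the variable names group_means and filled are my own and not part of the original session.

```python
import numpy as np
import pandas as pd

# Rebuild the small example frame from the session above.
df = pd.DataFrame({
    "A": [1, 1, 1, 3],
    "B": [1, 1, 1, 3],
    "C": [1, 1, 1, 3],
    "D": [1.0, np.nan, 3.0, 5.0],
})

# transform('mean') broadcasts each group's mean back onto the
# original rows, so the result has the same index as df.
group_means = df.groupby(["A", "B", "C"])["D"].transform("mean")

# fillna aligns on the index and only replaces the missing values.
filled = df["D"].fillna(group_means)

print(filled.tolist())  # [1.0, 2.0, 3.0, 5.0]
```

Note that fillna returns a new Series here, which is why the session above assigns the result back to df['D'].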

Timing details

In [2396]: df.shape
Out[2396]: (10000, 4)

In [2398]: %timeit df['D'].fillna(df.groupby(['A','B','C'])['D'].transform('mean'))
100 loops, best of 3: 3.44 ms per loop


In [2397]: %timeit df.groupby(['A','B','C'])['D'].apply(lambda x: x.fillna(x.mean()))
100 loops, best of 3: 5.34 ms per loop
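
The 10,000-row frame used for these timings is not shown, so the following is only a rough sketch for reproducing the comparison on synthetic data. The data generation, the helper names fill_with_transform and fill_with_apply, and the use of group_keys=False (to keep the apply result on the original index in recent pandas versions) are all my assumptions. Absolute timings will differ by machine and pandas version.

```python
import timeit

import numpy as np
import pandas as pd

# Synthetic stand-in for the (10000, 4) frame; the real data is not shown.
rng = np.random.default_rng(0)
n = 10_000
df = pd.DataFrame({
    "A": rng.integers(0, 5, n),
    "B": rng.integers(0, 5, n),
    "C": rng.integers(0, 5, n),
    "D": rng.normal(size=n),
})
df.loc[rng.random(n) < 0.1, "D"] = np.nan  # roughly 10% missing values

keys = ["A", "B", "C"]

def fill_with_transform():
    # Hypothetical helper: the transform-based approach from the answer.
    return df["D"].fillna(df.groupby(keys)["D"].transform("mean"))

def fill_with_apply():
    # Hypothetical helper: the apply-based approach being compared against.
    # group_keys=False keeps the original row index on the result.
    return df.groupby(keys, group_keys=False)["D"].apply(
        lambda x: x.fillna(x.mean())
    )

via_transform = fill_with_transform()
via_apply = fill_with_apply().sort_index()

# Both approaches should produce the same filled column.
pd.testing.assert_series_equal(via_transform, via_apply)

t_transform = timeit.timeit(fill_with_transform, number=10)
t_apply = timeit.timeit(fill_with_apply, number=10)
print(f"transform: {t_transform / 10 * 1e3:.2f} ms per call")
print(f"apply:     {t_apply / 10 * 1e3:.2f} ms per call")
```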

answered Oct 04 '22 15:10

Zero

Tags:

python

pandas

Abhisek Dash
