I have a very simple Pandas Series: <pre class="prettyprint"><code>xx = pd.Series([1, 2, np.nan, np.nan, 3, 4, 5]) </code></pre> If I run this I get what I want: <pre class="prettyprint"><code>>>> xx.rolling(3,1).mean() 0 1.0 1 1.5 2 1.5 3 2.0 4 3.0 5 3.5 6 4.0 </code></pre> But if I have to use <code>.apply()</code> I cannot get it to work by ignoring <code>NaN</code>s in the <code>mean()</code> operation: <pre class="prettyprint"><code>>>> xx.rolling(3,1).apply(np.mean) 0 1.0 1 1.5 2 NaN 3 NaN 4 NaN 5 NaN 6 4.0 >>> xx.rolling(3,1).apply(lambda x : np.mean(x)) 0 1.0 1 1.5 2 NaN 3 NaN 4 NaN 5 NaN 6 4.0 </code></pre> What should I do in order to both use <code>.apply()</code> and have the result in the first output? My actual problem is more complicated that I have to use <code>.apply()</code> to realize but it boils down to this issue.

You can use np.nanmean() <pre class="prettyprint"><code>xx.rolling(3,1).apply(lambda x : np.nanmean(x)) Out[59]: 0 1.0 1 1.5 2 1.5 3 2.0 4 3.0 5 3.5 6 4.0 dtype: float64 </code></pre> If you have to process the nans explicitly, you can do: <pre class="prettyprint"><code>xx.rolling(3,1).apply(lambda x : np.mean(x[~np.isnan(x)])) Out[94]: 0 1.0 1 1.5 2 1.5 3 2.0 4 3.0 5 3.5 6 4.0 dtype: float64 </code></pre>

pandas rolling apply to allow nan

Tags:

python

pandas

nan

mean

I have a very simple Pandas Series:

xx = pd.Series([1, 2, np.nan, np.nan, 3, 4, 5])

If I run this I get what I want:

>>> xx.rolling(3,1).mean()
0    1.0
1    1.5
2    1.5
3    2.0
4    3.0
5    3.5
6    4.0

But if I have to use .apply() I cannot get it to work by ignoring NaNs in the mean() operation:

>>> xx.rolling(3,1).apply(np.mean)
0    1.0
1    1.5
2    NaN
3    NaN
4    NaN
5    NaN
6    4.0

>>> xx.rolling(3,1).apply(lambda x : np.mean(x))
0    1.0
1    1.5
2    NaN
3    NaN
4    NaN
5    NaN
6    4.0

What should I do in order to both use .apply() and have the result in the first output? My actual problem is more complicated that I have to use .apply() to realize but it boils down to this issue.

951

asked Jun 06 '17 23:06

Zhang18

1 Answers

You can use np.nanmean()

xx.rolling(3,1).apply(lambda x : np.nanmean(x))
Out[59]: 
0    1.0
1    1.5
2    1.5
3    2.0
4    3.0
5    3.5
6    4.0
dtype: float64

If you have to process the nans explicitly, you can do:

xx.rolling(3,1).apply(lambda x : np.mean(x[~np.isnan(x)]))
Out[94]: 
0    1.0
1    1.5
2    1.5
3    2.0
4    3.0
5    3.5
6    4.0
dtype: float64

answered Oct 08 '22 06:10

Allen

Related questions
                            
                                How do I flatten a pandas dataframe keeping index and column names
                            
                                rename certain value in pandas series
                            
                                pdfkit - An A4 html page does not print into an A4 pdf
                            
                                How to install graphviz in Ubuntu 15 to plot a decision tree for XGBoost?
                            
                                Index JSON files in elasticsearch using Python?
                            
                                Python Gevent Pywsgi server with ssl
                            
                                How to wait for RxPy parallel threads to complete
                            
                                Apply migrations and models from all the apps
                            
                                Apply seaborn heatmap columnwise on pandas dataframe
                            
                                Calculate histograms along axis
                            
                                How to shuffle groups of rows of a Pandas dataframe?
                            
                                Installing a python package that is not available in anaconda (smtplib)
                            
                                How do I get a per mille sign in my axis title using Latex in matplotlib?
                            
                                Text to Binary in Python
                            
                                How to check if there's any odd/even numbers in an Iterable (e.g. list/tuple)?
                            
                                How to Install/add jdk 7 in Docker Container
                            
                                speed up pandas apply or using map
                            
                                What is the most efficient way to compute a Kronecker Product in TensorFlow?
                            
                                pandas dataframe index match
                            
                                Collapsing rows in a Pandas dataframe if all rows have only one value in their columns

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With