I am resampling a Pandas TimeSeries. The timeseries consist of binary values (it is a categorical variable) with no missing values, but after resampling NaNs appear. How is this possible? I can't post any example data here since it is sensitive info, but I create and resample the series as follows: <pre class="prettyprint"><code>series = pd.Series(data, ts) series_rs = series.resample('60T', how='mean') </code></pre>

Please note that fill_method has now been deprecated. <code>resample()</code> now returns a resampling object on which you can perform operations just like a groupby object. common downsampling operations: <pre class="prettyprint"><code>.mean() .sum() .agg() .apply() </code></pre> upsampling operations: <pre class="prettyprint"><code>.ffill() .bfill() </code></pre> See the whats-new message in the documentation https://pandas.pydata.org/pandas-docs/stable/whatsnew.html#whatsnew-0180-breaking-resample so the example would become <pre class="prettyprint"><code>series_rs = series.resample('60T').mean() </code></pre>

Pandas TimeSeries resample produces NaNs

Tags:

python

pandas

time-series

resampling

I am resampling a Pandas TimeSeries. The timeseries consist of binary values (it is a categorical variable) with no missing values, but after resampling NaNs appear. How is this possible?

I can't post any example data here since it is sensitive info, but I create and resample the series as follows:

series = pd.Series(data, ts)
series_rs = series.resample('60T', how='mean')

725

asked Oct 27 '15 09:10

Peter Lenaers

2 Answers

upsampling converts to a regular time interval, so if there are no samples you get NaN.

You can fill missing values backward by fill_method='bfill' or for forward - fill_method='ffill' or fill_method='pad'.

import pandas as pd

ts = pd.date_range('1/1/2015', periods=10, freq='100T')
data = range(10)
series = pd.Series(data, ts)
print series
#2015-01-01 00:00:00    0
#2015-01-01 01:40:00    1
#2015-01-01 03:20:00    2
#2015-01-01 05:00:00    3
#2015-01-01 06:40:00    4
#2015-01-01 08:20:00    5
#2015-01-01 10:00:00    6
#2015-01-01 11:40:00    7
#2015-01-01 13:20:00    8
#2015-01-01 15:00:00    9
#Freq: 100T, dtype: int64
series_rs = series.resample('60T', how='mean')
print series_rs
#2015-01-01 00:00:00     0
#2015-01-01 01:00:00     1
#2015-01-01 02:00:00   NaN
#2015-01-01 03:00:00     2
#2015-01-01 04:00:00   NaN
#2015-01-01 05:00:00     3
#2015-01-01 06:00:00     4
#2015-01-01 07:00:00   NaN
#2015-01-01 08:00:00     5
#2015-01-01 09:00:00   NaN
#2015-01-01 10:00:00     6
#2015-01-01 11:00:00     7
#2015-01-01 12:00:00   NaN
#2015-01-01 13:00:00     8
#2015-01-01 14:00:00   NaN
#2015-01-01 15:00:00     9
#Freq: 60T, dtype: float64
series_rs = series.resample('60T', how='mean', fill_method='bfill')
print series_rs
#2015-01-01 00:00:00    0
#2015-01-01 01:00:00    1
#2015-01-01 02:00:00    2
#2015-01-01 03:00:00    2
#2015-01-01 04:00:00    3
#2015-01-01 05:00:00    3
#2015-01-01 06:00:00    4
#2015-01-01 07:00:00    5
#2015-01-01 08:00:00    5
#2015-01-01 09:00:00    6
#2015-01-01 10:00:00    6
#2015-01-01 11:00:00    7
#2015-01-01 12:00:00    8
#2015-01-01 13:00:00    8
#2015-01-01 14:00:00    9
#2015-01-01 15:00:00    9
#Freq: 60T, dtype: float64

178

answered Sep 28 '22 05:09

jezrael

Please note that fill_method has now been deprecated. resample() now returns a resampling object on which you can perform operations just like a groupby object.

common downsampling operations:

.mean()
.sum()
.agg()
.apply()

upsampling operations:

.ffill()
.bfill()

See the whats-new message in the documentation https://pandas.pydata.org/pandas-docs/stable/whatsnew.html#whatsnew-0180-breaking-resample

so the example would become

series_rs = series.resample('60T').mean()

answered Sep 28 '22 03:09

Bart Bisschops

Related questions
                            
                                I get a error when using HoughCircles with Python OpenCV that a module is missing [duplicate]
                            
                                Ruby hash equivalent to Python dict setdefault
                            
                                PyEnchant: spellchecking block of text with a personal word list
                            
                                Django Custom View Decorators
                            
                                Ignore certain packages and their dependencies with pip freeze
                            
                                Celery Task with countdown
                            
                                Issue with createsuperuser when implementing custom user model
                            
                                PySpark broadcast variables from local functions
                            
                                Python - Start firefox with Selenium in private mode [duplicate]
                            
                                How numpy.cov() function is implemented?
                            
                                SQLAlchemy decimal precision
                            
                                u'rest_framework' is not a registered namespace
                            
                                Python 3 doesn't have the file function
                            
                                Associate "external' class model with flask sqlalchemy
                            
                                "Too many open files" error when opening and loading images in Pillow
                            
                                Django Rest Framework: `get_serializer_class` called several times, with wrong value of request method
                            
                                Interpolate (or extrapolate) only small gaps in pandas dataframe
                            
                                SQLAlchemy: Return a record filtered by max value of a column
                            
                                Column and row dimensions in OpenPyXL are always None
                            
                                Hierarchic pie/donut chart from Pandas DataFrame using bokeh or matplotlib

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With