pandas resampling without performing statistics

Tags:

aggregation

I have a five minute dataframe:

rng = pd.date_range('1/1/2011', periods=60, freq='5Min')
df = pd.DataFrame(np.random.randn(60, 4), index=rng, columns=['A', 'B', 'C', 'D'])

                            A         B         C         D
2011-01-01 00:00:00  1.287045 -0.621473  0.482130  1.886648
2011-01-01 00:05:00  0.402645 -1.335942 -0.609894 -0.589782
2011-01-01 00:10:00 -0.311789  0.342995 -0.875089 -0.781499
2011-01-01 00:15:00  1.970683  0.471876  1.042425 -0.128274
2011-01-01 00:20:00 -1.900357 -0.718225 -3.168920 -0.355735
2011-01-01 00:25:00  1.128843 -0.097980  1.130860 -1.045019
2011-01-01 00:30:00 -0.261523  0.379652 -0.385604 -0.910902

I would like to resample only the data on the 15 minute interval, but without aggregating into a statistic (I dont want the mean,median,stdev).I want to subsample and get the actual data on the 15 minute interval.Is there a builtin method to do this?

My output would be:

                            A         B         C         D                 
2011-01-01 00:00:00  1.287045 -0.621473  0.482130  1.886648                 
2011-01-01 00:15:00  1.970683  0.471876  1.042425 -0.128274                 
2011-01-01 00:30:00 -0.261523  0.379652 -0.385604 -0.910902

305

asked Feb 09 '16 22:02

John Saraceno

2 Answers

You can resample to 15 min and take the 'first' of each group:

In [40]: df.resample('15min').first()
Out[40]:
                            A         B         C         D
2011-01-01 00:00:00 -0.415637 -1.345454  1.151189 -0.834548
2011-01-01 00:15:00  0.221777 -0.866306  0.932487 -1.243176
2011-01-01 00:30:00 -0.690039  0.778672 -0.527087 -0.156369
...

Another way to do this is constructing the new desired index and do a reindex (this is a bit more work in this case, but in the case of a irregular time series this ensures it takes the data at exactly each 15min):

In [42]: new_rng = pd.date_range('1/1/2011', periods=20, freq='15min')

In [43]: df.reindex(new_rng)
Out[43]:
                            A         B         C         D
2011-01-01 00:00:00 -0.415637 -1.345454  1.151189 -0.834548
2011-01-01 00:15:00  0.221777 -0.866306  0.932487 -1.243176
2011-01-01 00:30:00 -0.690039  0.778672 -0.527087 -0.156369
...

175

answered Nov 06 '22 00:11

joris

Function asfreq() doesn't do any aggregation:

df.asfreq('15min')

answered Nov 05 '22 23:11

Borja

Related questions
                            
                                How to get the column names of a DataFrame GroupBy object?
                            
                                Pandas: How to fill null values with mean of a groupby?
                            
                                Return index value as string
                            
                                self-join with Pandas
                            
                                How to label bubble chart/scatter plot with column from pandas dataframe?
                            
                                Pandas pd.cut() - binning datetime column / series
                            
                                How to name a Pandas Series
                            
                                Converting HDF5 to Parquet without loading into memory
                            
                                Pandas add column from one dataframe to another based on a join
                            
                                How can I tell if a dataframe is of mixed type?
                            
                                How to create a square dataframe/matrix given 3 columns - Python
                            
                                Best way to add dictionary to dataframe
                            
                                If condition with a dataframe
                            
                                How to get DataFrame.pct_change to calculate monthly change on daily price data?
                            
                                format output data in pandas to_html
                            
                                Pandas dataframe: Check if data is monotonically decreasing
                            
                                How can I concatenate a Series onto a DataFrame with Pandas?
                            
                                Sorted bar charts with pandas/matplotlib or seaborn
                            
                                Use first row as column names? Pandas read_html
                            
                                Merging and subtracting DataFrame columns in pandas?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With