I have minute-based OHLCV data for the opening range/first hour (9:30-10:30 AM EST). I'm looking to resample this data into a single 60-minute bar so I can then calculate the range.
When I call dataframe.resample() on the data I get two rows, and the first row starts at 9:00 AM. I want a single row that starts at 9:30 AM.
Note: the data itself begins at 9:30.
Edit: Adding code:
# Extract data for regular trading hours (rth) from the 24 hour data set
rth = data.between_time(start_time='09:30:00', end_time='16:15:00', include_end=False)

# Extract data for extended trading hours (eth) from the 24 hour data set
eth = data.between_time(start_time='16:30:00', end_time='09:30:00', include_end=False)

# Extract data for initial balance (rth) from the 24 hour data set
initial_balance = data.between_time(start_time='09:30:00', end_time='10:30:00', include_end=False)
I got stuck when I tried to separate the opening range by individual date and get the initial balance:
conversion = {'Open': 'first', 'High': 'max', 'Low': 'min', 'Close': 'last', 'Volume': 'sum'}

sample = data.between_time(start_time='09:30:00', end_time='10:30:00', include_end=False)
sample = sample.ix['2007-05-07']
sample.tail()
sample.resample('60Min', how=conversion)
By default resample starts at the beginning of the hour. I would like it to start from where the data starts.
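A minimal sketch of the behaviour using synthetic one-minute bars as a stand-in for my actual feed (column names match the conversion dict above):

import pandas as pd
import numpy as np

# Synthetic one-minute OHLCV bars for 09:30-10:29
idx = pd.date_range('2007-05-07 09:30', periods=60, freq='1Min')
px = np.linspace(100.0, 101.0, 60)
sample = pd.DataFrame({'Open': px, 'High': px + 0.1, 'Low': px - 0.1,
                       'Close': px, 'Volume': 1000}, index=idx)

conversion = {'Open': 'first', 'High': 'max', 'Low': 'min',
              'Close': 'last', 'Volume': 'sum'}

# Default hourly bins are anchored to the top of the hour, so this returns
# two rows labelled 09:00 and 10:00 instead of one row at 09:30
print(sample.resample('60Min').agg(conversion))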
You can use the base argument of resample:
sample.resample('60Min', how=conversion, base=30)
From the resample docs:

base : int, default 0
    For frequencies that evenly subdivide 1 day, the “origin” of the aggregated intervals. For example, for ‘5min’ frequency, base could range from 0 through 4. Defaults to 0.
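In newer pandas (1.1 and later) how= and base= are removed/deprecated; assuming the same sample and conversion objects, a roughly equivalent call uses .agg() together with offset (or origin):

# base=30 for a 60Min frequency corresponds to shifting the bin origin by 30 minutes
sample.resample('60Min', offset='30Min').agg(conversion)

# or anchor the bins to the first timestamp in the data itself
sample.resample('60Min', origin='start').agg(conversion)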
Value is the column you want to aggregate: resample the dataframe by minute on the Date column, aggregate with the mean, then drop the NaN rows.
import pandas as pd

data = [('2014-02-24 16:16:47.204000', 1.391424),
        ('2014-02-24 16:18:48.296000', 1.048143),
        ('2014-02-24 16:19:52.346000', -0.823974),
        ('2014-02-24 16:22:13.665000', -0.689560),
        ('2014-02-24 16:24:13.760000', -0.323252),
        ('2014-02-24 16:26:15.155000', -1.095331),
        ('2014-02-24 16:29:58.235000', -0.185681)]
df = pd.DataFrame(data, columns=['Date', 'Value'])
df['Date'] = pd.to_datetime(df['Date'])

# Resample to one-minute bins on the Date column, take the mean, drop empty bins
minutes = df.resample('1Min', on='Date').mean().dropna()
print(minutes)
output:
                        Value
Date
2014-02-24 16:16:00  1.391424
2014-02-24 16:18:00  1.048143
2014-02-24 16:19:00 -0.823974
2014-02-24 16:22:00 -0.689560
2014-02-24 16:24:00 -0.323252
2014-02-24 16:26:00 -1.095331
2014-02-24 16:29:00 -0.185681
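Tying this back to the opening-range question: the same aggregation idea combined with the base answer above gives the single 09:30 bar and its range. A sketch, assuming sample holds the 09:30-10:29 minute bars, conversion is the dict from the question, and a pandas version where base= is still accepted:

# One 60-minute bar anchored at 09:30, then the opening range = High - Low
initial_balance = sample.resample('60Min', base=30).agg(conversion)
opening_range = initial_balance['High'] - initial_balance['Low']
print(initial_balance)
print(opening_range)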