Pandas - how to convert RangeIndex into DateTimeIndex

Tags:

I have the following dataframe. It is OHLC one-minute data. Obviously I need the T column to become and index in order to use time-series functionallity

C H L O T V

13712 6873.0 6873.0 6873.0 6873.0 2018-01-13T17:17:00 799.448421 
13713 6878.0 6878.0 6875.0 6875.0 2018-01-13T17:18:00 1707.578666 
13714 6880.0 6880.0 6825.0 6825.0 2018-01-13T17:21:00 481.245707 
13715 6876.0 6876.0 6876.0 6876.0 2018-01-13T17:22:00 839.177283 
13716 6870.0 6878.0 6830.0 6878.0 2018-01-13T17:23:00 4336.830277

I used:

df['T'] = pd.to_datetime(df['T'])

So far so good! The T column is now recognised as a date

Check:

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 13717 entries, 1970-01-01 00:00:00 to 1970-01-01 00:00:00.000013716
Data columns (total 7 columns):
BV    13717 non-null float64
C     13717 non-null float64
H     13717 non-null float64
L     13717 non-null float64
O     13717 non-null float64
T     13717 non-null datetime64[ns]
V     13717 non-null float64
dtypes: datetime64[ns](1), float64(6)
memory usage: 857.3 KB

And now comes the fun and unexplainable part:

df.set_index(df['T'])


   C H L O T V
T

2018-01-03 17:27:00 5710.0 5710.0 5663.0 5667.0 2018-01-03 17:27:00 3863.030204 
2018-01-03 17:28:00 5704.0 5710.0 5663.0 5710.0 2018-01-03 17:28:00 1208.627542 
2018-01-03 17:29:00 5699.0 5699.0 5663.0 5663.0 2018-01-03 17:29:00 1755.123688

Still looks good, but when I check the type of index I get:

RangeIndex(start=0, stop=13717, step=1)

And now if I try:

df.index = pd.to_datetime(df.index)

I end up with:

DatetimeIndex([          '1970-01-01 00:00:00',
               '1970-01-01 00:00:00.000000001',
               '1970-01-01 00:00:00.000000002',
               '1970-01-01 00:00:00.000000003',
               '1970-01-01 00:00:00.000000004' and so on...

which is evidently wrong.

The questions are: 1. Why don't I get the normal DateTimeIndex if I am converting a date to index?

How can I convert that RangeIndex to DateTimeIndex with correct timestamps?

Thanks!

717

asked Jan 14 '18 09:01

rioZg

1 Answers

If input data are csv the simpliest is use parameters parse_dates and index_col in read_csv:

df = pd.read_csv(file, parse_dates=['T'], index_col=['T'])

If not, then use your solution, don't forget assign back output of set_index and if need drop column T also after DatetimeIndex use T instead df['T']:

df['T'] = pd.to_datetime('T')
df = df.set_index('T')

#alternative solution
#df.set_index('T', inplace=True)

Why don't I get the normal DateTimeIndex if I am converting a date to index?

Because your index is default (0,1,2..), so df.index = pd.to_datetime(df.index) parse integerss like ns and get weird datetimes.

171

answered Sep 29 '22 22:09

jezrael

Related questions
                            
                                Cannot load CLoader with pyyaml
                            
                                How to add a toctree entry?
                            
                                Triplet model for image retrieval from the Keras pretrained network
                            
                                PyGTK Window always on top of all 'always on top` windows
                            
                                Set default content_type for Flask test client
                            
                                Django cannot determine queryset for chaining one-to-many with one-to-one relationship
                            
                                Subparsers.add_parser TypeError: __init__() got an unexpected keyword argument 'prog'
                            
                                Pandas update and add rows one dataframe with key column in another dataframe
                            
                                Can pygame detect the video mode of the screen?
                            
                                Put request working in curl, but not in Python
                            
                                Python OLS model: __init__() missing 1 required positional argument: 'endog'
                            
                                how to use after_request in flask to close database connection and python?
                            
                                Regular Expression Matching Stock Ticker
                            
                                Bokeh datetime axis, control of minor ticks
                            
                                How to send a message on the bot startup to every server it is in?
                            
                                How to safely count dictionary keys in python [duplicate]
                            
                                How Python execute [list] * num? what's time complexity and memory complexity?
                            
                                Pandas find Duplicates in cross values
                            
                                How to run django-pytest on a project which imports settings from environment variables defined in manage.py
                            
                                What is the function in TensorFlow that is equivalent to expand() in PyTorch?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas - how to convert RangeIndex into DateTimeIndex

Tags:

python

indexing

pandas

time-series

rioZg

People also ask

1 Answers

jezrael

Recent Activity

Donate For Us