Say I have a DataFrame <code>df</code> with date as index and some values. How can I select the rows where the date is larger than some value <code>x</code>? I know I can convert the index to a column and then do the select <code>df[df['date']>x]</code>, but is that slower than doing the operation on the index?


Python Pandas Select Index where index is larger than x

2 Answers

Example of selecting rows from a DataFrame by filtering on the index:

    from numpy.random import randn
    from pandas import DataFrame
    from datetime import timedelta as td
    import dateutil.parser

    d = dateutil.parser.parse("2014-01-01")
    df = DataFrame(randn(6,2), columns=list('AB'), index=[d + td(days=x) for x in range(1,7)])

    In [1]: df
    Out[1]:
                       A         B
    2014-01-02 -1.172285  1.706200
    2014-01-03  0.039511 -0.320798
    2014-01-04 -0.192179 -0.539397
    2014-01-05 -0.475917 -0.280055
    2014-01-06  0.163376  1.124602
    2014-01-07 -2.477812  0.656750

    In [2]: df[df.index > dateutil.parser.parse("2014-01-04")]
    Out[2]:
                       A         B
    2014-01-05 -0.475917 -0.280055
    2014-01-06  0.163376  1.124602
    2014-01-07 -2.477812  0.656750
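To tie this back to the question, here is a minimal sketch comparing the index mask with the "convert the index to a column" route. The data, the column name `date`, and the variable names are made up for illustration; the values are fixed rather than random so the result is reproducible.

```python
import pandas as pd

# Hypothetical data: six daily rows with fixed values.
idx = pd.date_range("2014-01-02", periods=6, freq="D")
df = pd.DataFrame({"A": range(6), "B": range(10, 16)}, index=idx)

x = pd.Timestamp("2014-01-04")

# Boolean mask built directly from the index (strictly later than x).
later = df[df.index > x]

# The column route from the question: move the index into a "date" column,
# filter on it, then restore the index. This selects the same rows.
as_column = df.rename_axis("date").reset_index()
via_column = as_column[as_column["date"] > x].set_index("date")

print(later)
print(via_column)
```

Both routes select the same three rows, so the extra `reset_index` step is not needed just to filter.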


answered Sep 22 '22 03:09

Datageek

The existing answer is correct. However, if we are selecting based on the index, the second method from here would be faster:

    import pandas as pd

    # Set index
    df = df.set_index(df['date'])

    # Select observations between two datetimes
    # (note the spelling: pd.Timestamp, not pd.TimeStamp)
    df.loc[pd.Timestamp('2002-1-1 01:00:00'):pd.Timestamp('2002-1-1 04:00:00')]
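One thing to watch with label slicing: both endpoints are included, so `df.loc[x:]` is "greater than or equal to", not "greater than". The sketch below uses made-up hourly data and a hypothetical `value` column, and assumes the index is sorted. On a sorted index pandas can typically locate the slice bounds by searching rather than comparing every row, which is the likely reason for the speed difference, but no timing is measured here.

```python
import pandas as pd

# Hypothetical data: six hourly rows starting at midnight, already sorted.
idx = pd.date_range("2002-01-01 00:00:00", periods=6, freq=pd.Timedelta(hours=1))
df = pd.DataFrame({"value": range(6)}, index=idx)

start = pd.Timestamp("2002-01-01 01:00:00")
end = pd.Timestamp("2002-01-01 04:00:00")

# Label slice: both endpoints are included.
between = df.loc[start:end]

# Open-ended slice: everything from start onward, start included.
from_start = df.loc[start:]

# Strictly later than start: use a boolean mask instead.
strictly_after = df[df.index > start]

print(between["value"].tolist())
print(from_start["value"].tolist())
print(strictly_after["value"].tolist())
```

If the question's "larger than x" must exclude `x` itself, use the mask, or slice from the next timestamp after `x`.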

answered Sep 18 '22 03:09

ntg



user3092887
