How to subset row of condition with some of N rows before the condition meet , more faster than my code?

Tags:

Since my data set is time series where I have 30 different data frame and each of data frame have more than 10,000 number of rows. I want to examine, the trend before the temperature value goes below 40.

So, I want to subset row when the temperature value is below than 40 and I also want to subset 24 rows before the value become below 40.

I already try some code, the only code that working is below. But it take longer time to subset(like more than 10 minutes for one data frame). So, my code is bad. So I want to know code in python that can subset faster. Can you guys help me?

df=temperature_df.copy()
drop_temperature_df=pd.DataFrame()

# get the index during drop temperature
drop_temperature_index=np.array(df[df[temperature]<40].index)

# subset the data frame for 24 hours before drop temperature
for i,index in enumerate(drop_temperature_index):
    drop_temperature_df=drop_temperature_df.append(df.loc[index-24:index,:])

K['K_{}'.format(string)]=drop_temperature_df.copy() #save the subset data frame

So like data below, I have temperature point below 40 at 1/26/2018 0800 So, I want to subset the point below 40 with 24 rows before (1/25/2018 0800 until 1/26/2018 0800).

enter image description here

669

asked May 15 '19 00:05

nrmzmh

1 Answers

I think you can using the ffill with limit , and find the notnull index , slice the dataframe

yourdf=df[df.temperature.where(df.temperature<40).bfill(limit=24).notnull()].copy()

141

answered Oct 19 '22 01:10

BENY

Related questions
                            
                                Installing Graphviz for use with Python 3 on Windows 10
                            
                                How to do a random stratified sampling with Python (Not a train/test split)?
                            
                                Include submodules on click
                            
                                Protocol error, got "H" as reply type byte
                            
                                Altering traceback of a non-callable module
                            
                                Connect the nearest points in segment and label segment
                            
                                'Pip' recognized in Command Prompt but not in PyCharm terminal
                            
                                Can I add outlier detection and removal to Scikit learn Pipeline?
                            
                                Lowpass filter with a time-varying cutoff frequency, with Python
                            
                                Django: Transaction and select_for_update()
                            
                                Convert a C or numpy array to a Tkinter PhotoImage with a minimum number of copies
                            
                                Avoid global variables for unpicklable shared state among multiprocessing.Pool workers
                            
                                sqlalchemy.pool + psycopg2 timeout issue
                            
                                How to store and use HTML templates in serverless application on AWS Lambda (using AWS SAM)?
                            
                                Write-streaming to Google Cloud Storage in Python
                            
                                airflow dag failed... but all tasks succeeded
                            
                                Pandas dataframe type datetime64[ns] is not working in Hive/Athena
                            
                                OpenCV - How to get real world distance from a 2D image using a chessboard as reference
                            
                                How to make a progress bar on a web page for pandas operation
                            
                                Python Asyncio Task Cancellation

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to subset row of condition with some of N rows before the condition meet , more faster than my code?

Tags:

python

slice

pandas

conditional-statements

subset

nrmzmh

People also ask

1 Answers

BENY

Recent Activity

Donate For Us