How to pick hourly values from dataset?

Tags: r, dataset, subset

I need help with this issue:

I have a dataset of water level values recorded every 30 minutes, but I need only the hourly values. I tried the aggregate() function, but because its FUN argument is required, it forces me to summarise with something like mean or median, and I don't want to apply any statistical function.

Here is an example of my data frame:

06/16/2015 02:00:00 0.036068
06/16/2015 02:30:00 0.008916
06/16/2015 03:00:00 -0.008622
06/16/2015 03:30:00 -0.014057
06/16/2015 04:00:00 -0.011172
06/16/2015 04:30:00 0.002401
06/16/2015 05:00:00 0.029632
06/16/2015 05:30:00 0.061902002
06/16/2015 06:00:00 0.087366998
06/16/2015 06:30:00 0.105176002
06/16/2015 07:00:00 0.1153
06/16/2015 07:30:00 0.126197994
06/16/2015 08:00:00 0.144154996

asked May 01 '16 15:05

FernRay

2 Answers

We convert the 'RefDateTimeRef' column to POSIXct, extract the minutes and seconds with format, and compare the result with "00:00". This returns a logical vector, which we use to subset the rows.

df1[format(as.POSIXct(df1[,1], format = "%m/%d/%Y %H:%M"), "%M:%S")=="00:00",]
#     RefDateTimeRef  Data
#10 04/14/2016 09:00 0.153
#22 04/14/2016 08:00 0.148
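The timestamps in the question also carry seconds, so a minimal sketch of the same idea on that data might look as follows. The data frame name wl, its column names, and the choice of UTC as the time zone are assumptions made for illustration, not part of the original answer.

```r
# A few rows from the question; `wl` and its column names are hypothetical.
wl <- data.frame(
  DateTime = c("06/16/2015 02:00:00", "06/16/2015 02:30:00",
               "06/16/2015 03:00:00", "06/16/2015 03:30:00",
               "06/16/2015 04:00:00"),
  Level = c(0.036068, 0.008916, -0.008622, -0.014057, -0.011172),
  stringsAsFactors = FALSE
)

# Parse with an explicit seconds field. A fixed time zone (UTC is assumed here)
# avoids NA values for local times skipped by daylight saving.
stamp <- as.POSIXct(wl$DateTime, format = "%m/%d/%Y %H:%M:%S", tz = "UTC")

# Keep the rows whose minute and second are both zero.
hourly <- wl[format(stamp, "%M:%S") == "00:00", ]
hourly$DateTime
```

On these five rows, the subset keeps the 02:00:00, 03:00:00 and 04:00:00 readings and drops the half-hour ones.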

Or with lubridate

library(lubridate)
df1[ minute(mdy_hm(df1[,1]))==0,]
#     RefDateTimeRef  Data
#10 04/14/2016 09:00 0.153
#22 04/14/2016 08:00 0.148

Or use sub to remove everything up to and including the hour, then compare what remains with "00" using == to get the logical vector and subset the rows.

df1[ sub(".*\\s+\\S{2}:", "", df1[,1])=="00",]

NOTE: I would advise against using sub or substr, as string matching can give incorrect answers when the timestamp format varies (for example, when it includes seconds).
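One way to see the risk is to run the same pattern on a timestamp with and without a seconds field. The sketch below uses two literal strings, one in the format of the answer's output and one in the format of the question's data; the variable names are made up for illustration.

```r
pattern <- ".*\\s+\\S{2}:"   # the pattern used in the sub() call above

# Timestamp without seconds: only the minutes remain after the substitution.
no_sec <- sub(pattern, "", "04/14/2016 09:00")

# Timestamp with seconds: both minutes and seconds remain.
with_sec <- sub(pattern, "", "06/16/2015 02:00:00")

no_sec == "00"     # TRUE
with_sec == "00"   # FALSE, because with_sec is "00:00"
```

With the question's data, the comparison against "00" would therefore select no rows at all, even though the 02:00:00 reading is on the hour.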

answered Sep 22 '22 10:09

akrun

df <- read.table(text = '06/16/2015 02:00:00 0.036068
06/16/2015 02:30:00 0.008916
06/16/2015 03:00:00 -0.008622
06/16/2015 03:30:00 -0.014057
06/16/2015 04:00:00 -0.011172
06/16/2015 04:30:00 0.002401
06/16/2015 05:00:00 0.029632
06/16/2015 05:30:00 0.061902002
06/16/2015 06:00:00 0.087366998
06/16/2015 06:30:00 0.105176002
06/16/2015 07:00:00 0.1153
06/16/2015 07:30:00 0.126197994
06/16/2015 08:00:00 0.144154996')

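# read.table splits on whitespace, so the date and time land in separate columns
colnames(df) <- c('Date','Time','Value')

# TRUE where the "MM:SS" part of "HH:MM:SS" is "00:00";
# the comparison already returns a logical vector, so ifelse() is not needed
index <- substring(df$Time, 4) == "00:00"

# keep only the on-the-hour rows
final_df <- df[index, ]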
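A quick sanity check on a result like this is to confirm that the kept rows really are one hour apart. The following is only a sketch: it uses a few rows from the question, the name lv is hypothetical, and UTC is assumed as the time zone.

```r
# A few rows from the question; `lv` is a hypothetical name.
lv <- read.table(text = "06/16/2015 02:00:00 0.036068
06/16/2015 02:30:00 0.008916
06/16/2015 03:00:00 -0.008622
06/16/2015 03:30:00 -0.014057
06/16/2015 04:00:00 -0.011172",
  col.names = c("Date", "Time", "Value"), stringsAsFactors = FALSE)

# Same string test as above: characters 4 onward of "HH:MM:SS" are "MM:SS".
hourly <- lv[substring(lv$Time, 4) == "00:00", ]

# Rebuild full timestamps (UTC assumed) and measure the gaps between kept rows.
stamp <- as.POSIXct(paste(hourly$Date, hourly$Time),
                    format = "%m/%d/%Y %H:%M:%S", tz = "UTC")
gaps <- as.numeric(diff(stamp), units = "mins")

# TRUE when every consecutive pair of kept rows is exactly 60 minutes apart.
all(gaps == 60)
```

If the source data had missing readings, some gaps would exceed 60 minutes and the check would return FALSE, which is a cheap way to spot holes in the series.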

answered Sep 21 '22 10:09

Kunal Puri
