Combine rows based on ranges in a column

Tags:

r

I have a pretty large dataset where I have a column for time in seconds and I want to combine rows where the time is close (range: .1-.2 seconds apart) as a mean.

Here is an example of how the data looks:

Click to copy

BPM seconds
63.9 61.899
63.9 61.902
63.8 61.910
62.1 130.94
62.1 130.95
61.8 211.59
63.8 280.5
60.3 290.4

So I would want to combine the first 3 rows, then the 2 following after that, and the rest would stand alone. Meaning I would want the data to look like this:

Click to copy

BPM seconds
63.9 61.904
62.1 130.95
61.8 211.59
63.8 280.5
60.3 290.4

273

asked Oct 02 '18 07:10

Mary Smirnova

1 Answers

We need to create groups, this is the important bit, the rest is standard aggregation:

Click to copy

cumsum(!c(0, diff(df1$seconds)) < 0.2)
# [1] 0 0 0 1 1 2 3 4

Then aggregate using aggregate:

Click to copy

aggregate(df1[, 2], list(cumsum(!c(0, diff(df1$seconds)) < 0.2)), mean)
#   Group.1         x
# 1       0  61.90367
# 2       1 130.94500
# 3       2 211.59000
# 4       3 280.50000
# 5       4 290.40000

Or use dplyr:

Click to copy

library(dplyr)

df1 %>% 
  group_by(myGroup = cumsum(!c(0, diff(seconds)) < 0.2)) %>% 
  summarise(BPM = first(BPM),
            seconds = mean(seconds))
# # A tibble: 5 x 3
#   myGroup   BPM seconds
#     <int> <dbl>   <dbl>
# 1       0  63.9    61.9
# 2       1  62.1   131. 
# 3       2  61.8   212. 
# 4       3  63.8   280. 
# 5       4  60.3   290.

Reproducible example data:

Click to copy

df1 <- read.table(text = "BPM seconds
                  63.9 61.899
                  63.9 61.902
                  63.8 61.910
                  62.1 130.94
                  62.1 130.95
                  61.8 211.59
                  63.8 280.5
                  60.3 290.4", header = TRUE)

answered Sep 29 '22 14:09

zx8754

Related questions
                            
                                Using emoji in xaringan presentation
                            
                                R: Promise cannot find object
                            
                                Grouping n or more observations in data table without interrputing sequences of consecutive values
                            
                                Blogdown Website post - Hide date and/or title of post
                            
                                All-to-all setdiff on two numeric vectors with a numeric threshold for accepting matches
                            
                                Show letters as key glyphs for geom_text legend instead of default 'a'
                            
                                R: table function suprisingly slow
                            
                                How do I split a data frame among columns, say at every nth column?
                            
                                Can't figure out how to use conda environment after reticulate::use_condaenv(path)
                            
                                Implementing custom stopping metrics to optimize during training in H2O model directly from R
                            
                                How to make scatterplot points open a hyperlink using ggplotly - R
                            
                                Add column with percentage of matching words in two different columns (by row) in R
                            
                                How to output the columns with the maximum value
                            
                                Populating a "count matrix" with permutations of R data.table rows
                            
                                R: From GeoJson to DataFrame?
                            
                                How to Apply String Vector to Logical Vector
                            
                                data.table modifies parent environment / weird behavior with setDT
                            
                                R. plotly - padding or margin for graph inside Shinyapp?
                            
                                show multiple plots from ggplot on one page in r
                            
                                Fill down every other row with level above in tidyverse

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Combine rows based on ranges in a column

Tags:

r

Mary Smirnova

People also ask

1 Answers

zx8754

Recent Activity

Donate For Us