
Filter rows within data.table group if max group value > some value [duplicate]

Tags:

r

data.table

I am trying to keep all rows of a group in a data.table only when the maximum value within that group satisfies a condition (in the example below, max(x) < 11). Below is how I would do it in dplyr and how I got it working in two steps in data.table.

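library(data.table)
library(dplyr)

# dplyr
df <- data.table(
  x = 1:12,
  y = 1:3  # recycled to length 12
)

df %>% group_by(y) %>% 
  filter(max(x) < 11)

# data.table (two steps; adds a max_value column to df by reference)
df[, max_value := max(x), by = y][max_value < 11]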

The output should be

    x y
1:  1 1 
2:  4 1 
3:  7 1 
4: 10 1

Is there a way to do this in one step, without creating the extra column in my dataset? All I have been able to find are examples that subset a group down to one specific value, not ones that return all rows of every group that meets the condition.


asked Aug 28 '19 18:08

Andrew Troiano

1 Answer

We can use .I to get the row indices of the groups that meet the condition, extract the index column (V1), and use it to subset the original table:

df[df[, .I[max(x) < 11], y]$V1]
#    x y
#1:  1 1
#2:  4 1
#3:  7 1
#4: 10 1
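
To make the two stages of the .I approach easier to follow, here is a minimal sketch that runs them separately. It assumes the same toy data as the question; the names idx and res are only illustrative, and rep() is used to spell out the recycling of y explicitly.

```r
library(data.table)

# Same toy data as in the question; rep() makes the recycling of y explicit.
df <- data.table(x = 1:12, y = rep(1:3, times = 4))

# Step 1: per group, keep the row numbers (.I) only when the group passes.
# The condition is a single TRUE/FALSE per group: TRUE keeps every row
# number of the group, FALSE keeps none.
idx <- df[, .I[max(x) < 11], by = y]
print(idx)  # columns y and V1 (V1 is the default name for the unnamed result)

# Step 2: use the collected row numbers to subset the original table.
res <- df[idx$V1]
print(res)
```

Because the subset is taken from the original table, the result keeps the original column order (x, y) and no helper column is added to df.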

Another option is to subset .SD within each group:

df[, .SD[max(x) < 11], y]
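
As a hedged illustration of the .SD variant, the sketch below compares it with a closely related idiom, `if (cond) .SD`, which is not part of the original answer but is commonly used for the same purpose. The names res_sd, res_if and res_xy are illustrative only.

```r
library(data.table)

# Same toy data as in the question; rep() makes the recycling of y explicit.
df <- data.table(x = 1:12, y = rep(1:3, times = 4))

# From the answer: subset each group's data (.SD) by one logical per group.
res_sd <- df[, .SD[max(x) < 11], by = y]

# Related idiom (an addition, not from the answer): return .SD only for
# qualifying groups; groups where `if` yields NULL are dropped from the result.
res_if <- df[, if (max(x) < 11) .SD, by = y]

# With by =, the grouping column comes first (y, x). Reorder a copy if the
# original layout (x, y) is needed.
res_xy <- setcolorder(copy(res_if), c("x", "y"))
print(res_xy)
```

Unlike the .I version, both of these place the grouping column first in the result, so a reorder step is needed to match the layout shown in the question.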

answered Oct 19 '22 00:10

akrun
