Given a table like: <pre class="prettyprint"><code> id value 1 1 a 2 2 a 3 2 b 4 2 c 5 3 c </code></pre> I would like to filter for: a) the ids that only have value a, i.e. id 1. b) the ids that contain a and b jointly, i.e. id 2. Data: <pre class="prettyprint"><code>data.frame(id = c(1,2,2,2,3), value = c("a", "a", "b", "c", "c")) </code></pre>

Try a) <pre class="prettyprint"><code>df %>% group_by(id) %>% filter(all(value == "a")) </code></pre> b) <pre class="prettyprint"><code>df %>% group_by(id) %>% filter(all(c("a", "b") %in% value)) </code></pre>

Filter groups in dplyr that exclusively contain specific combinations of values

Tags:

r

dplyr

Given a table like:

I would like to filter for:

a) the ids that only have value a, i.e. id 1.

b) the ids that contain a and b jointly, i.e. id 2.

Data:

data.frame(id = c(1,2,2,2,3), value = c("a", "a", "b", "c", "c"))

786

asked Dec 14 '15 15:12

chopin_is_the_best

1 Answers

Try

df %>% group_by(id) %>% filter(all(value == "a"))

df %>% group_by(id) %>% filter(all(c("a", "b") %in% value))

187

answered Nov 15 '22 21:11

talat

Related questions
                            
                                Histogram of two variables in R
                            
                                How to read csv data with unknown encoding in R
                            
                                shapiro.test(..) cannot deal with more than 5000 data points
                            
                                rCharts with Highcharts as shiny application
                            
                                Legend of a raster map with categorical data
                            
                                melt multiple groups of measure.vars
                            
                                R: Avoid accidently overwriting variables
                            
                                05:00:00 - 28:59:59 time format
                            
                                NumPy percentile function different from MATLAB's percentile function
                            
                                Cannot use dput for data.table in R
                            
                                R: Reorder facet_wrapped x-axis with free_x in ggplot2
                            
                                How to order data within subgroups in data.table R
                            
                                Different colour palettes for two different colour aesthetic mappings in ggplot2
                            
                                Why is zoo::rollmean slow compared to a simple Rcpp implementation?
                            
                                How to hide figures in knitr, but create them as png?
                            
                                R data.table: subgroup weighted percent of group
                            
                                How to check if a filename is writeable in R?
                            
                                dplyr mutate using rbinom do not return random numbers
                            
                                Plotting POSIXct timestamp series with ggplot2
                            
                                nls troubles: Missing value or an infinity produced when evaluating the model

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With