Filter based on NA in dplyr

Tags:

dplyr

This is my df

df <- structure(structure(list(group = structure(c(1L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L, 2L, 3L, 3L, 3L, 3L, 3L, 4L, 4L, 4L, 4L, 4L, 5L, 5L, 5L, 5L, 5L), .Label = c("A", "B", "C", "D", "E"), class = "factor"), y = c(NA, NA, NA, NA, 1, NA, NA, NA, 1, 2, NA, NA, 1, 2, 3, NA, 2, 2, 3, 4, NA, 3, 3, 4, 5), x = c(1L, 2L, 3L, 4L,5L, 1L, 2L, 3L, 4L, 5L, 1L, 2L, 3L, 4L, 5L, 1L, 2L, 3L, 4L, 5L, 1L, 2L, 3L, 4L, 5L)), .Names = c("group", "y", "x"), row.names = c(NA, 25L), class = "data.frame"))

> df
   group  y x
1      A NA 1
2      A NA 2
3      A NA 3
4      A NA 4
5      A  1 5
6      B NA 1
7      B NA 2
8      B NA 3
9      B  1 4
10     B  2 5
11     C NA 1
12     C NA 2
13     C  1 3
14     C  2 4
15     C  3 5
16     D NA 1
17     D  2 2
18     D  2 3
19     D  3 4
20     D  4 5
21     E NA 1
22     E  3 2
23     E  3 3
24     E  4 4
25     E  5 5

My goal is to calculate the mean per x value (across groups), using mutate. But first I'd like to filter the data, such that only those values of x remain for which there are at least 3 non-NA values. So in this example I only want to include those entries for which x is at least 3. I can't figure out how to create the filter(), any suggestions?

215

asked Jan 16 '15 16:01

erc

1 Answers

You could try

df %>% 
   group_by(group) %>% #group_by(x) %>% #as per the OP's clarification
   filter(sum(!is.na(y))>=3) %>% 
   mutate(Mean=mean(x, na.rm=TRUE))

158

answered Oct 03 '22 05:10

akrun

Related questions
                            
                                ggplot2 - Modify geom_density2d to accept weights as a parameter?
                            
                                merge partial matched strings
                            
                                How does cox.zph deal with time-dependent covariates?
                            
                                Reactive colours in shiny
                            
                                Date format for subset of ticks on time axis
                            
                                How write code to web crawling and scraping in R
                            
                                Why is the R match function so slow?
                            
                                R R6 classes and UseMethod / generic methods
                            
                                How to obtain shiny-server version info in Ubuntu?
                            
                                How to avoid looping over list after reading from JSON with R
                            
                                Error in R gbm function when cv.folds > 0
                            
                                dplyr: Counts/Percentages of factor grouped by school not getting grouped
                            
                                Conditionally hiding data labels in ggplot2 graph
                            
                                How to add parts to graph one by one in shiny
                            
                                R ts with missing values
                            
                                R: Can I include an R markdown file in a shiny ui.R file?
                            
                                Combine texreg, knitr, booktabs & dcolumn
                            
                                R data.table intersection of all groups
                            
                                How do I determine the number of significant figures in data in R?
                            
                                Using R, Randomly Assigning Students Into Groups Of 4

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With