I tried to remove NA's from the subset using dplyr piping. Is my answer an indication of a missed step. I'm trying to learn how to write functions using dplyr: <pre class="prettyprint"><code>> outcome.df%>% + group_by(Hospital,State)%>% + arrange(desc(HeartAttackDeath,na.rm=TRUE))%>% + head() Source: local data frame [6 x 5] Groups: Hospital, State </code></pre> <pre class="prettyprint"> Hospital State HeartAttackDeath 1 ABBEVILLE AREA MEDICAL CENTER SC NA 2 ABBEVILLE GENERAL HOSPITAL LA NA 3 ABBOTT NORTHWESTERN HOSPITAL MN 12.3 4 ABILENE REGIONAL MEDICAL CENTER TX 17.2 5 ABINGTON MEMORIAL HOSPITAL PA 14.3 6 ABRAHAM LINCOLN MEMORIAL HOSPITAL IL NA Variables not shown: HeartFailureDeath (dbl), PneumoniaDeath (dbl) </pre>

I don't think <code>desc</code> takes an <code>na.rm</code> argument... I'm actually surprised it doesn't throw an error when you give it one. If you just want to remove <code>NA</code>s, use <code>na.omit</code> (base) or <code>tidyr::drop_na</code>: <pre class="prettyprint"><code>outcome.df %>% na.omit() %>% group_by(Hospital, State) %>% arrange(desc(HeartAttackDeath)) %>% head() library(tidyr) outcome.df %>% drop_na() %>% group_by(Hospital, State) %>% arrange(desc(HeartAttackDeath)) %>% head() </code></pre> If you only want to remove <code>NA</code>s from the HeartAttackDeath column, filter with <code>is.na</code>, or use <code>tidyr::drop_na</code>: <pre class="prettyprint"><code>outcome.df %>% filter(!is.na(HeartAttackDeath)) %>% group_by(Hospital, State) %>% arrange(desc(HeartAttackDeath)) %>% head() outcome.df %>% drop_na(HeartAttackDeath) %>% group_by(Hospital, State) %>% arrange(desc(HeartAttackDeath)) %>% head() </code></pre> As pointed out at the dupe, <code>complete.cases</code> can also be used, but it's a bit trickier to put in a chain because it takes a data frame as an argument but returns an index vector. So you could use it like this: <pre class="prettyprint"><code>outcome.df %>% filter(complete.cases(.)) %>% group_by(Hospital, State) %>% arrange(desc(HeartAttackDeath)) %>% head() </code></pre>

Removing NA in dplyr pipe [duplicate]

Tags:

r

na

dplyr

I tried to remove NA's from the subset using dplyr piping. Is my answer an indication of a missed step. I'm trying to learn how to write functions using dplyr:

> outcome.df%>% + group_by(Hospital,State)%>% + arrange(desc(HeartAttackDeath,na.rm=TRUE))%>% + head() Source: local data frame [6 x 5] Groups: Hospital, State

                            Hospital State HeartAttackDeath 1     ABBEVILLE AREA MEDICAL CENTER    SC               NA 2        ABBEVILLE GENERAL HOSPITAL    LA               NA 3      ABBOTT NORTHWESTERN HOSPITAL    MN             12.3 4   ABILENE REGIONAL MEDICAL CENTER    TX             17.2 5        ABINGTON MEMORIAL HOSPITAL    PA             14.3 6 ABRAHAM LINCOLN MEMORIAL HOSPITAL    IL               NA Variables not shown: HeartFailureDeath (dbl), PneumoniaDeath   (dbl)

601

asked Oct 30 '14 23:10

ITCoderWhiz

1 Answers

I don't think desc takes an na.rm argument... I'm actually surprised it doesn't throw an error when you give it one. If you just want to remove NAs, use na.omit (base) or tidyr::drop_na:

outcome.df %>%   na.omit() %>%   group_by(Hospital, State) %>%   arrange(desc(HeartAttackDeath)) %>%   head()  library(tidyr) outcome.df %>%   drop_na() %>%   group_by(Hospital, State) %>%   arrange(desc(HeartAttackDeath)) %>%   head()

If you only want to remove NAs from the HeartAttackDeath column, filter with is.na, or use tidyr::drop_na:

outcome.df %>%   filter(!is.na(HeartAttackDeath)) %>%   group_by(Hospital, State) %>%   arrange(desc(HeartAttackDeath)) %>%   head()  outcome.df %>%   drop_na(HeartAttackDeath) %>%   group_by(Hospital, State) %>%   arrange(desc(HeartAttackDeath)) %>%   head()

As pointed out at the dupe, complete.cases can also be used, but it's a bit trickier to put in a chain because it takes a data frame as an argument but returns an index vector. So you could use it like this:

outcome.df %>%   filter(complete.cases(.)) %>%   group_by(Hospital, State) %>%   arrange(desc(HeartAttackDeath)) %>%   head()

109

answered Sep 18 '22 13:09

Gregor Thomas

Related questions
                            
                                Get all Parameters as List
                            
                                How to overlay density plots in R?
                            
                                Use a value from the previous row in an R data.table calculation
                            
                                How to prevent scientific notation in R? [duplicate]
                            
                                Legend on bottom, two rows wrapped in ggplot2 in r
                            
                                Filter multiple values on a string column in dplyr
                            
                                How do you add a general label to facets in ggplot2?
                            
                                Types and classes of variables
                            
                                How do I deal with special characters like \^$.?*|+()[{ in my regex?
                            
                                What does "The following object is masked from 'package:xxx'" mean?
                            
                                Error in fetch(key) : lazy-load database
                            
                                Usage of `...` (three-dots or dot-dot-dot) in functions [duplicate]
                            
                                ggplot combining two plots from different data.frames
                            
                                Return index of the smallest value in a vector?
                            
                                Create a data.frame where a column is a list
                            
                                Formula with dynamic number of variables
                            
                                How can I interrupt a running code in R with a keyboard command?
                            
                                Trimming a huge (3.5 GB) csv file to read into R
                            
                                R sequence of dates with lubridate
                            
                                Saving a high resolution image in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With