I am looking for a way to extract the first and last non-NA value from each group. I am using dplyr::first() and dplyr::last(), but I can´t work out how to choose the first or last non-NA value. <pre class="prettyprint"><code>library(dplyr) set.seed(123) d <- data.frame( group = rep(1:3, each = 3), year = rep(seq(2000,2002,1),3), value = sample(1:9, r = T)) #Introduce NA values in first row of group 2 and last row of group 3 d %>% mutate( value = case_when( group == 2 & year ==2000 ~ NA_integer_, group == 3 & year ==2002 ~ NA_integer_, TRUE ~ value))%>% group_by(group) %>% mutate( first = dplyr::first(value), last = dplyr::last(value)) </code></pre> RESULT (with issue) <pre class="prettyprint"><code># A tibble: 9 x 5 # Groups: group [3] group year value first last <int> <dbl> <int> <int> <int> 1 1 2000 3 3 4 2 1 2001 8 3 4 3 1 2002 4 3 4 4 2 2000 NA NA 1 5 2 2001 9 NA 1 6 2 2002 1 NA 1 7 3 2000 5 5 NA 8 3 2001 9 5 NA 9 3 2002 NA 5 NA </code></pre> Can you help me make the values in the "first" column for group 2 = 9 and the values in the "last" column from group 3 = 9? I very much prefer a tidyverse solution if one such exists?

Use <code>na.omit</code>, compare: <pre class="prettyprint"><code>first(c(NA, 11, 22)) # [1] NA first(na.omit(c(NA, 11, 22))) # [1] 11 </code></pre> Using example data: <pre class="prettyprint"><code>d %>% mutate( value = case_when( group == 2 & year ==2000 ~ NA_integer_, group == 3 & year ==2002 ~ NA_integer_, TRUE ~ value))%>% group_by(group) %>% mutate( first = dplyr::first(na.omit(value)), last = dplyr::last(na.omit(value))) # # A tibble: 9 x 5 # # Groups: group [3] # group year value first last # <int> <dbl> <int> <int> <int> # 1 1 2000 3 3 4 # 2 1 2001 8 3 4 # 3 1 2002 4 3 4 # 4 2 2000 NA 9 1 # 5 2 2001 9 9 1 # 6 2 2002 1 9 1 # 7 3 2000 5 5 9 # 8 3 2001 9 5 9 # 9 3 2002 NA 5 9 </code></pre>

dplyr::first() to choose first non NA value

Tags:

I am looking for a way to extract the first and last non-NA value from each group. I am using dplyr::first() and dplyr::last(), but I can´t work out how to choose the first or last non-NA value.

library(dplyr)
set.seed(123)
d <- data.frame(
  group = rep(1:3, each = 3),
  year = rep(seq(2000,2002,1),3),
  value = sample(1:9, r = T))

#Introduce NA values in first row of group 2 and last row of group 3
d %>%
  mutate(
    value = case_when(
      group == 2 & year ==2000 ~ NA_integer_,
      group == 3 & year ==2002 ~ NA_integer_,
      TRUE ~ value))%>%
  group_by(group) %>% 
  mutate(
    first = dplyr::first(value),
    last = dplyr::last(value))

RESULT (with issue)

# A tibble: 9 x 5
# Groups:   group [3]
  group  year value first  last
  <int> <dbl> <int> <int> <int>
1     1  2000     3     3     4
2     1  2001     8     3     4
3     1  2002     4     3     4
4     2  2000    NA    NA     1
5     2  2001     9    NA     1
6     2  2002     1    NA     1
7     3  2000     5     5    NA
8     3  2001     9     5    NA
9     3  2002    NA     5    NA

Can you help me make the values in the "first" column for group 2 = 9 and the values in the "last" column from group 3 = 9?

I very much prefer a tidyverse solution if one such exists?

810

asked Sep 07 '18 10:09

Steen Harsted

Video Answer

1 Answers

Use na.omit, compare:

first(c(NA, 11, 22))
# [1] NA

first(na.omit(c(NA, 11, 22)))
# [1] 11

Using example data:

d %>%
  mutate(
    value = case_when(
      group == 2 & year ==2000 ~ NA_integer_,
      group == 3 & year ==2002 ~ NA_integer_,
      TRUE ~ value))%>%
  group_by(group) %>% 
  mutate(
    first = dplyr::first(na.omit(value)),
    last = dplyr::last(na.omit(value)))

# # A tibble: 9 x 5
# # Groups:   group [3]
#   group  year value first  last
#   <int> <dbl> <int> <int> <int>
# 1     1  2000     3     3     4
# 2     1  2001     8     3     4
# 3     1  2002     4     3     4
# 4     2  2000    NA     9     1
# 5     2  2001     9     9     1
# 6     2  2002     1     9     1
# 7     3  2000     5     5     9
# 8     3  2001     9     5     9
# 9     3  2002    NA     5     9

198

answered Oct 14 '22 07:10

zx8754

Related questions
                            
                                UnhandledPromiseRejectionWarning on async await promise
                            
                                Android invalid color state list tag gradient
                            
                                What do number literals with a suffix, like 0u8, mean in Rust?
                            
                                How to use environment variables in Github Page?
                            
                                jq: error: test1/0 is not defined at <top-level>, line 1
                            
                                Why isn't React considered MVC?
                            
                                Can't set up the HMR: stuck with "Waiting for update signal from WDS..." in console
                            
                                Is there a way to see a list of all bookmarks in Visual Studio 2017?
                            
                                Should Flutter web use Wasm instead of dart2js
                            
                                Undefined behaviour in vector of vectors cast
                            
                                AWS Elastic Beanstalk Docker Does not support Multi-Stage Build
                            
                                Composer RuntimeException - Could not load package mews/purifier [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With