I have the following data frame: <pre class="prettyprint"><code>library(dplyr) library(tibble) df <- tibble( source = c("a", "b", "c", "d", "e"), score = c(10, 5, NA, 3, NA ) ) df </code></pre> It looks like this: <pre class="prettyprint"><code># A tibble: 5 x 2 source score <chr> <dbl> 1 a 10 . # current max value 2 b 5 3 c NA 4 d 3 5 e NA </code></pre> What I want to do is to replace <code>NA</code> in score column with values ranging for existing <code>max + n</code> onwards. Where <code>n</code> range from 1 to total number of rows of the <code>df</code> Resulting in this (hand-coded) : <pre class="prettyprint"><code> source score a 10 b 5 c 11 # obtained from 10 + 1 d 3 e 12 # obtained from 10 + 2 </code></pre> How can I achieve that?

Another option : <pre class="prettyprint"><code>transform(df, score = pmin(max(score, na.rm = TRUE) + cumsum(is.na(score)), score, na.rm = TRUE)) # source score #1 a 10 #2 b 5 #3 c 11 #4 d 3 #5 e 12 </code></pre> If you want to do this in <code>dplyr</code> <pre class="prettyprint"><code>library(dplyr) df %>% mutate(score = pmin(max(score, na.rm = TRUE) + cumsum(is.na(score)), score, na.rm = TRUE)) </code></pre>

A base R solution <pre class="prettyprint"><code>df$score[is.na(df$score)] <- seq(which(is.na(df$score))) + max(df$score,na.rm = TRUE) </code></pre> such that <pre class="prettyprint"><code>> df # A tibble: 5 x 2 source score <chr> <dbl> 1 a 10 2 b 5 3 c 11 4 d 3 5 e 12 </code></pre>

How to replace NA with set of values

I have the following data frame:

library(dplyr)
library(tibble)


df <- tibble(
  source = c("a", "b", "c", "d", "e"),
  score = c(10, 5, NA, 3, NA ) ) 


df

It looks like this:

# A tibble: 5 x 2
  source score
  <chr>  <dbl>
1 a         10 . # current max value
2 b          5
3 c         NA
4 d          3
5 e         NA

What I want to do is to replace NA in score column with values ranging for existing max + n onwards. Where n range from 1 to total number of rows of the df

Resulting in this (hand-coded) :

  source score
  a         10
  b          5
  c         11 # obtained from 10 + 1
  d          3
  e         12 #  obtained from 10 + 2

How can I achieve that?

How do I replace the NA with values in R?

The classic way to replace NA's in R is by using the IS.NA() function. The IS.NA() function takes a vector or data frame as input and returns a logical object that indicates whether a value is missing (TRUE or VALUE). Next, you can use this logical object to create a subset of the missing values and assign them a zero.

How do I remove Na from a data set?

To remove all rows having NA, we can use na. omit function. For Example, if we have a data frame called df that contains some NA values then we can remove all rows that contains at least one NA by using the command na. omit(df).

How do I put 0 instead of NA in R?

To replace NA with 0 in an R data frame, use is.na() function and then select all those values with NA and assign them to 0.

Another option :

transform(df, score = pmin(max(score, na.rm = TRUE) + 
                      cumsum(is.na(score)), score, na.rm = TRUE))

#  source score
#1      a    10
#2      b     5
#3      c    11
#4      d     3
#5      e    12

If you want to do this in dplyr

library(dplyr)
df %>% mutate(score = pmin(max(score, na.rm = TRUE) + 
                      cumsum(is.na(score)), score, na.rm = TRUE))

A base R solution

df$score[is.na(df$score)] <- seq(which(is.na(df$score))) + max(df$score,na.rm = TRUE)

such that

> df
# A tibble: 5 x 2
  source score
  <chr>  <dbl>
1 a         10
2 b          5
3 c         11
4 d          3
5 e         12

How to replace NA with set of values

Tags:

replace

r

if-statement

dplyr

tibble

scamander

People also ask

Video Answer

2 Answers

Ronak Shah

ThomasIsCoding

Recent Activity

Donate For Us

How to replace NA with set of values

Tags:

replace

r

if-statement

dplyr

tibble

scamander

People also ask

Video Answer

2 Answers

Ronak Shah

ThomasIsCoding

Related questions

Recent Activity

Donate For Us