Ignoring values or NAs in the sample function

I have a matrix in R, and I would like to take a single random sample from each row. Some of my data is NA, and when taking the random sample I do not want NA to be a possible result. How would I accomplish this?

For example,

a <- matrix (c(rep(5, 10), rep(10, 10), rep(NA, 5)), ncol=5, nrow=5)
a
     [,1] [,2] [,3] [,4] [,5]
[1,]    5    5   10   10   NA
[2,]    5    5   10   10   NA
[3,]    5    5   10   10   NA
[4,]    5    5   10   10   NA
[5,]    5    5   10   10   NA

When I apply the sample function to each row of this matrix and collect the results in another matrix, I get

b <- matrix(apply(a, 1, sample, size=1), ncol=1)
b

     [,1]
[1,]   NA
[2,]   NA
[3,]   10
[4,]   10
[5,]    5

I do not want NA to be a possible output. Instead, I want the output to look something like:

b
     [,1]
[1,]   10
[2,]   10
[3,]   10
[4,]    5
[5,]   10

asked Apr 02 '12 02:04

Kevin

2 Answers

There might be a better way, but sample doesn't appear to have any parameters related to NAs, so instead I wrote an anonymous function that drops the NAs before sampling.

apply(a, 1, function(x){sample(x[!is.na(x)], size = 1)})

essentially does what you want. If you really want the matrix output you could do

b <- matrix(apply(a, 1, function(x){sample(x[!is.na(x)], size = 1)}), ncol = 1)

Edit: You didn't ask for this, but my proposed solution does fail in certain cases (mainly if a row contains ONLY NAs).

a <- matrix (c(rep(5, 10), rep(10, 10), rep(NA, 5)), ncol=5, nrow=5)
# My solution works fine with your example data
apply(a, 1, function(x){sample(x[!is.na(x)], size = 1)})

# What happens if a row contains only NAs
a[1,] <- NA

# Now it fails with an error, because row 1 has no non-NA values left to sample from
apply(a, 1, function(x){sample(x[!is.na(x)], size = 1)})

# We can rewrite the function to deal with that case
mysample <- function(x, ...){
    if(all(is.na(x))){
        return(NA)
    }
    return(sample(x[!is.na(x)], ...))
}

# Using the new function things work.
apply(a, 1, mysample, size = 1)
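One further edge case, offered as a hedged addition rather than as part of the original answer: as documented in ?sample, when x is a single number of at least 1, sample(x, ...) samples from 1:x instead of returning x. A row with exactly one non-NA value, say 10, could therefore return any integer from 1 to 10. The sketch below sidesteps this by sampling a position with sample.int and indexing into the remaining values. The helper name sample_non_na is hypothetical, and NA_real_ assumes a numeric matrix.

```r
# Hypothetical helper (not from the original answers): pick one non-NA
# value from a vector, returning NA when nothing is left to sample.
sample_non_na <- function(x) {
  keep <- x[!is.na(x)]
  if (length(keep) == 0L) {
    return(NA_real_)  # assumes numeric data
  }
  # Sample a position, not the values themselves, so that a single
  # remaining number such as 10 is never treated as "sample from 1:10".
  keep[sample.int(length(keep), size = 1L)]
}

a <- matrix(c(rep(5, 10), rep(10, 10), rep(NA, 5)), ncol = 5, nrow = 5)
a[1, ] <- NA                      # a row with no usable values
a[2, ] <- c(NA, NA, NA, 10, NA)   # a row with exactly one usable value

# vapply enforces that every row yields exactly one numeric value
b <- matrix(vapply(seq_len(nrow(a)),
                   function(i) sample_non_na(a[i, ]),
                   numeric(1)),
            ncol = 1)
b
```

Here row 1 comes back as NA and row 2 always comes back as 10, while the remaining rows return either 5 or 10.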

answered Nov 15 '22 05:11

Dason

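I think @Dason's solution works quite well, but you can also try this, which draws from all non-NA values of the matrix pooled together rather than from each row separately: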

a <- matrix (c(rep(5, 10), rep(10, 10), rep(NA, 5)), ncol=5, nrow=5)
matrix(sample(na.omit(as.numeric(a)),ncol(a)))
     [,1]
[1,]   10
[2,]    5
[3,]   10
[4,]   10
[5,]    5

Even if a complete row or a complete column consists of NAs, this solution still returns a value without error, for instance:

set.seed(007)
a <- matrix(sample(1:100, 25), 5)
a[1,] <- NA
a[5,1] <- NA
a[,3] <- NA
a[5,5] <- NA
a[3,2] <- NA

matrix(sample(na.omit(as.numeric(a)),ncol(a)))
     [,1]
[1,]   40
[2,]    1
[3,]   42
[4,]   26
[5,]   32

I guess this is what you were looking for (at least this could be another approach).
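To see how sampling from the whole matrix differs from sampling within each row, here is a small hedged sketch. The data are made up for illustration (row i holds values in the i-hundreds, so each sampled value reveals the row it came from), and the helper row_of is hypothetical. Note that it draws nrow(a) values, whereas the snippet above uses ncol(a); the two only coincide for a square matrix.

```r
# Illustrative data (an assumption, not from the question):
# row 1 is 101 102 103, row 2 is 201 202 203, and so on.
a <- matrix(rep(1:4, each = 3) * 100 + 1:3, nrow = 4, byrow = TRUE)
a[2, 3] <- NA
a[4, 1] <- NA

# Pooled approach: one draw from all non-NA values in the matrix
pooled <- sample(na.omit(as.numeric(a)), nrow(a))

# Per-row approach: one draw from the non-NA values of each row
per_row <- apply(a, 1, function(x) {
  keep <- x[!is.na(x)]
  keep[sample.int(length(keep), size = 1L)]
})

# Hypothetical helper: recover the source row from the hundreds digit
row_of <- function(v) v %/% 100

row_of(per_row)  # always 1 2 3 4: exactly one value from each row
row_of(pooled)   # any mix of rows; some rows may repeat or be missing
```

If the goal is one value per row, as in the question, the per-row form is the one that guarantees it.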

answered Nov 15 '22 07:11

Jilber Urbina

Tags: r, matrix, apply, sample
