Summing lots of vectors, row-wise or elementwise, but ignoring NA values

Tags:

r

na

vector

sum

I am trying to create a new vector that is the sum of 35 other vectors. The problem is that there are lots of NA values, but for this particular use, I want to treat those as zeros. Adding the vectors won't work, because if any of the 35 vectors contains an NA at a given position, the result at that position is NA. Here is an example of the problem:

col1 <- c(NA, 1, 2, 3)
col2 <- c(1, 2, 3, NA)
col3 <- c(NA, NA, 2, 3)
Sum <- col1 + col2 + col3
Sum
# [1] NA NA  7 NA

I want the result to be 1, 3, 7, 6.
I suppose I could create new versions of each of the vectors in which I replace the NA with a 0, but that would be a lot of work when applied to 35 vectors. Is there a simple function that will help me out?

397

asked Nov 20 '13 19:11

user2980491

2 Answers

You could also use the rowSums function:

rowSums(cbind(col1, col2, col3), na.rm = TRUE)
# [1] 1 3 7 6

?rowSums   # also has colSums described on same help page
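
With 35 vectors, typing every name into cbind gets tedious. Here is a minimal sketch, assuming the vectors are already gathered in a list and all have the same length. The names vecs, m, and total are hypothetical and do not come from the question:

```r
# Hypothetical list standing in for the 35 vectors; only three are shown.
vecs <- list(
  col1 = c(NA, 1, 2, 3),
  col2 = c(1, 2, 3, NA),
  col3 = c(NA, NA, 2, 3)
)

# do.call() passes each list element to cbind() as a separate argument,
# producing a matrix with one column per vector.
m <- do.call(cbind, vecs)

# Sum across each row, skipping NA values.
total <- rowSums(m, na.rm = TRUE)
total
# [1] 1 3 7 6
```

Note that with na.rm = TRUE a row made up entirely of NA values sums to 0, which matches the goal of treating NA as zero.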

165

answered Nov 21 '22 20:11

IRTFM

Put them in a matrix first:

apply(cbind(col1, col2, col3), 1, sum, na.rm = TRUE)
# [1] 1 3 7 6

You can read about each function in R's built-in documentation: ?apply, ?cbind.

cbind stands for "column bind": it takes several vectors or arrays and binds them "by column" into a single array:

cbind(col1, col2, col3)
#      col1 col2 col3
# [1,]   NA    1   NA
# [2,]    1    2   NA
# [3,]    2    3    2
# [4,]    3   NA    3

apply, well, applies a function (sum in this case) to either the rows or the columns of a matrix; the second argument, 1, selects rows. Any further arguments are passed on to that function, which lets us supply na.rm = TRUE to sum so that the NA values are dropped.
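
To make the role of the second argument concrete, here is a small sketch that reuses the question's three vectors and runs apply over rows and then over columns. The names m, row_totals, and col_totals are only illustrative:

```r
col1 <- c(NA, 1, 2, 3)
col2 <- c(1, 2, 3, NA)
col3 <- c(NA, NA, 2, 3)
m <- cbind(col1, col2, col3)

# MARGIN = 1: call sum() once per row; extra arguments such as
# na.rm = TRUE are passed through to sum().
row_totals <- apply(m, 1, sum, na.rm = TRUE)
row_totals
# [1] 1 3 7 6

# MARGIN = 2: call sum() once per column instead.
col_totals <- apply(m, 2, sum, na.rm = TRUE)
col_totals
# col1 col2 col3 
#    6    6    5 
```

The row-wise result is what the question asks for; the column-wise call is shown only for contrast.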

answered Nov 21 '22 18:11

joran
