I would like to get the average for certain columns for each row. I have this data: <pre class="prettyprint"><code>w=c(5,6,7,8) x=c(1,2,3,4) y=c(1,2,3) length(y)=4 z=data.frame(w,x,y) </code></pre> Which returns: <pre class="prettyprint"><code> w x y 1 5 1 1 2 6 2 2 3 7 3 3 4 8 4 NA </code></pre> I would like to get the mean for certain columns, not all of them. My problem is that there are a lot of NAs in my data. So if I wanted the mean of x and y, this is what I would like to get back: <pre class="prettyprint"><code> w x y mean 1 5 1 1 1 2 6 2 2 2 3 7 3 3 3 4 8 4 NA 4 </code></pre> I guess I could do something like <code>z$mean=(z$x+z$y)/2</code> but the last row for y is NA so obviously I do not want the NA to be calculated and I should not be dividing by two. I tried <code>cumsum</code> but that returns NAs when there is a single NA in that row. I guess I am looking for something that will add the selected columns, ignore the NAs, get the number of selected columns that do not have NAs and divide by that number. I tried ??mean and ??average and am completely stumped. ETA: Is there also a way I can add a weight to a specific column?

Here are some examples: <pre class="prettyprint"><code>> z$mean <- rowMeans(subset(z, select = c(x, y)), na.rm = TRUE) > z w x y mean 1 5 1 1 1 2 6 2 2 2 3 7 3 3 3 4 8 4 NA 4 </code></pre> weighted mean <pre class="prettyprint"><code>> z$y <- rev(z$y) > z w x y mean 1 5 1 NA 1 2 6 2 3 2 3 7 3 2 3 4 8 4 1 4 > > weight <- c(1, 2) # x * 1/3 + y * 2/3 > z$wmean <- apply(subset(z, select = c(x, y)), 1, function(d) weighted.mean(d, weight, na.rm = TRUE)) > z w x y mean wmean 1 5 1 NA 1 1.000000 2 6 2 3 2 2.666667 3 7 3 2 3 2.333333 4 8 4 1 4 2.000000 </code></pre>

Try using <code>rowMeans</code>: <pre class="prettyprint"><code>z$mean=rowMeans(z[,c("x", "y")], na.rm=TRUE) w x y mean 1 5 1 1 1 2 6 2 2 2 3 7 3 3 3 4 8 4 NA 4 </code></pre>

How can I get the average (mean) of selected columns

Tags:

r

I would like to get the average for certain columns for each row.

I have this data:

w=c(5,6,7,8) x=c(1,2,3,4) y=c(1,2,3) length(y)=4 z=data.frame(w,x,y)

Which returns:

  w x  y 1 5 1  1 2 6 2  2 3 7 3  3 4 8 4 NA

I would like to get the mean for certain columns, not all of them. My problem is that there are a lot of NAs in my data. So if I wanted the mean of x and y, this is what I would like to get back:

  w x  y mean 1 5 1  1    1 2 6 2  2    2 3 7 3  3    3 4 8 4 NA    4

I guess I could do something like z$mean=(z$x+z$y)/2 but the last row for y is NA so obviously I do not want the NA to be calculated and I should not be dividing by two. I tried cumsum but that returns NAs when there is a single NA in that row. I guess I am looking for something that will add the selected columns, ignore the NAs, get the number of selected columns that do not have NAs and divide by that number. I tried ??mean and ??average and am completely stumped.

ETA: Is there also a way I can add a weight to a specific column?

820

asked Feb 28 '12 22:02

thequerist

2 Answers

Here are some examples:

> z$mean <- rowMeans(subset(z, select = c(x, y)), na.rm = TRUE) > z   w x  y mean 1 5 1  1    1 2 6 2  2    2 3 7 3  3    3 4 8 4 NA    4

weighted mean

> z$y <- rev(z$y) > z   w x  y mean 1 5 1 NA    1 2 6 2  3    2 3 7 3  2    3 4 8 4  1    4 >  > weight <- c(1, 2) # x * 1/3 + y * 2/3 > z$wmean <- apply(subset(z, select = c(x, y)), 1, function(d) weighted.mean(d, weight, na.rm = TRUE)) > z   w x  y mean    wmean 1 5 1 NA    1 1.000000 2 6 2  3    2 2.666667 3 7 3  2    3 2.333333 4 8 4  1    4 2.000000

answered Sep 27 '22 17:09

kohske

Try using rowMeans:

z$mean=rowMeans(z[,c("x", "y")], na.rm=TRUE)    w x  y mean 1 5 1  1    1 2 6 2  2    2 3 7 3  3    3 4 8 4 NA    4

answered Sep 27 '22 17:09

Andrew

Related questions
                            
                                How to add multiple columns to a data.frame in one go?
                            
                                ggplot2, axis not showing after using theme(axis.line=element_line())
                            
                                Use expression with a variable r
                            
                                regex multiple pattern with singular replacement
                            
                                Adding a 3rd order polynomial and its equation to a ggplot in r
                            
                                How to get a barplot with several variables side by side grouped by a factor
                            
                                Split a string by any number of spaces
                            
                                Use input of purrr's map function to create a named list as output in R
                            
                                struggling with integers (maximum integer size)
                            
                                How does ggplot scale_continuous expand argument work?
                            
                                Extract non null elements from a list in R
                            
                                In R, using Ubuntu, try to install a lib depending on GMP C lib, it won't find GMP, but I have GMP installed
                            
                                Pandoc insert appendix after bibliography
                            
                                Converting data frame column from character to numeric
                            
                                cartesian product with dplyr R
                            
                                hiding personal functions in R
                            
                                Only download sources of a package and all dependencies
                            
                                Setting y axis breaks in ggplot
                            
                                dplyr left_join by less than, greater than condition
                            
                                Loop over rows of dataframe applying function with if-statement

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With