I have the following dataset (simple version of my actual data), 'data', and would like to calculate weighted means for variables x1 and x2, using weightings w1 and w2 respectively, split up into two groups (groups determined by the variable n). <pre class="prettyprint"><code>data <- data.frame(n = c(1,1,1,2,2,2), x1 = c(4,5,4,7,5,5), x2 = c(7,10,9,NaN,11,12), w1 = c(0,1,1,1,1,1), w2 = c(1,1,1,0,0,1)) </code></pre> I'm trying to do it using with() but get an error when I run this: <pre class="prettyprint"><code>with(data, aggregate(x = list(x1=x1, x2=x2), by = list(n = n), FUN = weighted.mean, w = list(w1 = w1,w2 = w2))) </code></pre> On the otherhand, if weights aren't specified it works, but in this case default level weights are used (i.e. same as using FUN=mean). <pre class="prettyprint"><code>with(data, aggregate(x = list(x1=x1, x2=x2), by = list(n = n), FUN = weighted.mean)) </code></pre> This question is similar to weighted means by group and column, except that my question includes different weightings for different columns. I tried using a data table but it runs into the same weighting errors as with(). Thanks in advance for any help.

Try <pre class="prettyprint"><code>library(data.table) setDT(data)[, .(x1=weighted.mean(x1, w1), x2=weighted.mean(x2, w2)) , by = n] </code></pre> Or as @thelatemail commented, we can use <code>Map</code> to loop over "x's", corresponding "w's" columns and call with a single <code>weighted.mean</code> <pre class="prettyprint"><code>setDT(data)[, Map(weighted.mean, list(x1,x2), list(w1,w2)), by = n] </code></pre> If there are many "x" and "w" columns, we can use <code>grep</code> to get the column names, <code>mget</code> to return the values inside the <code>Map</code> <pre class="prettyprint"><code>setDT(data)[, Map(weighted.mean, mget(grep('x', names(data), value=TRUE)), mget(grep('w', names(data), value=TRUE))), by = n] </code></pre>

Calculate a series of weighted means in R for groups with different weightings

Tags:

r

with-statement

mean

weighted-average

I have the following dataset (simple version of my actual data), 'data', and would like to calculate weighted means for variables x1 and x2, using weightings w1 and w2 respectively, split up into two groups (groups determined by the variable n).

data <- data.frame(n = c(1,1,1,2,2,2), x1 = c(4,5,4,7,5,5), x2 = c(7,10,9,NaN,11,12), w1 = c(0,1,1,1,1,1), w2 = c(1,1,1,0,0,1))

I'm trying to do it using with() but get an error when I run this:

with(data, aggregate(x = list(x1=x1, x2=x2), by = list(n = n), FUN = weighted.mean, w = list(w1 = w1,w2 = w2)))

On the otherhand, if weights aren't specified it works, but in this case default level weights are used (i.e. same as using FUN=mean).

with(data, aggregate(x = list(x1=x1, x2=x2), by = list(n = n), FUN = weighted.mean))

This question is similar to weighted means by group and column, except that my question includes different weightings for different columns. I tried using a data table but it runs into the same weighting errors as with(). Thanks in advance for any help.

750

asked Jun 11 '15 04:06

Tina218

1 Answers

Try

library(data.table)
setDT(data)[, .(x1=weighted.mean(x1, w1), x2=weighted.mean(x2, w2)) , by = n]

Or as @thelatemail commented, we can use Map to loop over "x's", corresponding "w's" columns and call with a single weighted.mean

setDT(data)[, Map(weighted.mean, list(x1,x2), list(w1,w2)), by = n]

If there are many "x" and "w" columns, we can use grep to get the column names, mget to return the values inside the Map

setDT(data)[,  Map(weighted.mean, mget(grep('x', names(data), 
    value=TRUE)), mget(grep('w', names(data), value=TRUE))), by = n]

134

answered Sep 28 '22 05:09

akrun

Related questions
                            
                                What is the equivalent of SQL's IN keyword in R?
                            
                                How to order the levels of factors according to the ordering of a data.frame (and not alphabetically)
                            
                                Why rbind throws a warning
                            
                                Self reference when indexing into a vector
                            
                                multiple ggplot linear regression lines
                            
                                Check frequency of data.table value in other data.table
                            
                                add column with row wise mean over selected columns using dplyr
                            
                                Creating edge list in R
                            
                                Trying to understand R structure: what does a dot in function names signify?
                            
                                Rank based on several variables
                            
                                R Shiny Date range input
                            
                                iteratively adding elements to list in one step
                            
                                RStudio and Shiny in one dockerfile
                            
                                Use color names specified in data as fill color in geom_bar [duplicate]
                            
                                Finding critical values for the Pearson correlation coefficient
                            
                                How to get terminal nodes for a new observation from an rpart object?
                            
                                Creating formula using very long strings in R
                            
                                How to remove row based on condition of row above or below
                            
                                R: Roll up column values containing NA's by sum while grouping by ID's
                            
                                How to vectorize a "for" loop that returns a vector after applying a function for each ID

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With