Fitting a weighted distribution in R

Q: How does Fitdistr in R work?

The fitdistr function estimates distribution parameters by maximizing the likelihood function using the optim function. No distinction between parameters with different roles (e.g., main parameter and nuisance parameter) is made, as this paper focuses on parameter estimation from a general point-of-view.

Q: How do you assign weights?

To calculate how much weight you need, divide the known population percentage by the percent in the sample. For this example: Known population females (51) / Sample Females (41) = 51/41 = 1.24. Known population males (49) / Sample males (59) = 49/59 = .

Tags:

r

I'm looking to fit a weighted distribution to a data set I have.

I'm currently using the fitdist command but don't know if there is a way to add weighting.

library(fitdistrplus)
df<-data.frame(value=rlnorm(100,1,0.5),weight=runif(100,0,2))

#This is what I'm doing but not really what I want
fit_df<-fitdist(df$value,"lnorm")

#How to do this
fit_df_weighted<-fitdist(df$value,"lnorm",weight=df$weight)

I'm sure this has been answered before somewhere but I've looked and can't find anything.

thanks in advance,

Gordon

972

asked Nov 12 '13 19:11

gtwebb

1 Answers

Perhaps you could use the rep() function and a quick loop to approximate the distribution.

You could multiply each weighted value by, say, 10000, round the number, and then use it to indicate how many multiples of the value you need in your vector. After running a quick loop, you could then run the vector through the fitdist() algorithm.

df$scaled_weight <- round(df$weight*10000,0)
my_vector <- vector()

## quick loop
for (i in 1:nrow(df)){
  values <- rep(df$value[i], df$scaled_weight[i])
  my_vector <- c(my_vector, values)
}

## find parameters
fit_df_weighted <- fitdist(my_vector,"lnorm")

The standard errors would be rubbish, but the estimated parameters should be sufficient.

164

answered Oct 25 '22 12:10

Hip Hop Physician

Related questions
                            
                                Efficient way to sample from different probability vectors
                            
                                full precision may not have been achieved in 'qbeta'
                            
                                httr GET function running out of space when downloading a large file
                            
                                R: promise already under evaluation
                            
                                knitr - change code indentation
                            
                                How to get the list of all Yahoo Finance mutual funds in R?
                            
                                Counting species occurrence in a grid
                            
                                Robust cross-platform method of moving a directory
                            
                                R coda "The leading minor of order 3 is not positive definite"
                            
                                auto.arima not parallelizing
                            
                                Downloading the entire Bitcoin transaction chain with R
                            
                                Show code in appendix using knitr
                            
                                How to implement parallel jags on Windows with foreach?
                            
                                R CMD INSTALL --build package --> "vignettes missing"
                            
                                combn unclasses factor variables
                            
                                Change alpha level of geom_point in legend on top of stat_smooth
                            
                                Error with function fortify of ggplot2
                            
                                Levy Walk simulation in R
                            
                                Is it possible to provide a list of custom stopwords to RTextTools package?
                            
                                R Wrong encoding in Rstudio console (but ok in R GUI and ggplot2)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With