Plot Multiple Imputation Results

I have successfully completed a multiple imputation on the missing data of my questionnaire research using the MICE package in R and performed a linear regression on the pooled imputed variables. I can't work out how to extract single pooled variables and plot them in a graph. Any ideas?

e.g.

> imp <- mice(questionnaire)
> fit <- with(imp, lm(APE ~ TMAS + APB + APA + FOAP))
> summary(pool(fit))

I want to plot pooled APE by TMAS.

Reproducible Example using nhanes:

> library(mice)
> nhanes
> imp <- mice(nhanes)
> fit <- with(imp, lm(bmi ~ chl + hyp))
> fit
> summary(pool(fit))

I would like to plot pooled chl against pooled bmi (for example).

The best I have been able to achieve is:

> mat <-complete(imp, "long")
> plot(mat$chl~mat$bmi)

I believe this gives the combined plot of all 5 imputations, which is not quite what I am looking for (I think).

asked Aug 27 '10 08:08

Frank Zafka

1 Answer

The underlying with.mids() function carries out the regression on each imputed data frame, so what happened is not one regression but 5. pool() then averages the estimated coefficients and adjusts their variances for statistical inference according to how much the estimates differ between imputations (Rubin's rules).
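To make that concrete, here is a minimal sketch that reproduces the pooled numbers by hand for the nhanes example. It assumes mice version 3.0 or later, where the pooled table is available as pool(fit)$pooled; the seed and the variable names qhat, qbar, ubar, b and total_var are only illustrative choices.

```r
library(mice)

# Same model as in the question; seed fixed only for reproducibility
imp <- mice(nhanes, printFlag = FALSE, seed = 1)
fit <- with(imp, lm(bmi ~ chl + hyp))

m <- length(fit$analyses)            # number of imputations (5 by default)

# One column of coefficients / sampling variances per imputed data set
qhat <- sapply(fit$analyses, coef)
u    <- sapply(fit$analyses, function(f) diag(vcov(f)))

qbar <- rowMeans(qhat)               # pooled estimate: mean of the m estimates
ubar <- rowMeans(u)                  # within-imputation variance
b    <- apply(qhat, 1, var)          # between-imputation variance
total_var <- ubar + (1 + 1 / m) * b  # total variance under Rubin's rules

# Compare the hand-computed values with what pool() reports
pooled <- pool(fit)$pooled
cbind(by_hand = qbar, pool = pooled$estimate)
cbind(by_hand = total_var, pool = pooled$t)
```

The two columns in each comparison should agree, which shows that pooling works on the model estimates and never produces a pooled data set.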

So there aren't single pooled variables to plot. What you could do is average the 5 imputed sets and recreate some kind of "regression line" based on the pooled coefficients, e.g.:

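# Stacked imputed data, as in the question
mat <- complete(imp, "long")

# Averaged imputed data: mean over the 5 imputations for each original row
combchl <- tapply(mat$chl, mat$.id, mean)
combbmi <- tapply(mat$bmi, mat$.id, mean)
combhyp <- tapply(mat$hyp, mat$.id, mean)

# Pooled coefficients: (Intercept), chl, hyp
# (mice >= 3.0; older releases exposed them as pool(fit)$qbar)
coefs <- pool(fit)$pooled$estimate

# Design matrix for the regression line
# Note: chl and hyp both run from their minimum to their maximum here
x <- data.frame(
  int = rep(1, 25),
  chl = seq(min(combchl), max(combchl), length.out = 25),
  hyp = seq(min(combhyp), max(combhyp), length.out = 25)
)

# Predicted bmi from the pooled coefficients
y <- as.matrix(x) %*% coefs

# Plot the averaged data with the pooled "regression line"
plot(combbmi ~ combchl)
lines(x$chl, y, col = "red")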
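As a variation on the code above, and not the only reasonable choice, one could plot the points of every completed data set and hold hyp fixed at its mean, so that the line shows the pooled effect of chl alone. This is a sketch under the same assumption of mice 3.0 or later; chl_grid, hyp_fixed and bmi_hat are illustrative names.

```r
library(mice)

imp <- mice(nhanes, printFlag = FALSE, seed = 1)
fit <- with(imp, lm(bmi ~ chl + hyp))

est  <- pool(fit)$pooled$estimate   # order: (Intercept), chl, hyp
long <- complete(imp, "long")       # all completed data sets stacked, with .imp and .id

# Pooled prediction over the observed chl range, with hyp held at its mean
chl_grid  <- seq(min(long$chl), max(long$chl), length.out = 25)
hyp_fixed <- mean(long$hyp)
bmi_hat   <- est[1] + est[2] * chl_grid + est[3] * hyp_fixed

# Points coloured by imputation number, pooled line on top
plot(long$chl, long$bmi, col = long$.imp, pch = 20,
     xlab = "chl", ylab = "bmi")
lines(chl_grid, bmi_hat, col = "red", lwd = 2)
```

Rows that were observed in the original data appear 5 times at the same position, while imputed values scatter, which gives a rough visual impression of the imputation uncertainty.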

answered Sep 27 '22 21:09

Joris Meys

Tags: plot, r, missing-data, imputation, r-mice
