How to combine the output of amelia

for (i in 1:impute$m) {
  model <- rpart(Y ~ X1 + X2 + X3 + X4 + X5,
                 data = impute$imputations[[i]],
                 method = "anova",
                 control = rpart.control(cp = 0.001))
  b.out <- rbind(b.out, model$coef)
  se.out <- rbind(se.out, coef(summary(model))[, 2])
}
combined.results <- mi.meld(q = b.out, se = se.out)

You may combine all imputed data sets in the Amelia output by using the command below:

#save Amelia output:
a.out <- amelia(data, ...)

# stack up all imputed datasets while adding a new column called ImputationNumber to be able to track them:
df_imputed_all <- do.call(rbind, Map(cbind, a.out$imputations, ImputationNumber = seq_along(a.out$imputations)))
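To see what the stacking line produces without running Amelia, here is a minimal sketch in base R. The list `imputations` and its values are made up for illustration and simply stand in for `a.out$imputations`, which is a list of data frames with one element per imputation.

```r
# Hypothetical stand-in for a.out$imputations: two small "imputed" data frames
imputations <- list(
  imp1 = data.frame(Y = c(1, 2, 3), X1 = c(10, 20, 30)),
  imp2 = data.frame(Y = c(1, 2.5, 3), X1 = c(10, 20, 31))
)

# Same pattern as above: add an ImputationNumber column to each data frame,
# then bind all of them row-wise into one long data frame
stacked <- do.call(
  rbind,
  Map(cbind, imputations, ImputationNumber = seq_along(imputations))
)

# One block of rows per imputation
print(table(stacked$ImputationNumber))

# The tracking column lets you recover the individual datasets later
by_imputation <- split(stacked, stacked$ImputationNumber)
```

The stacked data frame has (number of rows) x (number of imputations) rows, so keep the ImputationNumber column if you ever need to analyze each imputed dataset separately.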

Alternatively, you can use the write.amelia function from the Amelia package to save the multiply imputed datasets to a single file (or to separate files) and examine them.

The code below saves the combined imputed datasets in .dta format (the Stata data format). Set the format option to "csv" or "table" if you want one of those formats instead.

# save all imputed datasets in a single dta file in stacked version:
write.amelia(obj=a.out, separate = FALSE, file.stem = "ameliaimputations", format = "dta")

Training your model on this combined imputed dataset makes sense to me. Just make sure you don't introduce any data leakage when imputing the test data. (In other words, don't use parameters obtained from the training data to impute the test data.)
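One way to follow that advice is to split the rows before imputing, so that each part is imputed by its own amelia call. This is only a rough sketch: the toy data, the column names, the split sizes and m = 5 are all assumptions for illustration, and it assumes the Amelia package is installed.

```r
library(Amelia)  # assumes the Amelia package is installed

set.seed(42)
n <- 200

# Hypothetical toy data with some missing values in X2
toy <- data.frame(X1 = rnorm(n))
toy$X2 <- 0.5 * toy$X1 + rnorm(n)
toy$Y <- 1 + toy$X1 - toy$X2 + rnorm(n)
toy$X2[sample(n, 30)] <- NA

# Split first, then impute each part on its own
train_idx <- sample(n, 150)
train <- toy[train_idx, ]
test <- toy[-train_idx, ]

a.train <- amelia(train, m = 5, p2s = 0)  # p2s = 0 suppresses console output
a.test <- amelia(test, m = 5, p2s = 0)

# Stack the training imputations as shown earlier
train_all <- do.call(
  rbind,
  Map(cbind, a.train$imputations,
      ImputationNumber = seq_along(a.train$imputations))
)
```

With this layout the model is trained on train_all only, and the imputed test sets in a.test$imputations are used purely for evaluation.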
