Open Journal of Statistics

Volume 3, Issue 5 (October 2013)

ISSN Print: 2161-718X   ISSN Online: 2161-7198

Google-based Impact Factor: 0.53  Citations  

Multiple Imputation of Missing Data: A Simulation Study on a Binary Response

HTML  Download Download as PDF (Size: 336KB)  PP. 370-378  
DOI: 10.4236/ojs.2013.35043    7,895 Downloads   14,080 Views  Citations

ABSTRACT

Currently, a growing number of programs become available in statistical software for multiple imputation of missing values. Among others, two algorithms are mainly implemented: Expectation Maximization (EM) and Multiple Imputation by Chained Equations (MICE). They have been shown to work well in large samples or when only small proportions of missing data are to be imputed. However, some researchers have begun to impute large proportions of missing data or to apply the method to small samples. A simulation was performed using MICE on datasets with 50, 100 or 200 cases and four or eleven variables. A varying proportion of data (3% - 63%) was set as missing completely at random and subsequently substituted using multiple imputation by chained equations. In a logistic regression model, four coefficients, i.e. non-zero and zero main effects as well as non-zero and zero interaction effects were examined. Estimations of all main and interaction effects were unbiased. There was a considerable variance in the estimates, increasing with the proportion of missing data and decreasing with sample size. The imputation of missing data by chained equations is a useful tool for imputing small to moderate proportions of missing data. The method has its limits, however. In small samples, there are considerable random errors for all effects.

Share and Cite:

J. Hardt, M. Herke, T. Brian and W. Laubach, "Multiple Imputation of Missing Data: A Simulation Study on a Binary Response," Open Journal of Statistics, Vol. 3 No. 5, 2013, pp. 370-378. doi: 10.4236/ojs.2013.35043.

Cited by

[1] Are you prepared? Efficacy, contextual vulnerability, and disaster readiness
International Journal of Disaster Risk …, 2022
[2] Feasibility of an online training and support program for dementia carers: results from a mixed-methods pilot randomized controlled trial
BMC geriatrics, 2022
[3] Coping strategies and quality of life among Thai family carers of community‐dwelling persons living with dementia: A cross‐sectional study
Journal of Advanced …, 2022
[4] Sleep and chronotype in adults with persistent tic disorders
Journal of Clinical …, 2022
[5] Missing data in bioarchaeology II: A test of ordinal and continuous data imputation
American Journal of …, 2022
[6] Multiple imputation of a derived variable in a survival analysis context
2022
[7] Fatal and Non-Fatal Police Shootings in the United States, 2015: An Examination of Open-Source Data
2022
[8] The Effect of Sample Size and Missingness on Inference with Missing Data
arXiv preprint arXiv:2112.09275, 2021
[9] Engaging the osteological paradox: A study of frailty and survivorship in the 1918 influenza pandemic
2021
[10] Before the Lightning Strikes: Preparedness, Capacities, and Social Welfare Policy. Micro, Mezzo, and Macro Correlates of Disaster Preparedness
2021
[11] The role of gender in the evolution of peer networks: Individual differences in relation to the Big Five
2021
[12] Factor Retention in Exploratory Factor Analysis With Missing Data
2021
[13] Machine Learning Models for Classification of Cushing's Syndrome Using Retrospective Data
2021
[14] Bias dynamics for parameter estimation with missing data mechanisms under logistic model
2021
[15] Evaluation of Multiple Imputation with Large Proportions of Missing Data: How Much Is Too Much?
2021
[16] A Comparison of the Heckman Selection Model, Ibrahim, and Lipsitz Methods for Dealing with Nonignorable Missing Data
Journal of Psychiatry and Behavioral Sciences, 2021
[17] Limitations and potential facilitators and benefits of managing chronic conditions in community pharmacy settings
2021
[18] Assessing causality of the association between maternal smoking during pregnancy and offspring intellectual disability
2021
[19] Machine Learning Models for Diagnosis of Cushing's Syndrome Using Retrospective Data
2020
[20] Health and well-being of adolescents in different family structures in Germany and the importance of family climate
2020
[21] Diet Quality Index associated with Digital Food Guide: update and validation
Cadernos de Saúde …, 2019
[22] Unusual Experiences, Beliefs and Paranoia: Exploring the Relationships with Shame Memories and Compassion
2019
[23] The proportion of missing data should not be used to guide decisions on multiple imputation
2019
[24] Xử lý dữ liệu thiếu bằng biểu đồ chuẩn hóa đơn vị (SLP) và Support Vector Regression (SVR)
2019
[25] Handling Missing Data Using Standardized Load Profile (SLP) and Support Vector Regression (SVR)
2019
[26] Comparison of Methods for Processing Missing Values in Large Sample Survey Data
2019
[27] Iniquidades étnico-raciais nas hospitalizações por causas evitáveis em menores de cinco anos no Brasil, 2009-2014
2019
[28] Seed dispersal syndromes in the Madagascan flora: the unusual importance of primates
Oryx, 2018
[29] Multiple Imputation for Dichotomous MNAR Items Using Recursive Structural Equation Modeling With Rasch Measures as Predictors
SAGE Open, 2018
[30] Flexible imputation of missing data
Flexible Imputation of Missing Data, Second Edition, 2018
[31] The use of Rasch Measurement Theory to address measurement and analysis challenges in social science research
2018
[32] Music in Malaysian higher education: the relationships among personal environmental factors and measured achievement of students' music performance
2017
[33] Análisis del crecimiento económico y la contaminación del aire en México de 1980-2012, basado en el proceso de la curva ambiental de Kuznets
2017
[34] Social Networks as Predictors of the Harm Suffered by Victims of a Large-Scale Ponzi Scheme
2017
[35] Bayesian-based parallel ant system for missing value estimation in large databases
International Journal of Bio-Inspired Computation, 2017
[36] Privacy-preserving of SVM over vertically partitioned with imputing missing data
Distributed and Parallel Databases, 2017
[37] Heuristically repopulated Bayesian ant colony optimization for treating missing values in large databases
Knowledge-Based Systems, 2017
[38] Dealing with missing data for the power load studies using support vector regression (SVR)
Tạp chí Khoa học và Công nghệ-Đại học Đà Nẵng, 2017
[39] Missing Data in Alcohol Clinical Trials with Binary Outcomes
Alcoholism: Clinical and Experimental Research, 2016
[40] Estimation of the Incidence of Hepatocellular Carcinoma and Cholangiocarcinomain Songkhla, Thailand, 1989-2013, Using Multiple Imputation Method
CANCER RESEARCH AND TREATMENT, 2016
[41] Nearest neighbor imputation algorithms: a critical evaluation
2016
[42] Perceived burdensomeness and suicide ideation in adult outpatients receiving exposure therapy for anxiety disorders
Behaviour Research and Therapy, 2016
[43] How Does Caregiver Well-Being Relate to Perceived Quality of Care in Patients With Cancer? Exploring Associations and Pathways
2016
[44] Survival modelling, missing values and frailty with application to cervical cancer data/Nuradhiathy Abd Razak
2016
[45] Incidence of cholangiocarcinoma and prevalence of opisthorchis viverrini infestation in Songkhla province, Southern Thailand
2016
[46] Comparison of Methods of Handling Missing Data: A Case Study of KDHS 2010 Data
American Journal of Theoretical and Applied Statistics, 2015
[47] Exploring Home Visitation as an Intervention for Child Abuse and Neglect: Is Worker-Parent Alliance Predictive of Maternal Outcomes?
ProQuest Dissertations Publishing, 2015
[48] A Comparison Of Multiple Imputation Methods For Categorical Data
2015

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.