Open Journal of Statistics

Volume 11, Issue 4 (August 2021)

ISSN Print: 2161-718X   ISSN Online: 2161-7198

Google-based Impact Factor: 1.45

Study on the Missing Data Mechanisms and Imputation Methods

PP. 477-492
DOI: 10.4236/ojs.2021.114030

ABSTRACT

Missing values in observed datasets are a real hindrance to obtaining valid results in statistical research. This paper addresses the widespread missing data problem faced by analysts and statisticians in academic and professional environments, and studies several data-driven methods for obtaining accurate data. Projects that rely heavily on data face this problem, and because machine learning models are only as good as the data used to train them, missing data has a real impact on the solutions developed for real-world problems. This study therefore attempts to address the problem using different mechanisms. It tests the effectiveness of both traditional and modern data imputation techniques by determining the loss of statistical power when each approach is used to handle missing data. By the end of the study, it should be easy to establish which methods best handle the research problem. Multivariate Imputation by Chained Equations (MICE) is recommended as the best approach for data that are missing at random (MAR).
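
To make the contrast between a traditional and a model-based imputation concrete, here is a minimal Python sketch. It is not the authors' procedure or data: the simulated variables, the missingness probabilities, and the function names (simulate_mar, mean_impute, regression_impute, pooled_mean) are all assumptions made for illustration. It also simplifies MICE considerably. With a single incomplete variable, one chained-equation step reduces to stochastic regression imputation, and a full MICE implementation would also cycle over several incomplete variables and draw the regression parameters themselves.

```python
import random
import statistics


def simulate_mar(n=2000, seed=0):
    """Simulate (x, y) pairs where y is missing more often when x is large.

    Missingness depends only on the fully observed x, which is the
    defining property of missing at random (MAR).
    """
    rng = random.Random(seed)
    xs, ys_full, ys_obs = [], [], []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        y = 2.0 + 1.5 * x + rng.gauss(0.0, 1.0)
        missing = rng.random() < (0.7 if x > 0 else 0.1)
        xs.append(x)
        ys_full.append(y)
        ys_obs.append(None if missing else y)
    return xs, ys_full, ys_obs


def mean_impute(ys_obs):
    """Traditional approach: fill every gap with the observed mean."""
    fill = statistics.fmean([y for y in ys_obs if y is not None])
    return [fill if y is None else y for y in ys_obs]


def regression_impute(xs, ys_obs, rng):
    """Fit y ~ x on complete cases, then fill gaps with prediction plus noise."""
    pairs = [(x, y) for x, y in zip(xs, ys_obs) if y is not None]
    mean_x = statistics.fmean([x for x, _ in pairs])
    mean_y = statistics.fmean([y for _, y in pairs])
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    residuals = [y - (intercept + slope * x) for x, y in pairs]
    sigma = statistics.pstdev(residuals)
    return [
        intercept + slope * x + rng.gauss(0.0, sigma) if y is None else y
        for x, y in zip(xs, ys_obs)
    ]


def pooled_mean(xs, ys_obs, m=5, seed=1):
    """Average the estimated mean of y over m independently imputed datasets."""
    rng = random.Random(seed)
    estimates = [
        statistics.fmean(regression_impute(xs, ys_obs, rng)) for _ in range(m)
    ]
    return statistics.fmean(estimates)


if __name__ == "__main__":
    xs, ys_full, ys_obs = simulate_mar()
    print("mean of complete data:     ", statistics.fmean(ys_full))
    print("mean after mean imputation:", statistics.fmean(mean_impute(ys_obs)))
    print("pooled regression estimate:", pooled_mean(xs, ys_obs))
```

Because large values of x are both more likely to lose their y and associated with larger y, the observed mean is pulled downward, and mean imputation inherits that bias. The regression-based fill uses x to correct for it, which is the intuition behind preferring chained-equation methods under MAR.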

Share and Cite:

Alruhaymi, A. and Kim, C. (2021) Study on the Missing Data Mechanisms and Imputation Methods. Open Journal of Statistics, 11, 477-492. doi: 10.4236/ojs.2021.114030.

Copyright © 2025 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.