Intelligent Information Management

Volume 7, Issue 3 (May 2015)

ISSN Print: 2160-5912   ISSN Online: 2160-5920

Google-based Impact Factor: 1.6  Citations  

Comparing Data Mining Techniques in HIV Testing Prediction

HTML  XML Download Download as PDF (Size: 661KB)  PP. 153-180  
DOI: 10.4236/iim.2015.73014    5,826 Downloads   7,346 Views  Citations
Author(s)

ABSTRACT

Introduction: The present work compared the prediction power of the different data mining techniques used to develop the HIV testing prediction model. Four popular data mining algorithms (Decision tree, Naive Bayes, Neural network, logistic regression) were used to build the model that predicts whether an individual was being tested for HIV among adults in Ethiopia using EDHS 2011. The final experimentation results indicated that the decision tree (random tree algorithm) performed the best with accuracy of 96%, the decision tree induction method (J48) came out to be the second best with a classification accuracy of 79%, followed by neural network (78%). Logistic regression has also achieved the least classification accuracy of 74%. Objectives: The objective of this study is to compare the prediction power of the different data mining techniques used to develop the HIV testing prediction model. Methods: Cross-Industry Standard Process for Data Mining (CRISP-DM) was used to predict the model for HIV testing and explore association rules between HIV testing and the selected attributes. Data preprocessing was performed and missing values for the categorical variable were replaced by the modal value of the variable. Different data mining techniques were used to build the predictive model. Results: The target dataset contained 30,625 study participants. Out of which 16,515 (54%) participants were women while the rest 14,110 (46%) were men. The age of the participants in the dataset ranged from 15 to 59 years old with modal age of 15 - 19 years old. Among the study participants, 17,719 (58%) have never been tested for HIV while the rest 12,906 (42%) had been tested. Residence, educational level, wealth index, HIV related stigma, knowledge related to HIV, region, age group, risky sexual behaviour attributes, knowledge about where to test for HIV and knowledge on family planning through mass media were found to be predictors for HIV testing. Conclusion and Recommendation: The results obtained from this research reveal that data mining is crucial in extracting relevant information for the effective utilization of HIV testing services which has clinical, community and public health importance at all levels. It is vital to apply different data mining techniques for the same settings and compare the model performances (based on accuracy, sensitivity, and specificity) with each other. Furthermore, this study would also invite interested researchers to explore more on the application of data mining techniques in healthcare industry or else in related and similar settings for the future.

Share and Cite:

Hailu, T. (2015) Comparing Data Mining Techniques in HIV Testing Prediction. Intelligent Information Management, 7, 153-180. doi: 10.4236/iim.2015.73014.

Cited by

[1] HIV/AIDS predictive model using random forest based on socio-demographical, biological and behavioral data
Egyptian Informatics Journal, 2022
[2] Performance Evaluation of Classification Models for HIV/AIDS Dataset
Data Management, Analytics and Innovation, 2021
[3] Analysis of inflammation, metabolic and clinical markers in predicting fat mass in HIV-positive males using artificial neural network/Nurul Farhah Shamsuddin
2021
[4] Application of Machine Learning in Assignment of Child Delivery Service in Afghanistan
2021
[5] Use of machine learning techniques to identify HIV predictors for screening in sub-Saharan Africa
BMC medical …, 2021
[6] DATA MINING BASED CRITICAL ANALYSIS AND CLASSIFICATION OF INFECTIOUS DISEASE FOR EFFECTIVE DIAGNOSIS AND TREATMENT
2020
[7] Analisa Perbandingan Nilai Bahasa Inggris 1 dan 3 dengan Nilai Reading dan Listening TOEIC Universitas Multimedia Nusantara
2019
[8] Predictive Models of HIV/AIDS Epidemic Status using Data Mining Techniques: A Review
2019
[9] A Model for Predicting Non-adherence Among Preexposure Prophylaxis (Prep) Clients At Suba Region.
2018
[10] Application of Data Mining Technology on Surveillance Report Data of HIV/AIDS High-Risk Group in Urumqi from 2009 to 2015
2018
[11] Data Mining Usage and Applications in Health Services
2018
[12] A model for predicting non-adherence among per-exposure prophylaxis (prep) clients at Suba region.
2018
[13] A novel data-mining model for automated prediction of low birth weight
2017
[14] A Tool For Predicting Loss-to-follow-up Among People Living With Hiv At Busia Border
2017
[15] A Hybrid Based Classification and Regression Model for Predicting Diseases Outbreak in Datasets
2017
[16] A Survey and Analysis on Classification and Regression Data Mining Techniques for Diseases Outbreak Prediction in Datasets
2016
[17] BIG DATA IN HEALTH CARE REVOLUTION–A SURVEY
International Research Journal of Engineering and Technology, 2016
[18] Big Data in Health Care Revolution–A Survey‖
2016
[19] Disease Monitoring System Using CRISP Data Mining: the Case Study for Caloocan City

Copyright © 2023 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.