Systematic review on missing data imputation techniques with machine learning algorithms for healthcare

Missing data is one of the most common issues encountered in data cleaning process especially when dealing with medical dataset. A real collected dataset is prone to be incomplete, inconsistent, noisy and redundant due to potential reasons such as human errors, instrumental failures, and adverse dea...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلفون الرئيسيون: Ismail, Amelia Ritahani, Zainal Abidin, Nadzurah, Maen, Mohd Khaled
التنسيق: مقال
اللغة:English
English
منشور في: Department of Electrical Engineering, Universitas Muhammadiyah Yogyakarta 2022
الموضوعات:
الوصول للمادة أونلاين:http://irep.iium.edu.my/98894/7/98894_Systematic%20review%20on%20missing%20data%20imputation%20techniques_SCOPUS.pdf
http://irep.iium.edu.my/98894/8/98894_Systematic%20review%20on%20missing%20data%20imputation%20techniques.pdf
http://irep.iium.edu.my/98894/
https://journal.umy.ac.id/index.php/jrc/article/download/13133/7111
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
الوصف
الملخص:Missing data is one of the most common issues encountered in data cleaning process especially when dealing with medical dataset. A real collected dataset is prone to be incomplete, inconsistent, noisy and redundant due to potential reasons such as human errors, instrumental failures, and adverse death. Therefore, to accurately deal with incomplete data, a sophisticated algorithm is proposed to impute those missing values. Many machine learning algorithms have been applied to impute missing data with plausible values. However, among all machine learning imputation algorithms, KNN algorithm has been widely adopted as an imputation for missing data due to its robustness and simplicity and it is also a promising method to outperform other machine learning methods. This paper provides a comprehensive review of different imputation techniques used to replace the missing data. The goal of the review paper is to bring specific attention to potential improvements to existing methods and provide readers with a better grasps of imputation technique trends.