Komparasi Algoritma Machine Learning (Random Forest, Gradient Boosting, dan Ada Boosting) untuk Prediksi Tingkat Penyakit Alzheimer
DOI:
https://doi.org/10.55606/jutiti.v5i3.6227Keywords:
Machine Learning, Random Forest, Alzheimer, Boosting, KomparasiAbstract
Alzheimer’s disease is one of the most common forms of progressive dementia and has become a major global health challenge as the aging population continues to increase. Early detection of this disease is crucial to mitigate its social, economic, and health impacts. In this context, data-driven approaches using machine learning algorithms can be utilized to predict Alzheimer’s risk more accurately. This study aims to compare the performance of three ensemble learning algorithms—Gradient Boosting, Random Forest, and AdaBoost—in predicting the risk level of Alzheimer’s disease using the public Alzheimer’s Disease Dataset, which includes demographic, clinical, and lifestyle data. The research process involved several stages, including data preprocessing, splitting data into training and testing sets, model training using cross-validation, and performance evaluation based on accuracy, precision, recall, F1-score, and AUC metrics. The experimental results show that the Gradient Boosting algorithm achieved the best performance with an accuracy of 0.956, an F1-score of 0.956, and an AUC of 0.985, demonstrating its ability to capture complex non-linear relationships among features such as age, MMSE score, and lifestyle factors. Meanwhile, Random Forest and AdaBoost achieved competitive yet slightly lower performance. These findings indicate that ensemble boosting approaches, particularly Gradient Boosting, hold great potential for medical decision-support systems in the early detection of Alzheimer’s disease and can serve as a foundation for developing more accurate and adaptive predictive models in the future.
Downloads
References
Adler, A. I., & Painsky, A. (2021). Feature importance in gradient boosting trees with cross-validation feature selection. Entropy, 24(5), 687. https://doi.org/10.3390/e24050687
Ahmed, A. M., Hanandeh, F. A., Soliman, A. M., Gaber, A. M., Sharif, A., Al Enany, M. O., & Khattab, M. M. (2024). Early Alzheimer's disease detection with multiple machine learning models. Research Square. https://doi.org/10.21203/rs.3.rs-5656284/v1
Azwar, M. K., & Setiati, S. (2021). Modifiable risk factors for dementia in Indonesia's urban population. Acta Medica Indonesiana, 53(1).
Hong, J., Seol, Y., Lee, S., Yoon, J., Lee, J., Park, K.-S., & Ha, J.-W. (2024). Prediction of cognitive impairment using sleep lifelog data and LSTM model. Mathematics, 12(3208). https://doi.org/10.3390/math12203208
Javeed, A., Dallora, A. L., Berglund, J. S., Ali, A., Ali, L., & Anderberg, P. (2023). Machine learning for dementia prediction: A systematic review and future research directions. Journal of Medical Systems, 47(17). https://doi.org/10.1007/s10916-023-01906-7
Kamatham, P. T., Shukla, R., Khatri, D. K., & Vora, L. K. (2024). Pathogenesis, diagnostics, and therapeutics for Alzheimer's disease: Breaking the memory barrier. Ageing Research Reviews. https://doi.org/10.1016/j.arr.2024.102481
Kharoua, R. E. (2024). Alzheimer's disease dataset. Kaggle. https://www.kaggle.com/datasets/rabieelkharoua/alzheimers-disease-dataset
Li, W., Zeng, L., Yuan, S., Shang, Y., Zhuang, W., Chen, Z., & Lyu, J. (2023). Machine learning for the prediction of cognitive impairment in older adults. Frontiers in Aging Neuroscience, 15. https://doi.org/10.3389/fnins.2023.1158141
Moon, C., Schneider, A., Cho, Y.-E., Zhang, M., Dang, H., & Vu, K. (2024). Sleep duration, sleep efficiency, and amyloid β among cognitively healthy later-life adults: A systematic review and meta-analysis. BMC Geriatrics, 24(408). https://doi.org/10.1186/s12877-024-05010-4
Musto, H., Stamate, D., Pu, I., & Stahl, D. (2023). A machine learning approach for predicting deterioration in Alzheimer's disease.
Nurhalizah, R. S., Setiawan, R. A., & Ardianto, R. (2024). Comparison of machine learning algorithms for Alzheimer's risk classification. International Conference on Health and Biological Science.
Orlunwo, P. O., & Onuodu, F. E. (2024). Comparison of ensemble techniques for early prediction of Alzheimer disease. Research Square. https://doi.org/10.21203/rs.3.rs-5644910/v1
Rekawati, E., Eriska, W., Rachmawati, U., Kusuma Wati, D. N., Sahar, J., Andriyanto, A., Wang, J.-J., Susanty, S., & Hasan, F. (2025). Cognitive function and its determinants in elderly Indonesians residing in long-term care: Insights from a cross-sectional study. F1000Research, 13(1384). https://doi.org/10.12688/f1000research.158490.2
Santoso, L., & Priyadi, P. (2024). Comparative study of feature engineering techniques for predictive data analytics. Journal of Technology Informatics and Engineering (JTIE), 3(2). https://doi.org/10.51903/jtie.v3i2.225
Sarica, A., Cerasa, A., & Quattrone, A. (2017). Random forest algorithm for the classification of neuroimaging data in Alzheimer's disease: A systematic review. Frontiers in Aging Neuroscience, 9(239). https://doi.org/10.3389/fnagi.2017.00329
Schliep, K. C., Thornhill, J., Tschanz, J. T., Facelli, J. C., Østbye, T., Sorweid, M. K., Smith, K. R., Varner, M., Boyce, R. D., Cliatt Brown, C. J., Meeks, H., & Abdelrahman, S. (2024). Predicting the onset of Alzheimer's disease and related dementia using electronic health records: Findings from the Cache County Study on Memory in Aging (1995–2008). BMC Medical Informatics and Decision Making, 34(316). https://doi.org/10.1186/s12911-024-02728-4
Sun, W., Zhuo, S., Wu, H., & Cai, X. (2023). Association between coronary heart disease, heart failure, and risk of Alzheimer's disease: A systematic review and meta-analysis. Annals of Indian Academy of Neurology, 26(6). https://doi.org/10.4103/aian.aian_361_23
Tarigan, T. E., Susanti, E., Siami, M. I., Arfiani, I., Permana, A. A. J., & Raharja, I. M. S. (2023). Performance metrics of AdaBoost and Random Forest in multi-class eye disease identification: An imbalanced dataset approach. International Journal of Artificial Intelligence in Medical Issues, 1(2). https://doi.org/10.56705/ijaimi.v1i2.98
Velazquez, M., & Lee, Y. (2021). Random forest model for feature-based Alzheimer's disease conversion prediction from early mild cognitive impairment subjects. PLOS ONE. https://doi.org/10.1371/journal.pone.0244773
World Health Organization. (2022). Global Dementia Observatory (GDO). https://www.who.int/data/gho/data/themes/global-dementia-observatory-gdo
Ye, E. M., Sun, H., Krishnamurthy, P. V., Adra, N., Ganglberger, W., Thomas, R. J., Lam, A. D., & Westover, M. B. (2023). Dementia detection from brain activity during sleep. Sleep Research Society, 1(11). https://doi.org/10.1093/sleep/zsac286
Yi, F., Yang, H., Chen, D., Qin, Y., Han, H., Cui, J., Bai, W., Ma, Y., Zhang, R., & Yu, H. (2023). XGBoost-SHAP-based interpretable diagnostic framework for Alzheimer's disease. BMC Medical Informatics and Decision Making, 23(137). https://doi.org/10.1186/s12911-023-02238-9
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Jurnal Teknik Informatika dan Teknologi Informasi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.




