A Comparative Evaluation of Machine Learning Architectures for Early Multi-Disease Prediction and Clinical Diagnostics

Authors

  • Kajal Kumari
  • Kazi Minhazul Islam
  • Sonali Lakra

Keywords:

Classification, Data mining, Disease prediction, Health care, Machine learning, Neural network, Random forest, SVM

Abstract

Early disease detection is very important in modern healthcare because many serious illnesses are diagnosed only after they become severe. Diseases such as heart disease, diabetes, cancer, and kidney disease are responsible for millions of deaths every year across the world. Traditional medical diagnosis methods are often time-consuming, expensive, and sometimes unable to identify diseases at an early stage. Because of this, researchers are now focusing on machine learning techniques to help doctors make faster, more accurate decisions. This study examines the role of machine learning in predicting diseases using patient health data. Different machine learning models, including Decision Tree, Random Forest, Support Vector Machine, K-Nearest Neighbor, Logistic Regression, Gradient Boosting, and Neural Network, were applied and compared. Publicly available healthcare datasets were used for model training and testing. Before model development, data preprocessing methods such as handling missing values, normalization, and feature selection were performed to improve prediction quality. The models were evaluated using accuracy, precision, recall, and F1-score measures. The experimental results showed that the Neural Network model achieved the highest prediction accuracy of 93.1%, while Gradient Boosting and Random Forest also produced strong results with accuracies of 90.7% and 89.4%, respectively. The study demonstrates that machine learning can become an effective support system for early disease diagnosis and may help healthcare professionals provide timely treatment and reduce mortality rates.

References

H. Kaur and S. K. Wasan, “Empirical Study on Applications of Data Mining in Healthcare,” Journal of Computer Science, vol. 2, no. 2, pp. 194–200, 2006.

T. Fawcett, “An Introduction to ROC Analysis,” Pattern Recognition Letters, vol. 27, no. 8, pp. 861–874, 2006,

J. Soni, U. Ansari, D. Sharma, and S. Soni, “Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction,” International Journal of Computer Applications, vol. 17, no. 8, pp. 43–48, 2011

L. Wang, “Expert Systems with Applications | ScienceDirect.com,” Sciencedirect.com, 2019.

T. Liu et al., “Machine Learning Based Prediction Models for Cardiovascular Disease Risk Using Electronic Health Records Data: Systematic Review and Meta-analysis,” European Heart Journal Digital Health, vol. 6, no. 1, pp. 7–22, 2025.

“Kaggle, “Pima Indians Diabetes Database,” www.kaggle.com, 2016.

S. Palaniappan and R. Awang, “Intelligent Heart Disease Prediction System Using Data Mining Techniques,” In Proceedings of the IEEE/ACS International Conference on Computer Systems and Applications), 2008, pp. 108–115.

A. Rajkomar, J. Dean, and I. Kohane, “Machine Learning in Medicine,” New England Journal of Medicine, vol. 380, no. 14, pp. 1347–1358, 2019.

D. Sisodia and D. S. Sisodia, “Prediction of Diabetes Using Classification Algorithms,” Procedia Computer Science, vol. 132, pp. 1578–1585, 2018.

T. Davenport and R. Kalakota, “The Potential for Artificial Intelligence in Healthcare,” Future Healthcare Journal, vol. 6, no. 2, pp. 94–98, 2019.

A. Al-Nafjan, “Artificial Intelligence in Predictive Healthcare: A Systematic Review,” Journal of Clinical Medicine, vol. 14, no. 19, p. 6752, 2025.

M. Khalifa and M. Albadawy, “Artificial intelligence for clinical prediction: Exploring key domains and essential functions,” Computer Methods and Programs in Biomedicine Update, vol. 5, Mar. 2024.

P. K. Anooj, “Clinical Decision Support System: Risk Level Prediction of Heart Disease Using Weighted Fuzzy Rules,” Journal of King Saud University – Computer and Information Sciences, vol. 24, no. 1, pp. 27–40, 2012.

“Heart Disease UCI Dataset,” UCI Machine Learning Repository.

Luukka, “Feature Selection Using Fuzzy Entropy Measures with Similarity Classifier,” Expert Systems with Applications, vol. 38, no. 4, pp. 4600–4607, 2011.

R. Islam et al., “A Comprehensive Review for Chronic Disease Prediction Using Machine Learning Techniques,” International Journal of Retina and Vitreous, vol. 10, no. 1, p. 150, 2024.

J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed. Burlington, MA, USA: Morgan Kaufmann, 2011.

R. R. Chandan et al., “Reviewing the Impact of Machine Learning on Disease Diagnosis and Prognosis: A Comprehensive Analysis,” Current Medical Imaging, 2024.

L. Breiman, “Random Forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001.

C. Cortes and V. Vapnik, “Support Vector Networks,” Machine Learning, vol. 20, no. 3, pp. 273–297, 1995.

N. H. Alhumaidi et al., “The Use of Machine Learning for Analyzing Real-World Data in Disease Prediction and Management,” JMIR Medical Informatics, vol. 13, p. e68898, 2025.

R. Miotto, F. Wang, S. Wang, X. Jiang, and J. T. Dudley, “Deep Learning for Healthcare,” Briefings in Bioinformatics, vol. 19, no. 6, pp. 1236–1246, 2018.

J. H. Friedman, “Greedy Function Approximation: A Gradient Boosting Machine,” Annals of Statistics, vol. 29, no. 5, pp. 1189–1232, 2001.

T. Cover and P. Hart, “Nearest Neighbour Pattern Classification,” IEEE Transactions on Information Theory, vol. 13, no. 1, pp. 21–27, 1967.

S. Yadav, H. Sehrawat, Y. Singh, and V. Jaglan, “Machine Learning Approaches for Disease Prediction: A Review,” in Proceedings IEEE World Conference on Applied Intelligence and Computing (AIC), 2022, pp. 682–687.

Z. Ahmed, K. Mohamed, S. Zeeshan, and X. Dong, “Artificial Intelligence with Multi-functional Machine Learning Platform Development for Better Healthcare and Precision Medicine,” Database, vol. 2024, p. baae011, 2024.

R. Kavita, “A Study of Cross-Validation and Bootstrap for Accuracy Estimation,” in Proc. International Joint Conference on Artificial Intelligence (IJCAI), vol. 14, no. 2, pp. 1137–1145, 1995.

S. B. Priya and R. A. Uthra, “Heart Disease Prediction Using Machine Learning Techniques,” International Journal of Advanced Science and Technology, vol. 29, no. 3, pp. 3657–3664, 2020

Y. Khourdifi and M. Bahaj, “Heart Disease Prediction and Classification Using Machine Learning Algorithms,” Journal of Engineering Science and Technology Review, vol. 12, no. 1, pp. 17–24, 2019.

D. Delen, G. Walker, and A. Kadam, “Predicting Breast Cancer Survivability: A Comparison of Three Data Mining Methods,” Artificial Intelligence in Medicine, vol. 34, no. 2, pp. 113–127, 2005.

P. J. Raihan and S. M. S. Islam, “Prediction of Heart Disease Using Logistic Regression,” in IEEE WIECON-ECE, 2016, pp. 1–5.

A. Esteva, B. Kuprel, R. A. Novoa, et al., “Dermatologist-Level Classification of Skin Cancer with Deep Neural Networks,” Nature, vol. 542, pp. 115–118, 2017.

Z. Obermeyer and E. J. Emanuel, “Predicting the Future: Big Data, Machine Learning, and Clinical Medicine,” New England Journal of Medicine, vol. 375, no. 13, pp. 1216–1219, 2016.

E. J. Topol, “High-Performance Medicine: The Convergence of Human and Artificial Intelligence,” Nature Medicine, vol. 25, no. 1, pp. 44–56, 2019.

B. J. Erickson, P. Korfiatis, Z. Akkus, and T. L. Kline, “Machine Learning for Medical Imaging,” Radiographics, vol. 37, no. 2, pp. 505–515, 2017.

Z. Obermeyer et al., “Dissecting Racial Bias in an Algorithm Used to Manage Health,” Science, vol. 366, no. 6464, pp. 447–453, 2019.

D. Ravi, C. Wong, F. Deligianni, et al., “Deep Learning for Health Informatics,” IEEE Journal of Biomedical and Health Informatics, vol. 21, no. 1, pp. 4–21, 2017.

G. Litjens, T. Kooi, B. E. Bejnordi, et al., “A Survey on Deep Learning in Medical Image Analysis,” Medical Image Analysis, vol. 42, pp. 60–88, 2017.

D. S. W. Ting, L. R. Pasquale, L. Peng, et al., “Artificial Intelligence and Deep Learning in Ophthalmology,” British Journal of Ophthalmology, vol. 103, no. 2, pp. 167–175, 2019.

K. H. Yu, A. L. Beam, and I. S. Kohane, “Artificial Intelligence in Healthcare,” Nature Biomedical Engineering, vol. 2, no. 10, pp. 719–731, 2018.

A. L. Beam and I. S. Kohane, “Big Data and Machine Learning in Health Care,” JAMA, vol. 319, no. 13, pp. 1317–1318, 2018.

M. Chen, M. Decary, et al., “Artificial Intelligence in Healthcare: An Essential Guide for Health Leaders,” Healthcare Management Forum, vol. 33, no. 1, pp. 10–18, 2024.

M. P. Sendak, J. D’Arcy, S. Kashyap, et al., “A Path for Translation of Machine Learning Products into Healthcare Delivery,” EMJ Innovations, vol. 3, no. 1, pp. 67–73, 2019.

C. Krittanawong, K. W. Johnson, R. S. Rosenson, et al., “Deep Learning for Cardiovascular Medicine: A Practical Primer,” European Heart Journal, vol. 40, no. 25, pp. 2058–2073, 2019.

R. C. Deo, “Machine Learning in Medicine,” Circulation, vol. 132, no. 20, pp. 1920–1930, 2015.

J. A. M. Sidey-Gibbons and C. J. Sidey-Gibbons, “Machine Learning in Medicine: A Practical Introduction,” BMC Medical Research Methodology, vol. 19, no. 1, p. 64, 2019.

G. S. Handelman, H. K. Kok, R. V. Chandra, et al., “Peering into the Black Box of Artificial Intelligence: Evaluation Metrics of Machine Learning Methods,” AJR American Journal of Roentgenology, vol. 212, no. 1, pp. 38–43, 2019.

M. Komorowski, L. A. Celi, O. Badawi, et al., “The Artificial Intelligence Clinician Learns Optimal Treatment Strategies for Sepsis in Intensive Care,” Nature Medicine, vol. 24, no. 11, pp. 1716–1720, 2018.

K. P. Murphy, Machine Learning: A Probabilistic Perspective. Cambridge, MA, USA: MIT Press, 2012.

F. Tajidini and M. J. Kheiri, “Recent Advancement in Disease Diagnostic Using Machine Learning: Systematic Survey of Decades, Comparisons, and Challenges,” Arxiv Preprint Arxiv:2308.01319, 2023.

Published

2026-06-16

How to Cite

Kajal Kumari, Kazi Minhazul Islam, & Sonali Lakra. (2026). A Comparative Evaluation of Machine Learning Architectures for Early Multi-Disease Prediction and Clinical Diagnostics. Journal of Data Mining and Management, 11(2), 12–21. Retrieved from https://www.matjournals.net/engineering/index.php/JoDMM/article/view/3720

Issue

Section

Articles