Ensemble Learning Approaches for Cardiovascular Disease Prediction

Authors

  • Ahmad Muzammil Halim Faculty of Computer and Mathematical Sciences, University Teknologi MARA, Shah Alam, Malaysia
  • Marshima Mohd Rosli Faculty of Computer and Mathematical Sciences, University Teknologi MARA, Shah Alam, Malaysia
  • Norzilah Musa Faculty of Computer and Mathematical Sciences, University Teknologi MARA, Shah Alam, Malaysia
  • Nurain Ibrahim Faculty of Computer and Mathematical Sciences, University Teknologi MARA, Shah Alam, Malaysia
Volume: 16 | Issue: 1 | Pages: 31632-31637 | February 2026 | https://doi.org/10.48084/etasr.14877

Abstract

Cardiovascular Disease (CVD) is a primary cause of mortality worldwide, which requires precise and timely predictions to implement effective prevention and intervention strategies. Existing prediction methods, dependent on a single classifier, frequently prove inadequate due to data complexity and class imbalance, leading to incorrect classification. This study aimed to compare the performance of ensemble learning approaches for predicting CVD using clinical datasets. Six ensemble classifiers were evaluated: XGBoost and Gradient Boosting (Boosting), Random Forest and Decision Tree (Bagging), and K-Nearest-Neighbor and Logistic Regression (Stacking). Boosting classifiers achieved the highest accuracy (73.0%) and G-mean (72.0%) for CVD prediction, while the bagging classifier for Random Forest achieved similar precision in identifying high-risk patients with CVD. Boosting classifiers demonstrate strong discriminative power in differentiating between risk, as confirmed by the ROC AUC (0.80), indicating their effectiveness for reliable CVD prediction.

Keywords:

component, cardiovascular, ensemble learning, prediction, classifiers

Downloads

Download data is not yet available.

References

D. Adhikary, S. Barman, R. Ranjan, and H. Stone, "A Systematic Review of Major Cardiovascular Risk Factors: A Growing Global Health Concern," Cureus, Oct. 2022, https://doi.org/10.7759/cureus.30119. DOI: https://doi.org/10.7759/cureus.30119

F. Rezaei et al., "Comparison of laboratory-based and non-laboratory-based WHO cardiovascular disease risk charts: a population-based study," Journal of Translational Medicine, vol. 20, no. 1, Mar. 2022, Art. no. 133. DOI: https://doi.org/10.1186/s12967-022-03336-4

C. Bernand, E. Mirand, and M. Aryun, "Coronary heart disease prediction models using machine learning and deep learning algorithms," AIP Conference Proceedings, vol. 2838, no. 1, Surakarta, Indonesia, Feb. 2024, Art. no. 070009. DOI: https://doi.org/10.1063/5.0179929

C. Venkatesh, L. Sivayamini, T. Supriya, J. V. Kumar, B. V. K. Reddy, and N. Sujaritha, "A Hybrid Model for the Detection and Classification of Cardiovascular Diseases Based on Deep Learning and Optimization Techniques," in Proceedings of the 6th International Conference on Communications and Cyber Physical Engineering, Hyderabad, India, 2024, pp. 747–762. DOI: https://doi.org/10.1007/978-981-99-7137-4_73

K. Gunasekaran, V. D. A. Kumar, and K. Jayashree, "Prediction and risk analysis of Cardio Vascular diseases in IoHT by enhanced CHIO-based Residual and dilated gated network with Attention Mechanism," Biomedical Signal Processing and Control, vol. 94, Aug. 2024, Art. no. 106222. DOI: https://doi.org/10.1016/j.bspc.2024.106222

P. Dhaka, R. Sehrawat, and P. Bhutani, "An Innovative Approach to Cardiovascular Disease Prediction: A Hybrid Deep Learning Model," Engineering, Technology & Applied Science Research, vol. 13, no. 6, pp. 12396–12403, Dec. 2023. DOI: https://doi.org/10.48084/etasr.6503

N. K. Al-Shammari et al., "Cardiac Stroke Prediction Framework using Hybrid Optimization Algorithm under DNN," Engineering, Technology & Applied Science Research, vol. 11, no. 4, pp. 7436–7441, Aug. 2021. DOI: https://doi.org/10.48084/etasr.4277

J. Azmi, M. Arif, M. T. Nafis, M. A. Alam, S. Tanweer, and G. Wang, "A systematic review on machine learning approaches for cardiovascular disease prediction using medical big data," Medical Engineering & Physics, vol. 105, July 2022, Art. no. 103825. DOI: https://doi.org/10.1016/j.medengphy.2022.103825

N. Biswas et al., "Machine Learning-Based Model to Predict Heart Disease in Early Stage Employing Different Feature Selection Techniques," BioMed Research International, vol. 2023, no. 1, 2023, Art. no. 6864343. DOI: https://doi.org/10.1155/2023/6864343

S. Subramani et al., "Cardiovascular diseases prediction by machine learning incorporation with deep learning," Frontiers in Medicine, vol. 10, Apr. 2023, Art. no. 1150933. DOI: https://doi.org/10.3389/fmed.2023.1150933

D. Sharma, N. Gotlieb, M. E. Farkouh, K. Patel, W. Xu, and M. Bhat, "Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort," Journal of the American Heart Association, vol. 11, no. 1, Jan. 2022, Art. no. e022576. DOI: https://doi.org/10.1161/JAHA.121.022576

A. Khan, M. Qureshi, M. Daniyal, and K. Tawiah, "A Novel Study on Machine Learning Algorithm-Based Cardiovascular Disease Prediction," Health & Social Care in the Community, vol. 2023, pp. 1–10, Feb. 2023. DOI: https://doi.org/10.1155/2023/1406060

B. R. Devi, U. Sivaji, T. Swetha, J. Avanija, A. Suresh, and K. R. Madhavi, "Advanced Cardiovascular Disease Prediction: A Comparative Analysis of Ensemble Stacking and Deep Neural Networks," International Journal of Intelligent Systems and Applications in Engineering, vol. 12, no. 6, pp. 46–55, 2024.

I. Yekkala, S. Dixit, and M. A. Jabbar, "Prediction of heart disease using ensemble learning and Particle Swarm Optimization," in 2017 International Conference On Smart Technologies For Smart Nation (SmartTechCon), Bengaluru, India, Dec. 2017, pp. 691–698. DOI: https://doi.org/10.1109/SmartTechCon.2017.8358460

X. Y. Gao, A. A. Ali, H. S. Hassan, and E. M. Anwar, "Improving the Accuracy for Analyzing Heart Diseases Prediction Based on the Ensemble Method," Complexity, vol. 2021, no. 1, Jan. 2021, Art. no. 6663455. DOI: https://doi.org/10.1155/2021/6663455

D. Zhou, H. Qiu, L. Wang, and M. Shen, "Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning," BMC Medical Informatics and Decision Making, vol. 23, no. 1, May 2023, Art. no. 99. DOI: https://doi.org/10.1186/s12911-023-02196-2

P. Gupta and D.D. Seth, "Improving the Prediction of Heart Disease Using Ensemble Learning and Feature Selection," International Journal of Advances in Soft Computing and its Applications, vol. 14, no. 2, pp. 37–40, Aug. 2022. DOI: https://doi.org/10.15849/IJASCA.220720.03

K. Shilpa and T. Adilakshmi, "Applying Ensemble Techniques of Machine Learning to Predict Heart Disease," in Proceedings of the International Conference on Cognitive and Intelligent Computing, A. Kumar, G. Ghinea, S. Merugu, and T. Hashimoto, Springer Nature Singapore, 2022, pp. 775–783. DOI: https://doi.org/10.1007/978-981-19-2350-0_73

E. Akkur, "Prediction of Cardiovascular Disease Based on Voting Ensemble Model and SHAP Analysis," Sakarya University Journal of Computer and Information Sciences, vol. 6, no. 3, pp. 226–238, Dec. 2023. DOI: https://doi.org/10.35377/saucis...1367326

N. Tabassum and T. Ahmed, "A theoretical study on classifier ensemble methods and its applications," in 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, Mar. 2016, pp. 374–378.

T. N. Rincy and R. Gupta, "Ensemble Learning Techniques and its Efficiency in Machine Learning: A Survey," in 2nd International Conference on Data, Engineering and Applications (IDEA), Bhopal, India, Feb. 2020, pp. 1–6. DOI: https://doi.org/10.1109/IDEA49133.2020.9170675

V. Chang, V. R. Bhavani, A. Q. Xu, and M. Hossain, "An artificial intelligence model for heart disease detection using machine learning algorithms," Healthcare Analytics, vol. 2, Nov. 2022, Art. no. 100016. DOI: https://doi.org/10.1016/j.health.2022.100016

A. Ogunpola, F. Saeed, S. Basurra, A. M. Albarrak, and S. N. Qasem, "Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases," Diagnostics, vol. 14, no. 2, Jan. 2024, Art. no. 144. DOI: https://doi.org/10.3390/diagnostics14020144

M. Pal, S. Parija, G. Panda, K. Dhama, and R. K. Mohapatra, "Risk prediction of cardiovascular disease using machine learning classifiers," Open Medicine, vol. 17, no. 1, pp. 1100–1113, June 2022. DOI: https://doi.org/10.1515/med-2022-0508

M. Shah, K. Gandhi, K. A. Patel, H. Kantawala, R. Patel, and A. Kothari, "Theoretical Evaluation of Ensemble Machine Learning Techniques," in 2023 5th International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India, Jan. 2023, pp. 829–837. DOI: https://doi.org/10.1109/ICSSIT55814.2023.10061139

V. J. Prakash and N. K. Karthikeyan, "Enhanced Evolutionary Feature Selection and Ensemble Method for Cardiovascular Disease Prediction," Interdisciplinary Sciences: Computational Life Sciences, vol. 13, no. 3, pp. 389–412, Sept. 2021. DOI: https://doi.org/10.1007/s12539-021-00430-x

N. V. MahaLakshmi and R. K. Rout, "An intelligence method for heart disease prediction using integrated filter-evolutionary search based feature selection and optimized ensemble classifier," Multimedia Tools and Applications, vol. 83, no. 13, pp. 39841–39865, Oct. 2023. DOI: https://doi.org/10.1007/s11042-023-16924-3

M. R. Islam, Md. Durul Hoda, Md. A. Rashid, S. Alam Suha, and M. T. Islam Miya, "Data-Driven Heart Disease Prediction by Ensemble Feature Selection and Machine Learning Techniques," in 2022 25th International Conference on Computer and Information Technology (ICCIT), Cox's Bazar, Bangladesh, Dec. 2022, pp. 575–580. DOI: https://doi.org/10.1109/ICCIT57492.2022.10054998

J. Chen, H. Huang, A. G. Cohn, D. Zhang, and M. Zhou, "Machine learning-based classification of rock discontinuity trace: SMOTE oversampling integrated with GBT ensemble learning," International Journal of Mining Science and Technology, vol. 32, no. 2, pp. 309–322, Mar. 2022. DOI: https://doi.org/10.1016/j.ijmst.2021.08.004

J. Zhang, T. Wang, W. W. Y. Ng, and W. Pedrycz, "Ensembling perturbation-based oversamplers for imbalanced datasets," Neurocomputing, vol. 479, pp. 1–11, Mar. 2022. DOI: https://doi.org/10.1016/j.neucom.2022.01.049

A. M. Sowjanya and O. Mrudula, "Effective treatment of imbalanced datasets in health care using modified SMOTE coupled with stacked deep learning algorithms," Applied Nanoscience, vol. 13, no. 3, pp. 1829–1840, Mar. 2023. DOI: https://doi.org/10.1007/s13204-021-02063-4

C. S. Tarimo, S. S. Bhuyan, Q. Li, W. Ren, M. J. Mahande, and J. Wu, "Combining Resampling Strategies and Ensemble Machine Learning Methods to Enhance Prediction of Neonates with a Low Apgar Score After Induction of Labor in Northern Tanzania," Risk Management and Healthcare Policy, vol. 14, pp. 3711–3720, Sept. 2021. DOI: https://doi.org/10.2147/RMHP.S331077

S. Ulianova, "Cardiovascular Disease dataset." Kaggle, Nov. 04, 2025, Available: https://www.kaggle.com/datasets/sulianova/cardiovascular-disease-dataset.

Downloads

How to Cite

[1]
A. M. Halim, M. M. Rosli, N. Musa, and N. Ibrahim, “Ensemble Learning Approaches for Cardiovascular Disease Prediction”, Eng. Technol. Appl. Sci. Res., vol. 16, no. 1, pp. 31632–31637, Feb. 2026.

Metrics

Abstract Views: 90
PDF Downloads: 41

Metrics Information