Ensemble Learning Approaches for Cardiovascular Disease Prediction
Received: 18 September 2025 | Revised: 4 November 2025 and 20 November 2025 | Accepted: 3 December 2025 | Online: 9 February 2026
Corresponding author: Marshima Mohd Rosli
Abstract
Cardiovascular Disease (CVD) is a primary cause of mortality worldwide, which requires precise and timely predictions to implement effective prevention and intervention strategies. Existing prediction methods, dependent on a single classifier, frequently prove inadequate due to data complexity and class imbalance, leading to incorrect classification. This study aimed to compare the performance of ensemble learning approaches for predicting CVD using clinical datasets. Six ensemble classifiers were evaluated: XGBoost and Gradient Boosting (Boosting), Random Forest and Decision Tree (Bagging), and K-Nearest-Neighbor and Logistic Regression (Stacking). Boosting classifiers achieved the highest accuracy (73.0%) and G-mean (72.0%) for CVD prediction, while the bagging classifier for Random Forest achieved similar precision in identifying high-risk patients with CVD. Boosting classifiers demonstrate strong discriminative power in differentiating between risk, as confirmed by the ROC AUC (0.80), indicating their effectiveness for reliable CVD prediction.
Keywords:
component, cardiovascular, ensemble learning, prediction, classifiersDownloads
References
D. Adhikary, S. Barman, R. Ranjan, and H. Stone, "A Systematic Review of Major Cardiovascular Risk Factors: A Growing Global Health Concern," Cureus, Oct. 2022, https://doi.org/10.7759/cureus.30119. DOI: https://doi.org/10.7759/cureus.30119
F. Rezaei et al., "Comparison of laboratory-based and non-laboratory-based WHO cardiovascular disease risk charts: a population-based study," Journal of Translational Medicine, vol. 20, no. 1, Mar. 2022, Art. no. 133. DOI: https://doi.org/10.1186/s12967-022-03336-4
C. Bernand, E. Mirand, and M. Aryun, "Coronary heart disease prediction models using machine learning and deep learning algorithms," AIP Conference Proceedings, vol. 2838, no. 1, Surakarta, Indonesia, Feb. 2024, Art. no. 070009. DOI: https://doi.org/10.1063/5.0179929
C. Venkatesh, L. Sivayamini, T. Supriya, J. V. Kumar, B. V. K. Reddy, and N. Sujaritha, "A Hybrid Model for the Detection and Classification of Cardiovascular Diseases Based on Deep Learning and Optimization Techniques," in Proceedings of the 6th International Conference on Communications and Cyber Physical Engineering, Hyderabad, India, 2024, pp. 747–762. DOI: https://doi.org/10.1007/978-981-99-7137-4_73
K. Gunasekaran, V. D. A. Kumar, and K. Jayashree, "Prediction and risk analysis of Cardio Vascular diseases in IoHT by enhanced CHIO-based Residual and dilated gated network with Attention Mechanism," Biomedical Signal Processing and Control, vol. 94, Aug. 2024, Art. no. 106222. DOI: https://doi.org/10.1016/j.bspc.2024.106222
P. Dhaka, R. Sehrawat, and P. Bhutani, "An Innovative Approach to Cardiovascular Disease Prediction: A Hybrid Deep Learning Model," Engineering, Technology & Applied Science Research, vol. 13, no. 6, pp. 12396–12403, Dec. 2023. DOI: https://doi.org/10.48084/etasr.6503
N. K. Al-Shammari et al., "Cardiac Stroke Prediction Framework using Hybrid Optimization Algorithm under DNN," Engineering, Technology & Applied Science Research, vol. 11, no. 4, pp. 7436–7441, Aug. 2021. DOI: https://doi.org/10.48084/etasr.4277
J. Azmi, M. Arif, M. T. Nafis, M. A. Alam, S. Tanweer, and G. Wang, "A systematic review on machine learning approaches for cardiovascular disease prediction using medical big data," Medical Engineering & Physics, vol. 105, July 2022, Art. no. 103825. DOI: https://doi.org/10.1016/j.medengphy.2022.103825
N. Biswas et al., "Machine Learning-Based Model to Predict Heart Disease in Early Stage Employing Different Feature Selection Techniques," BioMed Research International, vol. 2023, no. 1, 2023, Art. no. 6864343. DOI: https://doi.org/10.1155/2023/6864343
S. Subramani et al., "Cardiovascular diseases prediction by machine learning incorporation with deep learning," Frontiers in Medicine, vol. 10, Apr. 2023, Art. no. 1150933. DOI: https://doi.org/10.3389/fmed.2023.1150933
D. Sharma, N. Gotlieb, M. E. Farkouh, K. Patel, W. Xu, and M. Bhat, "Machine Learning Approach to Classify Cardiovascular Disease in Patients With Nonalcoholic Fatty Liver Disease in the UK Biobank Cohort," Journal of the American Heart Association, vol. 11, no. 1, Jan. 2022, Art. no. e022576. DOI: https://doi.org/10.1161/JAHA.121.022576
A. Khan, M. Qureshi, M. Daniyal, and K. Tawiah, "A Novel Study on Machine Learning Algorithm-Based Cardiovascular Disease Prediction," Health & Social Care in the Community, vol. 2023, pp. 1–10, Feb. 2023. DOI: https://doi.org/10.1155/2023/1406060
B. R. Devi, U. Sivaji, T. Swetha, J. Avanija, A. Suresh, and K. R. Madhavi, "Advanced Cardiovascular Disease Prediction: A Comparative Analysis of Ensemble Stacking and Deep Neural Networks," International Journal of Intelligent Systems and Applications in Engineering, vol. 12, no. 6, pp. 46–55, 2024.
I. Yekkala, S. Dixit, and M. A. Jabbar, "Prediction of heart disease using ensemble learning and Particle Swarm Optimization," in 2017 International Conference On Smart Technologies For Smart Nation (SmartTechCon), Bengaluru, India, Dec. 2017, pp. 691–698. DOI: https://doi.org/10.1109/SmartTechCon.2017.8358460
X. Y. Gao, A. A. Ali, H. S. Hassan, and E. M. Anwar, "Improving the Accuracy for Analyzing Heart Diseases Prediction Based on the Ensemble Method," Complexity, vol. 2021, no. 1, Jan. 2021, Art. no. 6663455. DOI: https://doi.org/10.1155/2021/6663455
D. Zhou, H. Qiu, L. Wang, and M. Shen, "Risk prediction of heart failure in patients with ischemic heart disease using network analytics and stacking ensemble learning," BMC Medical Informatics and Decision Making, vol. 23, no. 1, May 2023, Art. no. 99. DOI: https://doi.org/10.1186/s12911-023-02196-2
P. Gupta and D.D. Seth, "Improving the Prediction of Heart Disease Using Ensemble Learning and Feature Selection," International Journal of Advances in Soft Computing and its Applications, vol. 14, no. 2, pp. 37–40, Aug. 2022. DOI: https://doi.org/10.15849/IJASCA.220720.03
K. Shilpa and T. Adilakshmi, "Applying Ensemble Techniques of Machine Learning to Predict Heart Disease," in Proceedings of the International Conference on Cognitive and Intelligent Computing, A. Kumar, G. Ghinea, S. Merugu, and T. Hashimoto, Springer Nature Singapore, 2022, pp. 775–783. DOI: https://doi.org/10.1007/978-981-19-2350-0_73
E. Akkur, "Prediction of Cardiovascular Disease Based on Voting Ensemble Model and SHAP Analysis," Sakarya University Journal of Computer and Information Sciences, vol. 6, no. 3, pp. 226–238, Dec. 2023. DOI: https://doi.org/10.35377/saucis...1367326
N. Tabassum and T. Ahmed, "A theoretical study on classifier ensemble methods and its applications," in 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, Mar. 2016, pp. 374–378.
T. N. Rincy and R. Gupta, "Ensemble Learning Techniques and its Efficiency in Machine Learning: A Survey," in 2nd International Conference on Data, Engineering and Applications (IDEA), Bhopal, India, Feb. 2020, pp. 1–6. DOI: https://doi.org/10.1109/IDEA49133.2020.9170675
V. Chang, V. R. Bhavani, A. Q. Xu, and M. Hossain, "An artificial intelligence model for heart disease detection using machine learning algorithms," Healthcare Analytics, vol. 2, Nov. 2022, Art. no. 100016. DOI: https://doi.org/10.1016/j.health.2022.100016
A. Ogunpola, F. Saeed, S. Basurra, A. M. Albarrak, and S. N. Qasem, "Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases," Diagnostics, vol. 14, no. 2, Jan. 2024, Art. no. 144. DOI: https://doi.org/10.3390/diagnostics14020144
M. Pal, S. Parija, G. Panda, K. Dhama, and R. K. Mohapatra, "Risk prediction of cardiovascular disease using machine learning classifiers," Open Medicine, vol. 17, no. 1, pp. 1100–1113, June 2022. DOI: https://doi.org/10.1515/med-2022-0508
M. Shah, K. Gandhi, K. A. Patel, H. Kantawala, R. Patel, and A. Kothari, "Theoretical Evaluation of Ensemble Machine Learning Techniques," in 2023 5th International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India, Jan. 2023, pp. 829–837. DOI: https://doi.org/10.1109/ICSSIT55814.2023.10061139
V. J. Prakash and N. K. Karthikeyan, "Enhanced Evolutionary Feature Selection and Ensemble Method for Cardiovascular Disease Prediction," Interdisciplinary Sciences: Computational Life Sciences, vol. 13, no. 3, pp. 389–412, Sept. 2021. DOI: https://doi.org/10.1007/s12539-021-00430-x
N. V. MahaLakshmi and R. K. Rout, "An intelligence method for heart disease prediction using integrated filter-evolutionary search based feature selection and optimized ensemble classifier," Multimedia Tools and Applications, vol. 83, no. 13, pp. 39841–39865, Oct. 2023. DOI: https://doi.org/10.1007/s11042-023-16924-3
M. R. Islam, Md. Durul Hoda, Md. A. Rashid, S. Alam Suha, and M. T. Islam Miya, "Data-Driven Heart Disease Prediction by Ensemble Feature Selection and Machine Learning Techniques," in 2022 25th International Conference on Computer and Information Technology (ICCIT), Cox's Bazar, Bangladesh, Dec. 2022, pp. 575–580. DOI: https://doi.org/10.1109/ICCIT57492.2022.10054998
J. Chen, H. Huang, A. G. Cohn, D. Zhang, and M. Zhou, "Machine learning-based classification of rock discontinuity trace: SMOTE oversampling integrated with GBT ensemble learning," International Journal of Mining Science and Technology, vol. 32, no. 2, pp. 309–322, Mar. 2022. DOI: https://doi.org/10.1016/j.ijmst.2021.08.004
J. Zhang, T. Wang, W. W. Y. Ng, and W. Pedrycz, "Ensembling perturbation-based oversamplers for imbalanced datasets," Neurocomputing, vol. 479, pp. 1–11, Mar. 2022. DOI: https://doi.org/10.1016/j.neucom.2022.01.049
A. M. Sowjanya and O. Mrudula, "Effective treatment of imbalanced datasets in health care using modified SMOTE coupled with stacked deep learning algorithms," Applied Nanoscience, vol. 13, no. 3, pp. 1829–1840, Mar. 2023. DOI: https://doi.org/10.1007/s13204-021-02063-4
C. S. Tarimo, S. S. Bhuyan, Q. Li, W. Ren, M. J. Mahande, and J. Wu, "Combining Resampling Strategies and Ensemble Machine Learning Methods to Enhance Prediction of Neonates with a Low Apgar Score After Induction of Labor in Northern Tanzania," Risk Management and Healthcare Policy, vol. 14, pp. 3711–3720, Sept. 2021. DOI: https://doi.org/10.2147/RMHP.S331077
S. Ulianova, "Cardiovascular Disease dataset." Kaggle, Nov. 04, 2025, Available: https://www.kaggle.com/datasets/sulianova/cardiovascular-disease-dataset.
Downloads
How to Cite
License
Copyright (c) 2025 Ahmad Muzammil Halim, Marshima Mohd Rosli, Norzilah Musa, Nurain Ibrahim

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.
