An Improved Auto Categorical PSO with ML for Heart Disease Prediction
Received: 17 February 2022 | Revised: 4 March 2022 | Accepted: 27 March 2022 | Online: 4 April 2022
Corresponding author: A. K. Dubey
Abstract
Cardiovascular or heart diseases consist a global major health concern. Cardiovascular diseases have the highest mortality rate worldwide, and the death rate increases with age, but an accurate prognosis at an early stage may increase the chances of surviving. In this paper, a combined approach, based on Machine Learning (ML) with an optimization method for the prediction of heart diseases is proposed. For this, the Improved Auto Categorical Particle Swarm Optimization (IACPSO) method was utilized to pick an optimum set of features, while ML methods were used for data categorization. Three heart disease datasets were taken from the UCI ML library for testing: Cleveland, Statlog, and Hungarian. The proposed model was assessed for different performance parameters. The results indicated that, with 98% accuracy, Logistic Regression (LR) and Support Vector Machine by Grid Search (SVMGS) performed better for the Statlog, SVMGS outperformed on the Cleveland, while the LR, Random Forest (RF), Support Vector Machine (SVM), and SVMGS performed better with 97% accuracy on the Hungarian dataset. The outcomes were improved by 3 to 33% in terms of performance parameters when ML was applied with IACPSO.
Keywords:
SVMGS, IACPSO, KNN, LRDownloads
References
A. L. Bui, T. B. Horwich, and G. C. Fonarow, "Epidemiology and risk profile of heart failure," Nature Reviews Cardiology, vol. 8, no. 1, pp. 30–41, Jan. 2011. DOI: https://doi.org/10.1038/nrcardio.2010.165
D. Prabhakaran, P. Jeemon, and A. Roy, "Cardiovascular Diseases in India," Circulation, vol. 133, no. 16, pp. 1605–1620, Apr. 2016. DOI: https://doi.org/10.1161/CIRCULATIONAHA.114.008729
"Alarming Statistics from India - Neo CarDiabCare Heartlicare..." http://neocardiabcare.com/alarming-statistics-india.htm (accessed Mar. 31, 2022).
E. Wilkins et al., European Cardiovascular Disease Statistics. Brussels, Belgium: European Heart Network, 2017.
H. Ouyang, "Africa’s Top Health Challenge: Cardiovascular Disease," The Atlantic, Oct. 30, 2014.
H. Kahramanli and N. Allahverdi, "Mining Classification Rules for Liver Disorders," International Journal of Mathematics and Computers in Simulation, vol. 3, no. 1, pp. 9–19, 2009.
M. Durairaj and N. Ramasamy, "A comparison of the perceptive approaches for preprocessing the data set for predicting fertility success rate," International Journal of Control Theory and Applications, vol. 9, no. 27, pp. 255–260, Jan. 2016.
A. Tsanas, M. A. Little, P. E. McSharry, and L. O. Ramig, "Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson’s disease symptom severity," Journal of The Royal Society Interface, vol. 8, no. 59, pp. 842–855, Jun. 2011. DOI: https://doi.org/10.1098/rsif.2010.0456
L. A. Allen et al., "Decision Making in Advanced Heart Failure," Circulation, vol. 125, no. 15, pp. 1928–1952, Apr. 2012. DOI: https://doi.org/10.1161/CIR.0b013e31824f2173
A. K. Dubey and K. Choudhary, "A systematic review and analysis of the heart disease prediction methodology," International Journal of Advanced Computer Research, vol. 8, no. 38, pp. 240–256, 2018. DOI: https://doi.org/10.19101/IJACR.2018.837025
A. K. Dubey, K. Choudhary, and R. Sharma, "Predicting Heart Disease Based on Influential Features with Machine Learning," Intelligent Automation & Soft Computing, vol. 30, no. 3, pp. 229–243, 2021, https://www.techscience.com/iasc/v30n3/44095. DOI: https://doi.org/10.32604/iasc.2021.018382
J. Chen, H. Huang, S. Tian, and Y. Qu, "Feature selection for text classification with Naïve Bayes," Expert Systems with Applications, vol. 36, no. 3, Part 1, pp. 5432–5435, Apr. 2009. DOI: https://doi.org/10.1016/j.eswa.2008.06.054
Y. Li, T. Li, and H. Liu, "Recent advances in feature selection and its applications," Knowledge and Information Systems, vol. 53, no. 3, pp. 551–577, Dec. 2017. DOI: https://doi.org/10.1007/s10115-017-1059-8
J. Li and H. Liu, "Challenges of Feature Selection for Big Data Analytics," IEEE Intelligent Systems, vol. 32, no. 2, pp. 9–15, Nov. 2017. DOI: https://doi.org/10.1109/MIS.2017.38
S. Bharti and S. N. Singh, "Analytical study of heart disease prediction comparing with different algorithms," in International Conference on Computing, Communication & Automation, Greater Noida, India, Dec. 2015, pp. 78–82. DOI: https://doi.org/10.1109/CCAA.2015.7148347
S. Tahzeeb and S. Hasan, "A Neural Network-Based Multi-Label Classifier for Protein Function Prediction," Engineering, Technology & Applied Science Research, vol. 12, no. 1, pp. 7974–7981, Feb. 2022. DOI: https://doi.org/10.48084/etasr.4597
K. Aldriwish, "A Deep Learning Approach for Malware and Software Piracy Threat Detection," Engineering, Technology & Applied Science Research, vol. 11, no. 6, pp. 7757–7762, Dec. 2021. DOI: https://doi.org/10.48084/etasr.4412
H. Alalawi, M. Alsuwat, and H. Alhakami, "A Survey of the Application of Artifical Intellegence on COVID-19 Diagnosis and Prediction," Engineering, Technology & Applied Science Research, vol. 11, no. 6, pp. 7824–7835, Dec. 2021. DOI: https://doi.org/10.48084/etasr.4503
J. Cai, J. Luo, S. Wang, and S. Yang, "Feature selection in machine learning: A new perspective," Neurocomputing, vol. 300, pp. 70–79, Jul. 2018. DOI: https://doi.org/10.1016/j.neucom.2017.11.077
S. Sharma and K. M. Buddhiraju, "A Novel Ant Colony Optimization Based Training Subset Selection Algorithm for Hyperspectral Image Classification," in International Geoscience and Remote Sensing Symposium, Valencia, Spain, Jul. 2018, pp. 5748–5751. DOI: https://doi.org/10.1109/IGARSS.2018.8519217
J. Senthilnath, S. N. Omkar, V. Mani, N. Tejovanth, P. G. Diwakar, and A. Shenoy B., "Multi-spectral satellite image classification using Glowworm Swarm Optimization," in International Geoscience and Remote Sensing Symposium, Vancouver, BC, Canada, Jul. 2011, pp. 47–50. DOI: https://doi.org/10.1109/IGARSS.2011.6048894
Z. Xu and J. Yang, "Model selection based on particle swarm optimization for omics data classification," in 5th International Conference on Mechanical, Control and Computer Engineering, Harbin, China, Dec. 2020, pp. 1338–1341. DOI: https://doi.org/10.1109/ICMCCE51767.2020.00293
L. Demidova and I. Klyueva, "Data classification based on the hybrid versions of the particle swarm optimization algorithm," in 7th Mediterranean Conference on Embedded Computing, Budva, Montenegro, Jun. 2018, pp. 1–4. DOI: https://doi.org/10.1109/MECO.2018.8406069
Z. Lu, "Enhanced Accuracy Enabled by Particle Swarm Optimization in Classification Application," in International Conference on Artificial Intelligence and Computer Engineering, Beijing, China, Oct. 2020, pp. 146–149. DOI: https://doi.org/10.1109/ICAICE51518.2020.00034
P. Bhavani Shankar and Y. Divya Vani, "Conceptual Glance of Genetic Algorithms in the Detection of Heart Diseases," in International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies, Bhilai, India, Feb. 2021, pp. 1–4. DOI: https://doi.org/10.1109/ICAECT49130.2021.9392604
Md. T. Islam, S. R. Rafa, and Md. G. Kibria, "Early Prediction of Heart Disease Using PCA and Hybrid Genetic Algorithm with k-Means," in 23rd International Conference on Computer and Information Technology, Dhaka, Bangladesh, Dec. 2020, pp. 1–6. DOI: https://doi.org/10.1109/ICCIT51783.2020.9392655
A. T. Saputra, B. Prindo Sugiharto Putro, W. A. Saputro, and M. Muljono, "Optimization Neural Network With PCA And PSO On Heart Disease Classification," in International Seminar on Application for Technology of Information and Communication, Semarang, Indonesia, Sep. 2020, pp. 191–195. DOI: https://doi.org/10.1109/iSemantic50169.2020.9234276
S. K. Prabhakar, H. Rajaguru, and S.-W. Lee, "Metaheuristic-Based Dimensionality Reduction and Classification Analysis of PPG Signals for Interpreting Cardiovascular Disease," IEEE Access, vol. 7, pp. 165181–165206, 2019. DOI: https://doi.org/10.1109/ACCESS.2019.2950220
S. Hendra Wijaya, G. Timur Pamungkas, M. Burhanis Sulthan, and Muljono, "Improving Classifier Performance Using Particle Swarm Optimization on Heart Disease Detection," in International Seminar on Application for Technology of Information and Communication, Semarang, Indonesia, Sep. 2018, pp. 603–608. DOI: https://doi.org/10.1109/ISEMANTIC.2018.8549722
M. G. Feshki and O. S. Shijani, "Improving the heart disease diagnosis by evolutionary algorithm of PSO and Feed Forward Neural Network," in Artificial Intelligence and Robotics, Qazvin, Iran, Apr. 2016, pp. 48–53. DOI: https://doi.org/10.1109/RIOS.2016.7529489
M. A. Jabbar, B. L. Deekshatulu, and P. Chandra, "Computational intelligence technique for early diagnosis of heart disease," in International Conference on Engineering and Technology, Coimbatore, India, Mar. 2015, pp. 1–6. DOI: https://doi.org/10.1109/ICETECH.2015.7275001
L. Ali, A. Rahman, A. Khan, M. Zhou, A. Javeed, and J. A. Khan, "An Automated Diagnostic System for Heart Disease Prediction Based on chi^2 Statistical Model and Optimally Configured Deep Neural Network," IEEE Access, vol. 7, pp. 34938–34945, 2019. DOI: https://doi.org/10.1109/ACCESS.2019.2904800
A. Javeed, S. Zhou, L. Yongjian, I. Qasim, A. Noor, and R. Nour, "An Intelligent Learning System Based on Random Search Algorithm and Optimized Random Forest Model for Improved Heart Disease Detection," IEEE Access, vol. 7, pp. 180235–180243, 2019. DOI: https://doi.org/10.1109/ACCESS.2019.2952107
I. Yekkala, S. Dixit, and M. A. Jabbar, "Prediction of heart disease using ensemble learning and Particle Swarm Optimization," in International Conference On Smart Technologies For Smart Nation, Bengaluru, India, Aug. 2017, pp. 691–698. DOI: https://doi.org/10.1109/SmartTechCon.2017.8358460
S. Chakraborty, S. Paul, and Md. Rahat-uz-Zaman, "Prediction of Apple Leaf Diseases Using Multiclass Support Vector Machine," in 2nd International Conference on Robotics, Electrical and Signal Processing Techniques, DHAKA, Bangladesh, Jan. 2021, pp. 147–151. DOI: https://doi.org/10.1109/ICREST51555.2021.9331132
M. Masud et al., "CROWD: Crow Search and Deep Learning based Feature Extractor for Classification of Parkinson’s Disease," ACM Transactions on Internet Technology, vol. 21, no. 3, Mar. 2021, Art. no. 77. DOI: https://doi.org/10.1145/3418500
M. Kumar, A. Kumar, and V. S. Palaparthy, "Soil Sensors-Based Prediction System for Plant Diseases Using Exploratory Data Analysis and Machine Learning," IEEE Sensors Journal, vol. 21, no. 16, pp. 17455–17468, Dec. 2021. DOI: https://doi.org/10.1109/JSEN.2020.3046295
A. Dubey, U. Gupta, and S. Jain, "Medical Data Clustering and Classification Using TLBO and Machine Learning Algorithms," Computers, Materials and Continua, vol. 70, no. 3, pp. 4523–4543, Oct. 2021. DOI: https://doi.org/10.32604/cmc.2022.021148
A. K. Dubey, U. Gupta, and S. Jain, "Computational Measure of Cancer Using Data Mining and Optimization," in International Conference on Sustainable Communication Networks and Application, Erode, India, Jul. 2019, pp. 626–632. DOI: https://doi.org/10.1007/978-3-030-34515-0_65
J. V. Rosy and S. B. R. Kumar, "Optimized encryption based elliptical curve Diffie-Hellman approach for secure heart disease prediction," International Journal of Advanced Technology and Engineering Exploration, vol. 8, no. 83, pp. 1367–1382, 2021. DOI: https://doi.org/10.19101/IJATEE.2021.874436
"UCI Machine Learning Repository." http://archive.ics.uci.edu/ml/index.php (accessed Mar. 31, 2022).
J. Nahar, T. Imam, K. S. Tickle, and Y.-P. P. Chen, "Association rule mining to detect factors which contribute to heart disease in males and females," Expert Systems with Applications, vol. 40, no. 4, pp. 1086–1093, Mar. 2013. DOI: https://doi.org/10.1016/j.eswa.2012.08.028
S. Sperandei, "Understanding logistic regression analysis," Biochemia Medica, vol. 24, no. 1, pp. 12–18, Feb. 2014. DOI: https://doi.org/10.11613/BM.2014.003
J. C. Stoltzfus, "Logistic Regression: A Brief Primer," Academic Emergency Medicine, vol. 18, no. 10, pp. 1099–1104, 2011. DOI: https://doi.org/10.1111/j.1553-2712.2011.01185.x
X. Wu et al., "Top 10 algorithms in data mining," Knowledge and Information Systems, vol. 14, no. 1, pp. 1–37, Jan. 2008. DOI: https://doi.org/10.1007/s10115-007-0114-2
P. C. Austin, J. V. Tu, J. E. Ho, D. Levy, and D. S. Lee, "Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes," Journal of Clinical Epidemiology, vol. 66, no. 4, pp. 398–407, Apr. 2013. DOI: https://doi.org/10.1016/j.jclinepi.2012.11.008
L. Yang, Distance Metric Learning: A Comprehensive Survey. Michigan, MI, USA: Michigan State University, 2006.
C. M. Bishop, Pattern Recognition and Machine Learning. New York, NY, USA: Springer, 2006.
S.-W. Lin, K.-C. Ying, S.-C. Chen, and Z.-J. Lee, "Particle swarm optimization for parameter determination and feature selection of support vector machines," Expert Systems with Applications, vol. 35, no. 4, pp. 1817–1824, Nov. 2008. DOI: https://doi.org/10.1016/j.eswa.2007.08.088
J. F. Easton, C. R. Stephens, and M. Angelova, "Risk factors and prediction of very short term versus short/intermediate term post-stroke mortality: A data mining approach," Computers in Biology and Medicine, vol. 54, pp. 199–210, Nov. 2014. DOI: https://doi.org/10.1016/j.compbiomed.2014.09.003
J. Kennedy and R. Eberhart, "Particle swarm optimization," in International Conference on Neural Networks, Perth, WA, Australia, Dec. 1995, vol. 4, pp. 1942–1948.
W. Y. Dong and R. R. Zhang, "Order-3 stability analysis of particle swarm optimization," Information Sciences, vol. 503, pp. 508–520, Nov. 2019. DOI: https://doi.org/10.1016/j.ins.2019.07.020
M. R. Bonyadi and Z. Michalewicz, "Analysis of Stability, Local Convergence, and Transformation Sensitivity of a Variant of the Particle Swarm Optimization Algorithm," IEEE Transactions on Evolutionary Computation, vol. 20, no. 3, pp. 370–385, Jun. 2016. DOI: https://doi.org/10.1109/TEVC.2015.2460753
S. Ahmed, M. Mafarja, H. Faris, and I. Aljarah, "Feature Selection Using Salp Swarm Algorithm with Chaos," in 2nd International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence, Phuket, Thailand, Mar. 2018, pp. 65–69. DOI: https://doi.org/10.1145/3206185.3206198
E. Emary, H. M. Zawbaa, and A. E. Hassanien, "Binary grey wolf optimization approaches for feature selection," Neurocomputing, vol. 172, pp. 371–381, Jan. 2016. DOI: https://doi.org/10.1016/j.neucom.2015.06.083
Downloads
How to Cite
License
Copyright (c) 2022 A. K. Dubey, A. K. Sinhal, R. Sharma
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.