A Comparison of the Diagnosis Performance of Machine Learning Algorithms on the Breast Cancer Wisconsin Dataset
Received: 29 July 2025 | Revised: 18 August 2025 and 8 September 2025 | Accepted: 9 September 2025 | Online: 20 November 2025
Corresponding author: Ponnuru Sowjanya
Abstract
Breast cancer survival rate can be dramatically affected if the disease is diagnosed early and the stage is classified accurately. Model-driven or statistical-driven Machine Learning (ML) techniques are powerful and reliable methods of diagnosing breast cancer, with the ability to utilize, sort, and detect patterns in data, and to handle immense databases. This research work considers the Breast Cancer Wisconsin (BCW) dataset to train several ML models (Random Forest, Support Vector Machines, Gradient Boosting, Logistic Regression, k-Nearest Neighbors) to classify images as either benign or malignant and compares the results. The followed methodology involves data preprocessing and feature extraction. All tested algorithms performed satisfactorily with Random Forest surpassing the others.
Keywords:
breast cancer, machine learning, diagnosis, classification, Breast Cancer Wisconsin DatasetDownloads
References
A. Bekkouche, M. Merzoug, M. Hadjila, and W. Ferhi, "Towards Early Breast Cancer Detection: A Deep Learning Approach," Engineering, Technology & Applied Science Research, vol. 14, no. 5, pp. 17517–17523, Oct. 2024. DOI: https://doi.org/10.48084/etasr.8634
R. Dwivedi, K. Ramdev, Er. M. Jain, and N. Agrawal, "Classification of Breast Cancer Diagnosis Using Machine Learning Algorithms," in 2023 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, India, Jan. 2023, pp. 1–8. DOI: https://doi.org/10.1109/ICCCI56745.2023.10128478
R. Rekha and K. L. Vinoci, "Wisconsin Breast Cancer Detection Using L1 Logistic Regression," in 2023 International Conference on Intelligent Systems for Communication, IoT and Security (ICISCoIS), Coimbatore, India, Oct. 2023, pp. 166–169. DOI: https://doi.org/10.1109/ICISCoIS56541.2023.10100387
A. Singh and K. S. Kaswan, "Breast Cancer Diagnosis using Soft Voting Classifier Approach," in 2024 International Conference on Emerging Innovations and Advanced Computing (INNOCOMP), Sonipat, India, Feb. 2024, pp. 292–297. DOI: https://doi.org/10.1109/INNOCOMP63224.2024.00055
K. G. Devi, K. Balasubramanian, and L. A. Ngoc, Eds., Machine Learning and Deep Learning Techniques for Medical Science. Boca Raton, FL, USA: CRC Press, 2022. DOI: https://doi.org/10.1201/9781003217497
N. N. Caleb, S. Zwalnan, and C. Pahalson, "Breast Cancer Diagnosis using Machine Learning Approach," International Journal of Advanced Research in Science, Communication and Technology, vol. 8, no. 1, pp. 459–466, Aug. 2021. DOI: https://doi.org/10.48175/IJARSCT-1880
Jamal, J. H. Antor, R. Kumar, and P. Rani, "Breast Cancer Prediction Using Machine Learning Classifiers," in 2022 5th International Conference on Advances in Science and Technology (ICAST), Mumbai, India, Sep. 2022, pp. 456–459. DOI: https://doi.org/10.1109/ICAST55766.2022.10039656
S. M. Shaaban, M. Nawaz, Y. Said, and M. Barr, "An Efficient Breast Cancer Segmentation System based on Deep Learning Techniques," Engineering, Technology & Applied Science Research, vol. 13, no. 6, pp. 12415–12422, Dec. 2023. DOI: https://doi.org/10.48084/etasr.6518
N. Behar and M. Shrivastava, "A Novel Model for Breast Cancer Detection and Classification," Engineering, Technology & Applied Science Research, vol. 12, no. 6, pp. 9496–9502, Dec. 2022. DOI: https://doi.org/10.48084/etasr.5115
W. Street, W. Wolberg, and O. Mangasarian, "Breast Cancer Wisconsin (Diagnostic)." UCI Machine Learning Repository, 1993.
Downloads
How to Cite
License
Copyright (c) 2025 Sreerama Murty Maturi, C. Dastagiraiah, Ponnuru Sowjanya, Chanumolu Kiran Kumar, J. S. V. R. S. Sastry

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.
