A Comparative Analysis of Machine Learning Techniques for URL Phishing Detection

Adel Ataih Albishri; Mohamed M. Dessouky

doi:10.48084/etasr.8920

Authors

Adel Ataih Albishri Department of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah, Saudi Arabia
Mohamed M. Dessouky Department of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah, Saudi Arabia | Department of Computer Science and Engineering, Faculty of Electronic Engineering, Menoufia University, Egypt

Volume: 14 | Issue: 6 | Pages: 18495-18501 | December 2024 | https://doi.org/10.48084/etasr.8920

Received: 5 September 2024 | Revised: 9 October 2024 and 14 October 2024 | Accepted: 16 October 2024 | Online: 29 October 2024

Corresponding author: Mohamed M. Dessouky

Abstract

The growing threat of URL phishing attacks raises the need for advanced detection systems to protect digital environments. This paper explores the effectiveness of various machine learning models in classifying URLs as phishing or benign, focusing on the random forest model. Using ensemble learning, the random forest demonstrated superior accuracy and reliability compared to traditional methods, achieving consistent performance with accuracy rates between 99.93% and 99.98%. The model's performance was evaluated daily over eight days, highlighting its robustness in handling real-world scenarios. This study utilized GridSearchCV to optimize model hyperparameters, enhancing model robustness and minimizing overfitting. Future research directions include advanced feature engineering, deep learning techniques, and multimodal data integration to further improve phishing detection systems.

Keywords:

phishing attacks, phishing detection, ensemble learning, random forest

Downloads

Download data is not yet available.

References

A. Butnaru, A. Mylonas, and N. Pitropakis, "Towards Lightweight URL-Based Phishing Detection," Future Internet, vol. 13, no. 6, Jun. 2021, Art. no. 154. DOI: https://doi.org/10.3390/fi13060154

S. Srivastava and S. K. Gupta, "Phishing Detection Techniques: A Comparative Study," in 2021 9th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India, Sep. 2021, pp. 1–6. DOI: https://doi.org/10.1109/ICRITO51393.2021.9596093

F. Tchakounte, V. S. Nyassi, D. E. H. Danga, K. P. Udagepola, and M. Atemkeng, "A Game Theoretical Model for Anticipating Email Spear-Phishing Strategies," EAI Endorsed Transactions on Scalable Information Systems, vol. 8, no. 30, 2021.

M. F. Alghenaim, N. A. A. Bakar, F. Abdul Rahim, V. Z. Vanduhe, and G. Alkawsi, "Phishing Attack Types and Mitigation: A Survey," in Data Science and Emerging Technologies, Khulna, Bangladesh, 2023, pp. 131–153. DOI: https://doi.org/10.1007/978-981-99-0741-0_10

C. Balim and E. S. Gunal, "Automatic Detection of Smishing Attacks by Machine Learning Methods," in 2019 1st International Informatics and Software Engineering Conference (UBMYK), Ankara, Turkey, Nov. 2019, pp. 1–3. DOI: https://doi.org/10.1109/UBMYK48245.2019.8965429

S. Bell and P. Komisarczuk, "An Analysis of Phishing Blacklists: Google Safe Browsing, OpenPhish, and PhishTank," in Proceedings of the Australasian Computer Science Week Multiconference, Melbourne, Australia, Feb. 2020, pp. 1–11. DOI: https://doi.org/10.1145/3373017.3373020

A. K. Singh, "Malicious and Benign Webpages Dataset," Data in Brief, vol. 32, Oct. 2020, Art. no. 106304. DOI: https://doi.org/10.1016/j.dib.2020.106304

S. Madakam, R. Ramaswamy, and S. Tripathi, "Internet of Things (IoT): A Literature Review," Journal of Computer and Communications, vol. 3, no. 5, pp. 164–173, May 2015. DOI: https://doi.org/10.4236/jcc.2015.35021

A. Hannousse, "Web page phishing detection." Mendeley, Jun. 25, 2021.

A Comparative Analysis of Machine Learning Techniques for URL Phishing Detection

Authors

Abstract

Keywords:

Downloads

References

Downloads

How to Cite

Metrics

License