Leveraging Convolutional Neural Network (CNN)-based Auto Encoders for Enhanced Anomaly Detection in High-Dimensional Datasets

M. Aetsam Javed; Madiha Anjum; Hassan A. Ahmed; Arshad Ali; H. M. Shahzad; Hamayun Khan; Abdulaziz M. Alshahrani

doi:10.48084/etasr.8619

Authors

M. Aetsam Javed Department of Computer Science, Faculty of Computer Science & IT, Superior University Lahore, Pakistan
Madiha Anjum School of Information Technology, Victoria University, Australia
Hassan A. Ahmed Information Systems, Cleveland State University, Ohio, USA
Arshad Ali Faculty of Computer and Information Systems, Islamic University of Madinah, Al Madinah Al Munawarah, Saudi Arabia
H. M. Shahzad Faculty of Computer Science and Information Technology, Superior University Lahore, Pakistan
Hamayun Khan Department of Computer Science, Faculty of Computer Science & IT, Superior University Lahore, Pakistan
Abdulaziz M. Alshahrani Faculty of Computer and Information Systems, Islamic University of Madinah, Al Madinah Al Munawarah, Saudi Arabia

Volume: 14 | Issue: 6 | Pages: 17894-17899 | December 2024 | https://doi.org/10.48084/etasr.8619

Received: 8 August 2024 | Revised: 29 August 2024 and 20 September 2024 | Accepted: 22 September 2024 | Online: 5 October 2024

Corresponding author: Hamayun Khan

Abstract

This study presents an Auto-Encoder Convolutional Neural Network (AECNNs) approach for anomaly detection in high-dimensional datasets. Unsupervised learning-based algorithms have a strong theoretical foundation and are widely used for anomaly detection in high-dimensional datasets, but some limitations significantly reduce their performance. This study proposes an algorithm to address these limitations. The proposed AECNN combines various convolutional layers, feature extraction, dimensionality reduction, and data preprocessing and was evaluated using accuracy, precision, recall, and F1-score. The performance of the proposed model was evaluated using a large real benchmark dataset. The proposed CNN-based autoencoder distinguished anomalies with an AUC score of 0.83 and remarkable accuracy, precision, recall, and F1 score.

Keywords:

autoencoders, anomaly detection, high-dimensional data, machine learning, data analysis, model evaluation, Convolutional Neural Networks (CNNs), NSL-KDD, UNSW-NB15, MSE

Downloads

Download data is not yet available.

References

M. I. H. Okfie and S. Mishra, "Anomaly Detection in IIoT Transactions using Machine Learning: A Lightweight Blockchain-based Approach," Engineering, Technology & Applied Science Research, vol. 14, no. 3, pp. 14645–14653, Jun. 2024.

P. More and P. Mishra, "Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection," Engineering, Technology & Applied Science Research, vol. 10, no. 5, pp. 6270–6275, Oct. 2020.

V. Chandola, A. Banerjee, and V. Kumar, "Anomaly detection: A survey," ACM Computing Surveys, vol. 41, no. 3, Apr. 2009, Art. no. 15.

V. Hodge and J. Austin, "A Survey of Outlier Detection Methodologies," Artificial Intelligence Review, vol. 22, no. 2, pp. 85–126, Oct. 2004.

M. H. Bhuyan, D. K. Bhattacharyya, and J. K. Kalita, "Network Anomaly Detection: Methods, Systems and Tools," IEEE Communications Surveys & Tutorials, vol. 16, no. 1, pp. 303–336, 2014.

C. Zhou and R. C. Paffenroth, "Anomaly Detection with Robust Deep Autoencoders," in Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, Canada, Aug. 2017, pp. 665–674.

C. Baur, B. Wiestler, S. Albarqouni, and N. Navab, "Deep Autoencoding Models for Unsupervised Anomaly Segmentation in Brain MR Images," in Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, Granada, Spain, 2019, pp. 161–169.

M. Sakurada and T. Yairi, "Anomaly Detection Using Autoencoders with Nonlinear Dimensionality Reduction," in Proceedings of the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data Analysis, Gold Coast, Australia, Dec. 2014, pp. 4–11.

J. An and S. Cho, "Variational autoencoder based anomaly detection using reconstruction probability," SNU Data Mining Center, Special Lecture on IE, 2015.

G. E. Hinton and R. R. Salakhutdinov, "Reducing the Dimensionality of Data with Neural Networks," Science, vol. 313, no. 5786, pp. 504–507, Jul. 2006.

M. H. H. Khairi, S. H. S. Ariffin, N. M. A. Latiff, A. S. Abdullah, and M. K. Hassan, "A Review of Anomaly Detection Techniques and Distributed Denial of Service (DDoS) on Software Defined Network (SDN)," Engineering, Technology & Applied Science Research, vol. 8, no. 2, pp. 2724–2730, Apr. 2018.

J. Masci, U. Meier, D. Cireşan, and J. Schmidhuber, "Stacked Convolutional Auto-Encoders for Hierarchical Feature Extraction," in Artificial Neural Networks and Machine Learning – ICANN 2011, Espoo, Finland, 2011, pp. 52–59.

U. Khan, K. Khan, F. Hassan, A. Siddiqui, and M. Afaq, "Towards Achieving Machine Comprehension Using Deep Learning on Non-GPU Machines," Engineering, Technology & Applied Science Research, vol. 9, no. 4, pp. 4423–4427, Aug. 2019.

P. Wagner, N. Strodthoff, R.-D. Bousseljot, W. Samek, and T. Schaeffter, "PTB-XL, a large publicly available electrocardiography dataset." PhysioNet, https://doi.org/10.13026/X4TD-X982.

D. M. Powers, "Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation," Journal of Machine Learning Technologies, vol. 1, no. 1, pp. 37–63, 2011.

Vol. 15 (2025)	Vol. 7 (2017)
Vol. 14 (2024)	Vol. 6 (2016)
Vol. 13 (2023)	Vol. 5 (2015)
Vol. 12 (2022)	Vol. 4 (2014)
Vol. 11 (2021)	Vol. 3 (2013)
Vol. 10 (2020)	Vol. 2 (2012)
Vol. 9 (2019)	Vol. 1 (2011)
Vol. 8 (2018)

Leveraging Convolutional Neural Network (CNN)-based Auto Encoders for Enhanced Anomaly Detection in High-Dimensional Datasets

Authors

Abstract

Keywords:

Downloads

References

Downloads

How to Cite

Metrics

License

Most read articles by the same author(s)

An Efficient Optimization System for Early Breast Cancer Diagnosis based on Internet of Medical Things and Deep Learning

A Machine Learning Approach to Reduce Latency in Edge Computing for IoT Devices

An Enhanced Convolutional Neural Network (CNN) based P-EDR Mechanism for Diagnosis of Diabetic Retinopathy (DR) using Machine Learning

A Deep Learning-based Architecture for Diabetes Detection, Prediction, and Classification

A Quantum Encryption Algorithm based on the Rail Fence Mechanism to Provide Data Integrity

An Enhanced Random Forest (ERF)-based Machine Learning Framework for Resampling, Prediction, and Classification of Mobile Applications using Textual Features