A Deep Learning Approach for Malware and Software Piracy Threat Detection
Received: 16 August 2021 | Revised: 8 September 2021 and 22 September 2021 | Accepted: 01 October 2021 | Online: 14 October 2021
Corresponding author: K. Aldriwish
Abstract
Internet of Things (IoT) -based systems need to be up to date on cybersecurity threats. The security of IoT networks is challenged by software piracy and malware attacks, and much important information can be stolen and used for cybercrimes. This paper attempts to improve IoT cybersecurity by proposing a combined model based on deep learning to detect malware and software piracy across the IoT network. The malware’s model is based on Deep Convolutional Neural Networks (DCNNs). Apart from this, TensorFlow Deep Neural Networks (TFDNNs) are introduced to detect software piracy threats according to source code plagiarism. The investigation is conducted on the Google Code Jam (GCJ) dataset. The conducted experiments prove that the classification performance achieves high accuracy of about 98%.
Keywords:
cybersecurity, malware, software piracy, deep learning, Internet of ThingsDownloads
References
J. Huang, J. Chai, and S. Cho, "Deep learning in finance and banking: A literature review and classification," Frontiers of Business Research in China, vol. 14, no. 1, Jun. 2020, Art. no. 13, https://doi.org/10.1186/s11782-020-00082-6.
M. B. Ayed, "Balanced Communication-Avoiding Support Vector Machine when Detecting Epilepsy based on EEG Signals," Engineering, Technology & Applied Science Research, vol. 10, no. 6, pp. 6462–6468, Dec. 2020, https://doi.org/10.48084/etasr.3878.
M. Ben Ayed, A. Massaoudi, and S. A. Alshaya, "Smart Recognition COVID-19 System to Predict Suspicious Persons Based on Face Features," Journal of Electrical Engineering & Technology, vol. 16, no. 3, pp. 1601–1606, May 2021, https://doi.org/10.1007/s42835-021-00671-2.
M. Ramzan, M. S. Farooq, A. Zamir, W. Akhtar, M. Ilyas, and H. U. Khan, “An Analysis of Issues for Adoption of Cloud Computing in Telecom Industries,” Engineering, Technology & Applied Science Research, vol. 8, no. 4, pp. 3157–3161, Aug. 2018, https://doi.org/10.48084/etasr.2101.
H. E. Fazazi, M. Elgarej, M. Qbadou, and K. Mansouri, “Design of an Adaptive e-Learning System based on Multi-Agent Approach and Reinforcement Learning,” Engineering, Technology & Applied Science Research, vol. 11, no. 1, pp. 6637–6644, Feb. 2021, https://doi.org/10.48084/etasr.3905.
S. S. T. Alatawi et al., “A New Model for Enhancing Student Portal Usage in Saudi Arabia Universities,” Engineering, Technology & Applied Science Research, vol. 11, no. 3, pp. 7158–7171, Jun. 2021, https://doi.org/10.48084/etasr.4132.
T. Brito, J. Queiroz, L. Piardi, L. A. Fernandes, J. Lima, and P. Leitão, "A Machine Learning Approach for Collaborative Robot Smart Manufacturing Inspection for Quality Control Systems," Procedia Manufacturing, vol. 51, pp. 11–18, Jan. 2020, https://doi.org/10.1016/j.promfg.2020.10.003.
F. Musumeci et al., "An Overview on Application of Machine Learning Techniques in Optical Networks," IEEE Communications Surveys Tutorials, vol. 21, no. 2, pp. 1383–1408, 2019, https://doi.org/10.1109/COMST.2018.2880039.
C. R. Srinivasan, B. Rajesh, P. Saikalyan, K. Premsagar, and E. S. Yadav, "A Review on the Different Types of Internet of Things (IoT)," Journal of Advanced Research in Dynamic and Control Systems, vol. Volume 11, no. 1, pp. 154–158, 2019.
Y. B. Zikria, R. Ali, M. K. Afzal, and S. W. Kim, "Next-Generation Internet of Things (IoT): Opportunities, Challenges, and Solutions," Sensors, vol. 21, no. 4, Jan. 2021, Art. no. 1174, https://doi.org/10.3390/s21041174.
E. B. Karbab, M. Debbabi, A. Derhab, and D. Mouheb, "Android Malware Detection using Deep Learning on API Method Sequences," arXiv:1712.08996 [cs], Dec. 2017, Accessed: Oct. 07, 2021. [Online]. Available: http://arxiv.org/abs/1712.08996.
F. Ullah, J. Wang, M. Farhan, M. Habib, and S. Khalid, "Software plagiarism detection in multiprogramming languages using machine learning approach," Concurrency and Computation: Practice and Experience, vol. 33, no. 4, 2021, Art. no. e5000, https://doi.org/10.1002/cpe.5000.
A. Orgah, A. Case, and G. Richard, "MemForC: Memory Forensics Corpus Creation for Malware Analysis," in Proceedings of the 16th International Conference on Cyber Warfare and Security, Jan. 2021.
V. Raja, "Introduction to Reverse Engineering," in Reverse Engineering: An Industrial Perspective, V. Raja and K. J. Fernandes, Eds. London, UK: Springer, 2008, pp. 1–9.
H. Lim, H. Park, S. Choi, and T. Han, "A method for detecting the theft of Java programs through analysis of the control flow information," Information and Software Technology, vol. 51, no. 9, pp. 1338–1350, Sep. 2009, https://doi.org/10.1016/j.infsof.2009.04.011.
J. Yasaswi, S. Kailash, A. Chilupuri, S. Purini, and C. V. Jawahar, "Unsupervised Learning Based Approach for Plagiarism Detection in Programming Assignments," in Proceedings of the 10th Innovations in Software Engineering Conference, Feb. 2017, pp. 117–121, https://doi.org/10.1145/3021460.3021473.
V. Kashyap, D. B. Brown, B. Liblit, D. Melski, and T. Reps, "Source Forager: A Search Engine for Similar Source Code," arXiv:1706.02769 [cs], Jun. 2017, Accessed: Oct. 07, 2021. [Online]. Available: http://arxiv.org/abs/1706.02769.
F. Zhang, D. Wu, P. Liu, and S. Zhu, "Program Logic Based Software Plagiarism Detection," in 2014 IEEE 25th International Symposium on Software Reliability Engineering, Naples, Italy, Nov. 2014, pp. 66–77, https://doi.org/10.1109/ISSRE.2014.18.
G. Cosma and M. Joy, "An Approach to Source-Code Plagiarism Detection and Investigation Using Latent Semantic Analysis," IEEE Transactions on Computers, vol. 61, no. 3, pp. 379–394, Mar. 2012, https://doi.org/10.1109/TC.2011.223.
J.-W. Son, T.-G. Noh, H.-J. Song, and S.-B. Park, "An application for plagiarized source code detection based on a parse tree kernel," Engineering Applications of Artificial Intelligence, vol. 26, no. 8, pp. 1911–1918, Sep. 2013, https://doi.org/10.1016/j.engappai.2013.06.007.
H. Cheers, Y. Lin, and S. P. Smith, "Academic Source Code Plagiarism Detection by Measuring Program Behavioral Similarity," IEEE Access, vol. 9, pp. 50391–50412, 2021, https://doi.org/10.1109/ACCESS.2021.3069367.
T. Foltýnek, R. Všianský, N. Meuschke, D. Dlabolová, and B. Gipp, "Cross-Language Source Code Plagiarism Detection using Explicit Semantic Analysis and Scored Greedy String Tilling," in Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, New York, NY, USA, Aug. 2020, pp. 523–524, https://doi.org/10.1145/3383583.3398594.
M. Siddiqui and M. C. Wang, "Detecting Internet Worms Using Data Mining Techniques," Journal of Systemics, Cybernetics and Informatics, vol. 6, no. 6, pp. 48–53, 2008.
B. Kang, S. Y. Yerima, K. Mclaughlin, and S. Sezer, "N-opcode analysis for android malware classification and categorization," in 2016 International Conference On Cyber Security And Protection Of Digital Services (Cyber Security), London, UK, Jun. 2016, https://doi.org/10.1109/CyberSecPODS.2016.7502343.
A. Moser, C. Kruegel, and E. Kirda, "Limits of Static Analysis for Malware Detection," in Twenty-Third Annual Computer Security Applications Conference (ACSAC 2007), Miami Beach, FL, USA, Dec. 2007, pp. 421–430, https://doi.org/10.1109/ACSAC.2007.21.
M. Christodorescu, S. Jha, and C. Kruegel, "Mining specifications of malicious behavior," in Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering, New York, NY, USA, Sep. 2007, pp. 5–14, https://doi.org/10.1145/1287624.1287628.
U. Bayer, P. M. Comparetti, C. Hlauschek, C. Kruegel, and E. Kirda, "Scalable, Behavior-Based Malware Clustering," 2009.
I. Santos, J. Nieves, and P. G. Bringas, "Semi-supervised Learning for Unknown Malware Detection," in International Symposium on Distributed Computing and Artificial Intelligence, 2011, pp. 415–422, https://doi.org/10.1007/978-3-642-19934-9_53.
M. Kalash, M. Rochan, N. Mohammed, N. D. B. Bruce, Y. Wang, and F. Iqbal, "Malware Classification with Deep Convolutional Neural Networks," in 2018 9th IFIP International Conference on New Technologies, Mobility and Security (NTMS), Paris, France, Feb. 2018, https://doi.org/10.1109/NTMS.2018.8328749.
R. Kumar, Z. Xiaosong, R. U. Khan, I. Ahad, and J. Kumar, "Malicious Code Detection based on Image Processing Using Deep Learning," in Proceedings of the 2018 International Conference on Computing and Artificial Intelligence, New York, NY, USA, Mar. 2018, pp. 81–85, https://doi.org/10.1145/3194452.3194459.
Z. Cui, F. Xue, X. Cai, Y. Cao, G. Wang, and J. Chen, "Detection of Malicious Code Variants Based on Deep Learning," IEEE Transactions on Industrial Informatics, vol. 14, no. 7, pp. 3187–3196, Jul. 2018, https://doi.org/10.1109/TII.2018.2822680.
R. Oshana, M. A. Thornton, E. C. Larson, and X. Roumegue, "Real-Time Edge Processing Detection of Malicious Attacks Using Machine Learning and Processor Core Events," in 2021 IEEE International Systems Conference (SysCon), Vancouver, Canada, Apr. 2021, https://doi.org/10.1109/SysCon48628.2021.9447078.
J. H. Paik, "A novel TF-IDF weighting scheme for effective ranking," in Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval, New York, NY, USA, Jul. 2013, pp. 343–352, https://doi.org/10.1145/2484028.2484070.
T. Georgiou, Y. Liu, W. Chen, and M. Lew, "A survey of traditional and deep learning-based feature descriptors for high dimensional data in computer vision," International Journal of Multimedia Information Retrieval, vol. 9, no. 3, pp. 135–170, Sep. 2020, https://doi.org/10.1007/s13735-019-00183-w.
M. Ben Ayed, S. A. Alshaya, and A. Alshammari, "Enhanced heart rate estimation based on face features," in 2021 18th International Multi-Conference on Systems, Signals Devices (SSD), Monastir, Tunisia, Mar. 2021, pp. 840–844, https://doi.org/10.1109/SSD52085.2021.9429508.
A. Back and E. Westman, "Comparing programming languages in google code jam," Chalmers University of Technology, University of Gothenburg, Gothenburg, Sweden, 2017.
T. H.-D. Huang and H.-Y. Kao, "R2-D2: ColoR-inspired Convolutional NeuRal Network (CNN)-based AndroiD Malware Detections," in 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, Dec. 2018, pp. 2633–2642, https://doi.org/10.1109/BigData.2018.8622324.
A. D. Moore, Intellectual Property and Information Control: Philosophic Foundations and Contemporary Issues. New Brunswick, NJ, USA: Routledge, 2004.
S. Elfwing, E. Uchibe, and K. Doya, "Sigmoid-weighted linear units for neural network function approximation in reinforcement learning," Neural Networks, vol. 107, pp. 3–11, Nov. 2018, https://doi.org/10.1016/j.neunet.2017.12.012.
Z. Cui, L. Du, P. Wang, X. Cai, and W. Zhang, "Malicious code detection based on CNNs and multi-objective algorithm," Journal of Parallel and Distributed Computing, vol. 129, pp. 50–58, Jul. 2019, https://doi.org/10.1016/j.jpdc.2019.03.010.
Downloads
How to Cite
License
Copyright (c) 2021 K. Aldriwish
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.