Enhanced Convolutional Neural Network for Fashion Classification
Received: 15 June 2024 | Revised: 8 July 2024 | Accepted: 21 July 2024 | Online: 6 August 2024
Corresponding author: Omar M. Ahmed
Abstract
Fashion items are hard to classify since there are a million variations in style, texture, and pattern. Image classification is among the noted strengths of convolutional neural networks. This research introduces an improved CNN architecture for fashion classification, utilizing image augmentation and batch normalization to improve model performance and generalization. To make the model more robust, image augmentation techniques like rotation, width and height shift, zoom, and flips were employed. In addition, a Batch Normalization layer is added in the middle, which can help on stabilizing the learning process and accelerating convergence. The proposed model was trained on an augmented dataset, achieving a satisfactory improvement in test accuracy of 91.97% compared to a baseline CNN model, which obtained 88.5% accuracy. According to the results, the image augmentation with the application of Batch Normalization improves the CNN architecture for better effectiveness in fashion classification tasks.
Keywords:
fashion classification, convolutional neural networks, batch normalization, image augmentationDownloads
References
S. Shirkhani, H. Mokayed, R. Saini, and H. Y. Chai, "Study of AI-Driven Fashion Recommender Systems," SN Computer Science, vol. 4, no. 5, Jul. 2023, Art. no. 514.
L. Chen, S. Li, Q. Bai, J. Yang, S. Jiang, and Y. Miao, "Review of Image Classification Algorithms Based on Convolutional Neural Networks," Remote Sensing, vol. 13, no. 22, Jan. 2021, Art. no. 4712.
O. M. Ahmed, L. M. Haji, A. M. Ahmed, and N. M. Salih, "Bitcoin Price Prediction using the Hybrid Convolutional Recurrent Model Architecture," Engineering, Technology & Applied Science Research, vol. 13, no. 5, pp. 11735–11738, Oct. 2023.
M. Xu, S. Yoon, A. Fuentes, and D. S. Park, "A Comprehensive Survey of Image Augmentation Techniques for Deep Learning," Pattern Recognition, vol. 137, May 2023, Art. no. 109347.
A. Alzahrani, "Digital Image Forensics: An Improved DenseNet Architecture for Forged Image Detection," Engineering, Technology & Applied Science Research, vol. 14, no. 2, pp. 13671–13680, Apr. 2024.
J. Bjorck, C. Gomes, B. Selman, and K. Q. Weinberger, "Understanding Batch Normalization." arXiv, Nov. 30, 2018.
R. Ramesh and S. Sathiamoorthy, "A Deep Learning Grading Classification of Diabetic Retinopathy on Retinal Fundus Images with Bio-inspired Optimization," Engineering, Technology & Applied Science Research, vol. 13, no. 4, pp. 11248–11252, Aug. 2023.
Y. Chen, "QDCNN: Quantum Dilated Convolutional Neural Network." arXiv, Oct. 29, 2021.
Y. Ji, "A novel CNN+LSTM classification model based on fashion-MNIST," vol. 12258, Art. no. 122580S, Jul. 2022.
F. Lei, X. Liu, Q. Dai, and B. W.-K. Ling, "Shallow convolutional neural network for image classification," SN Applied Sciences, vol. 2, no. 1, Dec. 2019, Art. no. 97.
M. Kayed, A. Anter, and H. Mohamed, "Classification of Garments from Fashion MNIST Dataset Using CNN LeNet-5 Architecture," in 2020 International Conference on Innovative Trends in Communication and Computer Engineering (ITCE), Oct. 2020, pp. 238–243.
O. Nocentini, J. Kim, M. Z. Bashir, and F. Cavallo, "Image Classification Using Multiple Convolutional Neural Networks on the Fashion-MNIST Dataset," Sensors, vol. 22, no. 23, Jan. 2022, Art. no. 9544.
A. Vijayaraj et al., "Deep Learning Image Classification for Fashion Design," Wireless Communications and Mobile Computing, vol. 2022, no. 1, 2022, Art. no. 7549397.
B. Ruff, T. Beck, and J. Bach, "Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization." arXiv, Nov. 29, 2019.
C. Shorten and T. M. Khoshgoftaar, "A survey on Image Data Augmentation for Deep Learning," Journal of Big Data, vol. 6, no. 1, Jul. 2019, Art. no. 60.
Downloads
How to Cite
License
Copyright (c) 2024 Lailan M. Haji, Omar M. Mustafa, Sherwan A. Abdullah, Omar M. Ahmed
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.