Advanced Gesture Recognition in Gaming: Implementing EfficientNetV2-B1 for "Rock, Paper, Scissors"
Received: 29 January 2025 | Revised: 27 February 2025 | Accepted: 6 March 2025 | Online: 6 May 2025
Corresponding author: Biswaranjan Acharya
Abstract
This study introduces a gesture recognition system for the classic "Rock, Paper, Scissors" game based on a modified EfficientNetV2-B1 architecture. The dataset comprises 2,700 images evenly divided among the three classes: "Rock", "Paper", and "Scissors". Leveraging the efficiency and accuracy of EfficientNetV2-B1 in image recognition tasks, the system was trained to classify these gestures. After fine-tuning, it achieved an accuracy of 98.89% and an Area Under the Curve (AUC) of approximately 1.0, indicating near-perfect classification across all classes. This performance highlights the potential of EfficientNetV2-B1 for real-time gesture recognition, with applications in interactive gaming and other gesture-based user interfaces. The proposed system also offers a foundation for further research and development in gesture recognition technologies.
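The page does not include the training code, but the setup described in the abstract (an ImageNet-pretrained EfficientNetV2-B1 backbone fine-tuned on 2,700 rock/paper/scissors images) can be sketched in TensorFlow/Keras. This is a minimal sketch, not the authors' reported configuration: the directory layout ("data/{rock,paper,scissors}"), the 80/20 split, the input resolution, and all hyperparameters below are illustrative assumptions.

import tensorflow as tf

IMG_SIZE = (240, 240)   # EfficientNetV2-B1's default input resolution (assumed here)
NUM_CLASSES = 3         # rock, paper, scissors

# Assumed layout: data/{rock,paper,scissors}/*.jpg with 2,700 images in total.
# label_mode defaults to "int", so labels are sparse integer class indices.
train_ds = tf.keras.utils.image_dataset_from_directory(
    "data", validation_split=0.2, subset="training", seed=42,
    image_size=IMG_SIZE, batch_size=32)
val_ds = tf.keras.utils.image_dataset_from_directory(
    "data", validation_split=0.2, subset="validation", seed=42,
    image_size=IMG_SIZE, batch_size=32)

# ImageNet-pretrained backbone without its 1000-class head. The Keras
# EfficientNetV2 models include input preprocessing by default, so raw
# [0, 255] pixel values can be fed in directly.
base = tf.keras.applications.EfficientNetV2B1(
    include_top=False, weights="imagenet", input_shape=IMG_SIZE + (3,))
base.trainable = False  # stage 1: train only the new classification head

inputs = tf.keras.Input(shape=IMG_SIZE + (3,))
x = base(inputs, training=False)  # keep batch-norm statistics frozen
x = tf.keras.layers.GlobalAveragePooling2D()(x)
x = tf.keras.layers.Dropout(0.2)(x)
outputs = tf.keras.layers.Dense(NUM_CLASSES, activation="softmax")(x)
model = tf.keras.Model(inputs, outputs)

model.compile(optimizer=tf.keras.optimizers.Adam(1e-3),
              loss="sparse_categorical_crossentropy", metrics=["accuracy"])
model.fit(train_ds, validation_data=val_ds, epochs=5)

# Stage 2: unfreeze the backbone and fine-tune at a much lower learning rate.
base.trainable = True
model.compile(optimizer=tf.keras.optimizers.Adam(1e-5),
              loss="sparse_categorical_crossentropy", metrics=["accuracy"])
model.fit(train_ds, validation_data=val_ds, epochs=5)

Freezing the backbone first and then unfreezing it at a reduced learning rate is a common two-stage transfer-learning recipe; calling the base with training=False keeps its batch-normalization layers in inference mode throughout, which is the standard Keras fine-tuning pattern.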
Keywords: rock-paper-scissors, deep learning, EfficientNetV2-B1, gesture recognition
License
Copyright (c) 2025 Chander Prabha, Retinderdeep Singh, Meena Malik, Manas Ranjan Pradhan, Biswaranjan Acharya

This work is licensed under a Creative Commons Attribution 4.0 International License.