Application of Synthetic Data on Object Detection Tasks
Received: 24 May 2024 | Revised: 5 June 2024 | Accepted: 12 June 2024 | Online: 19 June 2024
Corresponding author: Hong Hai Hoang
Abstract
Object detection is a computer vision task that identifies and locates one or more effective targets from image or video data. The accuracy of object detection heavily depends on the size and the diversity of the utilized dataset. However, preparing and labeling an adequate dataset to guarantee a high level of reliability can be time-consuming and labor-intensive, because the process of building data requires manually setting up the environment and capturing the dataset while keeping its variety in scenarios. There have been several efforts on object detection that take a long time to prepare the input data for training the models. To deal with this problem, synthetic data have emerged as a potential for the replacement of real-world data in data preparation for model training. In this paper, we provide a technique that can generate an enormous synthetic dataset with little human labor. Concretely, we have simulated the environment by applying the pyBullet library and capturing various types of input images. In order to examine its performance on the training model, we integrated a YOLOv5 object detection model to investigate the dataset. The output of the conducted model was deployed in a simulation robot system to examine its potential. YOLOv5 can reach a high accuracy of object detection at 93.1% mAP when solely training on our generated data. Our research provides a novelistic method to facilitate the understanding of data generation process in preparing datasets for deep learning models.
Keywords:
object detection, YOLO, synthetic dataDownloads
References
R. Szeliski, Computer Vision: Algorithms and Applications. London, UK ; New York, NY, USA: Springer, 2010.
A. Buetti-Dinh et al., "Deep neural networks outperform human expert’s capacity in characterizing bioleaching bacterial biofilm composition," Biotechnology Reports, vol. 22, Jun. 2019, Art. no. e00321.
C. Joshi, I. Kaloskampis, and L. Nolan, "Generative adversarial networks (GANs) for synthetic dataset generation with binary classes," Data Science Campus, Feb. 21, 2019. https://datasciencecampus.ons.gov.uk/projects/generative-adversarial-networks-gans-for-synthetic-dataset-generation-with-binary-classes/.
J. Young, P. Graham, and R. Penny, "Using Bayesian Networks to Create Synthetic Data," Journal of Official Statistics, vol. 25, no. 4, pp. 549–567, 2009.
H. Zhang, W. Gao, X. Chen, and D. Zhao, "Object detection using spatial histogram features," Image and Vision Computing, vol. 24, no. 4, pp. 327–341, Apr. 2006.
P. Piccinini, A. Prati, and R. Cucchiara, "Real-time object detection and localization with SIFT-based clustering," Image and Vision Computing, vol. 30, no. 8, pp. 573–587, Aug. 2012.
Y. Ren, C. Zhu, and S. Xiao, "Object Detection Based on Fast/Faster RCNN Employing Fully Convolutional Architectures," Mathematical Problems in Engineering, vol. 2018, no. 1, 2018, Art. no. 3598316.
Z. Zou, K. Chen, Z. Shi, Y. Guo, and J. Ye, "Object Detection in 20 Years: A Survey," Proceedings of the IEEE, vol. 111, no. 3, pp. 257–276, Jan. 2023.
D. Kalashnikov et al., "Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation," in Proceedings of The 2nd Conference on Robot Learning, Oct. 2018, pp. 651–673.
M. Kothapalli and P. Manohar, "The Challenges of Data Quality and Data Quality Assessment in the Big Data," Apr. 2023.
B. Calli et al., "Yale-CMU-Berkeley dataset for robotic manipulation research," The International Journal of Robotics Research, vol. 36, no. 3, pp. 261–268, Mar. 2017.
D. Thuan, "Evolution of Yolo algorithm and Yolov5: The State-of-the-Art object detention algorithm," B.Sc. thesis, Oulu University of Applied Sciences, 2021.
T. Saidani, "Deep Learning Approach: YOLOv5-based Custom Object Detection," Engineering, Technology & Applied Science Research, vol. 13, no. 6, pp. 12158–12163, Dec. 2023.
S. Sahel, M. Alsahafi, M. Alghamdi, and T. Alsubait, "Logo Detection Using Deep Learning with Pretrained CNN Models," Engineering, Technology & Applied Science Research, vol. 11, no. 1, pp. 6724–6729, Feb. 2021.
S. Chitta, I. Sucan, and S. Cousins, "Movelt![ROS topics]," IEEE Robotics & Automation Magazine, vol. 19, no. 1, pp. 18-19, 2012.
Downloads
How to Cite
License
Copyright (c) 2024 Huu Long Nguyen, Duc Toan Le, Hong Hai Hoang
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.