LCEA-YOLO: Improving Small Object Detection in Aerial Imagery Using Local Contrast Enhancement Attention and Inner-CIoU Loss
Corresponding author: Sara Ennaama
Abstract
This study proposed the Local Contrast Enhancement Attention - You Only Look Once (LCEA-YOLO) model to address the challenge of detecting small objects in high-altitude Unmanned Aerial Vehicle (UAV) images. LCEA-YOLO is a customized detector developed using the YOLOv10n architecture that embeds an LCEA module to emphasize local subtle contrast variations, as well as an Inner-CIoU loss function that enhances the regression ability for smaller objects. Experiments on the UAVDT dataset were conducted, with the results demonstrating that LCEA-YOLO exceeded other baseline algorithms with an overall mAP@0.5 of 47.2% and mAP@0.5:0.95 of 29.9%. Additionally, the model presented remarkable resilience on challenging under-represented categories, yielding improvements of 15.0% and 7.6% in detection accuracy for trucks and buses, respectively. These findings validated the benefit of targeted local contrast enhancement and scale-aware regression for real-time UAV detection.
Keywords:
object detection, Unmanned Aerial Vehicles (UAV), YOLOv10, attention mechanism, Inner-CIoU loss, small object detection, deep learningDownloads
References
P. Zhu, L. Wen, X. Bian, H. Ling, and Q. Hu, "Vision Meets Drones: A Challenge." arXiv, Apr. 23, 2018.
D. Du et al., "The unmanned aerial vehicle benchmark: Object detection and tracking," in 15th European Conference on Computer Vision (ECCV 2018), Sept. 2018, pp. 370–386.
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016.
Ultralytics YOLOv8, G. Jocher, J. Qiu, and A. Chaurasia, 2023.
A. Wang et al., "YOLOv10: Real-Time End-to-End Object Detection," Advances in Neural Information Processing Systems, vol. 37, pp. 107984–108011, Dec. 2024.
M. Chaman et al., "A Real-Time Vehicle Detection System for ADAS in Autonomous Vehicles Using YOLOv11 Deep Neural Network on Embedded Edge Platforms," Engineering, Technology & Applied Science Research, vol. 15, no. 5, pp. 28077–28082, Oct. 2025.
Y. Zhao et al., "Detrs beat yolos on real-time object detection," in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Sept. 2024.
L. Tan, C. Zhang, H. Bai, Z. Liu, and Y. Li, "A real-time and efficient detector for small object in UAV aerial images," Scientific Reports, vol. 15, no. 1, Nov. 2025, Art. no. 39233.
Z. Xu, H. Zhao, P. Liu, L. Wang, G. Zhang, and Y. Chai, "SRTSOD-YOLO: Stronger Real-Time Small Object Detection Algorithm Based on Improved YOLO11 for UAV Imageries," Remote Sensing, vol. 17, no. 20, Dec. 2025, Art. no. 3414.
J. Hu, L. Shen, and G. Sun, "Squeeze-and-Excitation Networks," in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 2018.
S. Woo, J. Park, J.-Y. Lee, and I. S. Kweon, "CBAM: Convolutional Block Attention Module," in 15th European Conference on Computer Vision (ECCV 2018), Sept. 2018.
Y. Dai, Y. Wu, F. Zhou, and K. Barnard, "Attentional Local Contrast Networks for Infrared Small Target Detection," IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 11, pp. 9813–9824, Nov. 2021.
H. Rezatofighi, N. Tsoi, J. Gwak, A. Sadeghian, I. Reid, and S. Savarese, "Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression," in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
Z. Zheng, P. Wang, W. Liu, J. Li, R. Ye, and D. Ren, "Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression," Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 07, pp. 12993–13000, Apr. 2020.
H. Zhang, C. Xu, and S. Zhang, "Inner-IoU: More Effective Intersection over Union Loss with Auxiliary Bounding Box." arXiv, Nov. 14, 2023.
Aerial Vehicles Computer Vision Dataset, Roboflow, https://universe.roboflow.com/uavdt/aerial-vehicles-hjarh.
I. Loshchilov and F. Hutter, "Decoupled Weight Decay Regularization." arXiv, Jan. 04, 2019.
T. Y. Lin et al., "Microsoft COCO: Common Objects in Context," in 13th European Conference of Computer Vision – ECCV, Sept. 2014.
Downloads
How to Cite
License
Copyright (c) 2026 Sara Ennaama, Hassan Silkan, Abderrahim Tahiri, Faouzia Ennaama

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.
