MSUD-YOLO: A Novel Multiscale Small Object Detection Model for UAV Aerial Images

Published: 2025-06-13 Issue: 6 Volume: 9 Page: 429
ISSN: 2504-446X
Container-title: Drones
Language: en
Short-container-title: Drones

Author:

Zhao Xiaofeng¹, Zhang Hui¹, Zhang Wenwen¹, Ma Junyi¹, Li Chenxiao¹, Ding Yao¹, Zhang Zhili¹

Affiliation:

1. The National Key Laboratory of Optical Engineering, the Rocket Force University of Engineering, Xi’an 710025, China

Abstract

Objects in UAV aerial images often appear at multiple scales, are small, and sit against complex backgrounds, so current object detection models perform unsatisfactorily on them. To address these issues, this paper presents MSUD-YOLO, a multiscale small object detection model for UAV aerial images based on YOLOv10s. First, the model uses an attention scale sequence fusion mode to achieve more efficient multiscale feature fusion, and a tiny prediction head is added so that the model focuses on low-level features, improving its ability to detect small objects. Second, a novel feature extraction module named CFormerCGLU is designed, which improves feature extraction capability in a more lightweight way. In addition, the model replaces standard convolution with lightweight convolution to reduce computation. Finally, the WIoU v3 loss function is used to make the model focus more on low-quality examples, thereby improving its object localization ability. Experimental results on the VisDrone2019 dataset show that MSUD-YOLO improves mAP50 by 8.5% compared with YOLOv10s while reducing parameters by 6.3%, verifying the model’s effectiveness for object detection in UAV aerial images in complex environments. Furthermore, compared with several recent UAV object detection algorithms, MSUD-YOLO offers higher detection accuracy at lower computational cost: its mAP50 reaches 43.4% with only 6.766 M parameters.
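
The mAP50 figures quoted in the abstract count a predicted box as correct when its intersection over union (IoU) with a ground-truth box is at least 0.5. The following is a minimal sketch of that matching criterion only, not the authors' evaluation code; the function names `iou` and `is_true_positive_at_50` and the example boxes are hypothetical and chosen for illustration.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region, clamped to zero when the boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive_at_50(pred_box, gt_box, threshold=0.5):
    """Hypothetical helper: does the prediction match the ground truth at IoU >= 0.5?"""
    return iou(pred_box, gt_box) >= threshold


# Illustrative 10x10-pixel object (assumed values, not taken from VisDrone2019).
gt = (0, 0, 10, 10)
print(is_true_positive_at_50((2, 0, 12, 10), gt))  # shifted 2 px horizontally
print(is_true_positive_at_50((3, 3, 13, 13), gt))  # shifted 3 px in both directions
```

For a box this small, a shift of a few pixels is enough to move the IoU across the 0.5 threshold, which is one reason localization quality matters so much for small objects in aerial images.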

Funder

National Natural Science Foundation of China

National Foundation for Enhancing Fundamental Sciences in China

Publisher

MDPI AG

Link

https://www.mdpi.com/2504-446X/9/6/429/pdf

References: 52 articles.

1. UAV recognition algorithm for ground military targets based on improved Yolov5n; Wang; Comput. Meas. Control, 2024

2. Path planning for dual UAVs cooperative suspension transport based on artificial potential field-A* algorithm; Rao; Knowl.-Based Syst., 2023

3. PROSAIL-Net: A transfer learning-based dual stream neural network to estimate leaf chlorophyll and leaf angle of crops from UAV hyperspectral images; Bhadra; ISPRS J. Photogramm. Remote Sens., 2024

4. UAV-aided distribution line inspection using double-layer offloading mechanism; Duo; IET Gener. Transm. Distrib., 2024

5. AANet: An ambiguity-aware network for remote-sensing image change detection; Hang; IEEE Trans. Geosci. Remote Sens., 2024