z-logo
open-access-imgOpen Access
Improved UAV target detection model for RT-DETR
Author(s) -
Yong He,
Yufan Pang,
Guolin Ou,
Renfeng Xiao,
Yifan Tang
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3575189
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
In light of the shortcomings pertaining to UAV small target detection, the detection of complex scenes, and the detection of multi-scale targets, a time-frequency dual-domain feature extraction algorithm, TF-DETR, has been proposed. This algorithm has been optimized for RT-DETR.Firstly, a time-frequency domain feature extraction module, TF-CSPNet, has been introduced into the backbone network. This module facilitates the efficient extraction and fusion of multi-source features. Secondly, the Extreme Perceptive Linear Attention (EPLA) mechanism is designed and introduced to improve the AIFI module, which enhances the model’s attention to the key information by considering the positive and negative polarity interactions between the query and the key. Furthermore, the Focaler-MPDIoU loss function has been developed to address the challenge of suboptimal localization accuracy for hard-to-detect targets and diminutive targets. On the VisDrone2019 dataset, the mAP0.5 of the enhanced model demonstrates a 3.5% improvement, accompanied by a 6.1% and 2.9% reduction in parameters and computations, respectively. The efficacy of these enhancements is substantiated by the model’s superior performance in comparison to other target detection models at equivalent levels.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom