z-logo
open-access-imgOpen Access
Adaptive feedback connection with a single‐level feature for object detection
Author(s) -
Ruan Zhongling,
Cao Jianzhong,
Wang Hao,
Guo Huinan,
Yang Xin
Publication year - 2022
Publication title -
iet computer vision
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.38
H-Index - 37
eISSN - 1751-9640
pISSN - 1751-9632
DOI - 10.1049/cvi2.12121
Subject(s) - computer science , feature (linguistics) , object detection , artificial intelligence , pyramid (geometry) , encoder , pattern recognition (psychology) , benchmark (surveying) , convergence (economics) , detector , computer vision , convolutional neural network , feature extraction , mathematics , telecommunications , philosophy , linguistics , geometry , geodesy , economic growth , economics , geography , operating system
From the perspective of detector optimisation, detecting objects using only a one‐level feature cannot provide good performance for a wide range of scales. Various complex feature pyramidal structures address this problem using the divide‐and‐conquer strategy and multi‐scale feature fusion. However, this requires adding too many additional convolutional layers and fusion operations. To address the issue, a simple detection part is proposed, which includes three components, namely a one‐level feature map for detection, the encoder structure with feedback connection, and a decoupled head. The redesigned encoder and decoupled head can successfully address the performance decline caused by the one‐level feature‐based detection. Moreover, the proposed method can accelerate the convergence of the detector and achieve a faster inference time. Based on the optimised detection part, an adaptive feedback connection with a single‐level feature (AFS) is proposed for object detection. The experiments conducted on the MS COCO 2017 benchmark show that the proposed method can achieve comparable results with its multi‐scale pyramid counterpart, You Only Look Once v4 (YOLOv4). In addition, AFS can help the YOLOv4 achieve 44.9 mAP at 27 frame per second and converging 82 epochs earlier under the image size of 608×608, which represents a 42.1% improvements in the convergence speed.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here