
Dynamically Tunable Multidimensional Feature Focusing and Diffusion Networks for Water Surface Debris Detection
Author(s) -
Chong Zhang,
Jie Yue,
Jianglong Fu
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3597727
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
With the increasing level of industrialization, water surface debris detection has emerged as a significant research topic. However, challenges such as complex backgrounds, overlapping targets, and reflections on water surfaces often hinder effective waste detection, resulting in low model accuracy and unreliable recognition. To address these challenges, the Water Surface Debris DEtection TRansformer (WSD-DETR) was proposed. First, a Self-moving Point Convolutional Gating Network (SPCG-Net) was designed, which integrated an adaptive point-moving mechanism with a convolutional gating linear unit to enhance the flexibility and accuracy of feature extraction. Second, an Attention-based Re-parameterization of Intra-scale Feature Interactions (ARIFI) module was constructed to process high-level features extracted from the backbone. This module employed a single-scale transformer encoder with Re-parameterized Batch Normalization (RepBN) to improve focus on small and medium-sized targets in water surface waste detection, thereby capturing relationships between semantic concepts and conceptual entities. Furthermore, a Focal-Diffuse Feature Pyramid Network (FD-FPN) was introduced to accurately capture and integrate key feature information through focused feature fusion techniques while utilizing cross-scale diffusion analysis to efficiently transfer and enhance feature information across different scales. This approach significantly improved feature expression capability and overall model performance. Experimental results indicated that WSD-DETR achieved a precision of 88.1%, and reduced model parameters by 12.6% compared to the Real-Time DEtection TRansformer (RT-DETR), and increased mAP@50 and mAP@50:90 values by 4.4% and 5.5%, respectively. These outcomes demonstrated substantial potential for water surface debris detection.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom