Multiscale Feature Learning Based on Enhanced Feature Pyramid for Vehicle Detection
Author(s) -
Hoanh Nguyen
Publication year - 2021
Publication title -
complexity
Language(s) - English
Resource type - Journals
eISSN - 1099-0526
pISSN - 1076-2787
DOI - 10.1155/2021/5555121
Subject(s) - computer science , pyramid (geometry) , artificial intelligence , feature (linguistics) , discriminative model , backbone network , pattern recognition (psychology) , pascal (unit) , pooling , feature extraction , object detection , benchmark (surveying) , exploit , feature learning , fuse (electrical) , computer vision , engineering , mathematics , computer network , philosophy , linguistics , geometry , computer security , geodesy , programming language , geography , electrical engineering
Vehicle detection is a crucial task in autonomous driving systems. Due to large variance of scales and heavy occlusion of vehicle in an image, this task is still a challenging problem. Recent vehicle detection methods typically exploit feature pyramid to detect vehicles at different scales. However, the drawbacks in the design prevent the multiscale features from being completely exploited. This paper introduces a feature pyramid architecture to address this problem. In the proposed architecture, an improving region proposal network is designed to generate intermediate feature maps which are then used to add more discriminative representations to feature maps generated by the backbone network, as well as improving the computational cost of the network. To generate more discriminative feature representations, this paper introduces multilayer enhancement module to reweight feature representations of feature maps generated by the backbone network to increase the discrimination of foreground objects and background regions in each feature map. In addition, an adaptive RoI pooling module is proposed to pool features from all pyramid levels for each proposal and fuse them for the detection network. Experimental results on the KITTI vehicle detection benchmark and the PASCAL VOC 2007 car dataset show that the proposed approach obtains better detection performance compared with recent methods on vehicle detection.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom