z-logo
open-access-imgOpen Access
Deep Layer and Spatial Aggregation neural network for human pose estimation
Author(s) -
Deyuan Zhang,
Chao Weng,
Haoguang Wang,
Xun Shi
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/2010/1/012166
Subject(s) - fuse (electrical) , computer science , pooling , artificial intelligence , layer (electronics) , pyramid (geometry) , backbone network , object (grammar) , pattern recognition (psychology) , artificial neural network , feature extraction , pose , spatial analysis , feature (linguistics) , computer vision , data mining , geography , mathematics , telecommunications , engineering , remote sensing , linguistics , chemistry , philosophy , geometry , organic chemistry , electrical engineering
The simplebaseline model achieves high performance of human pose estimation with simple network structure. But the model lacks the layer and spatial information fusion. In this paper, we propose DLSAnet, which fuse layers and spatial information efficetively. DLSAnet uses DLA as backbone which has excellent feature extraction capabilities in the field of object detection. In addition, a modified spatial pyramid pooling is introduced to pool and connect multi-scale local area features, allowing the network to learn object features more comprehensively. Using a four-branch SPP module instead of a single-branch SPP module connected by a single hopping layer. This method is effective in alleviating the problem of slow loss drop late in training. Experiments show that DLSAnet can achieve better accuracy.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here