z-logo
Premium
An adaptive stacked hourglass network with Kalman filter for estimating 2D human pose in video
Author(s) -
Hu Tao,
Xiao Chunxia,
Min Geyong,
Najjari Noushin
Publication year - 2021
Publication title -
expert systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.365
H-Index - 38
eISSN - 1468-0394
pISSN - 0266-4720
DOI - 10.1111/exsy.12552
Subject(s) - computer science , artificial intelligence , computer vision , pose , hourglass , kalman filter , frame (networking) , telecommunications , archaeology , history
One of the main challenges in computer science and image processing is 2D human pose estimation. Specifically, occlusion and in particular occlusion of human joints caused by camera angle are of paramount importance. In this paper, a new highly accurate network was proposed that can estimate 2D human poses in video images using deep learning. We employ the Single Shot MultiBox Detector network to detect the centre position of each human within a video frame and then use the stacked hourglass network to estimate the 2D human pose. We approximate the human motion as a linear motion between different frames in a certain period; and optimize the human centres based on the local outlier factor and Kalman filters. The same method is applied to optimize the human pose estimations in video, which can address the inaccurate prediction caused by human joints occlusion. The proposed adaptive network is tested using the two well‐known benchmarks for human pose estimation (MPII and Joint‐annotated Human Motion Data Base datasets), and we also generate some 2D human pose estimating qualitative results of single and multiple people in Internet videos. The experimental results show that the proposed network has strong practicability and can achieve high accuracy on adaptive estimating the 2D human pose in video.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here