
ResFusion: deeply fused scene parsing network for RGB‐D images
Author(s) -
Dai Juting,
Tang Xinyi
Publication year - 2018
Publication title -
iet computer vision
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.38
H-Index - 37
eISSN - 1751-9640
pISSN - 1751-9632
DOI - 10.1049/iet-cvi.2018.5218
Subject(s) - computer science , artificial intelligence , rgb color model , parsing , segmentation , feature (linguistics) , convolutional neural network , pyramid (geometry) , pattern recognition (psychology) , computer vision , benchmark (surveying) , pooling , network architecture , mathematics , cartography , philosophy , linguistics , geometry , computer security , geography
Scene parsing is a very challenging work for complex and diverse scenes. In this study, the authors address the problem of semantic segmentation of indoor scenes for red, green, blue‐depth (RGB‐D) images. Most existing works use only the colour or photometric information for this problem. Here, they present an approach to fusing feature maps between colour network branch and depth network branch to integrate the photometric information and geometric information, which improves the semantic segmentation performance. They propose a novel convolutional neural network that uses ResNet as a baseline network. Their proposed network adopts a spatial pyramid pooling module to make full use of different sub‐region representations. Their proposed network also adopts multiple feature maps fusion modules to integrate texture and structure information between the colour branch and depth branch. Moreover, their proposed network has multiple auxiliary loss branches together with the main loss function to prevent the gradient of frontal layers disappear and accelerate the training phase of the fusion part. Comprehensive experimental evaluations show that their proposed network ‘ResFusion’ improves the performance greatly over the baseline network and has achieved competitive performance compared with other state‐of‐the‐art methods on the challenging SUN RGB‐D benchmark.