z-logo
open-access-imgOpen Access
Bilateral attention network for semantic segmentation
Author(s) -
Wang Dongli,
Li Nanjun,
Zhou Yan,
Mu Jinzhen
Publication year - 2021
Publication title -
iet image processing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.401
H-Index - 45
eISSN - 1751-9667
pISSN - 1751-9659
DOI - 10.1049/ipr2.12129
Subject(s) - computer science , segmentation , encoder , pascal (unit) , artificial intelligence , feature (linguistics) , channel (broadcasting) , pattern recognition (psychology) , focus (optics) , benchmark (surveying) , image segmentation , computer vision , linguistics , philosophy , physics , geodesy , optics , programming language , geography , operating system , computer network
Enhancing network feature representation capabilities and reducing the loss of image details have become the focus of semantic segmentation task. This work proposes the bilateral attention network for semantic segmentation. The authors embed two attention modules in the encoder and decoder structures . Specifically, high‐level features of the encoder structure integrate all channel maps through dense channel relationships learned by the channel correlation coefficient attention module. The positively correlated channels promote each other, and the negatively correlated channels suppress each other. In the decoder structure, low‐level features selectively emphasize the edge detail information in the feature map through the position attention module. The feature expression of semantic segmentation is improved by feature fusion of the two attention modules to obtain more accurate segmentation results . Finally, to verify the effectiveness of the model, the authors conduct experiments on the PASCAL VOC 2012 and Cityscapes scene analysis benchmark data sets and achieve a mean intersection‐over‐union of 74.92% and 66.63%, respectively.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here