z-logo
open-access-imgOpen Access
Multi-Scale Context Enhanced Network for Monocular Depth Estimation
Author(s) -
Wenju Wang,
Yue Ning
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1848/1/012023
Subject(s) - monocular , computer science , context (archaeology) , dimension (graph theory) , scale (ratio) , artificial intelligence , residual , convolution (computer science) , ranging , block (permutation group theory) , field (mathematics) , computer vision , pattern recognition (psychology) , algorithm , artificial neural network , mathematics , geography , geometry , telecommunications , cartography , archaeology , pure mathematics
Monocular depth estimation is a classical computer vision task. At present, most CNN methods cannot effectively combine high-level and low-level features, leading to the loss of details and blurring of boundaries. To solve the problem, we propose a Multi-Scale Context Enhanced Network (MCEN) to learn more abundant context and expand its receptive field for high-accuracy estimation. Our method employs CRE-HRNet (Context and Receptive Enhanced High-Resolution Network) with four branches ranging from low-dimension to high-dimension features to obtain richer contextual information and extract multi-scale features. It then uses RM (Refinement Module) adopting the residual dilated convolution to retains detailed information and improve the receptive field. Finally, non-local block enables our network to capture the longdistance context through its special non-local operation. Experiments with the NYU Depth V2 dataset show its outstanding performance.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here