Open Access
Video super‐resolution with non‐local alignment network
Author(s) -
Zhou Chao,
Chen Can,
Ding Fei,
Zhang Dengyin
Publication year - 2021
Publication title -
IET Image Processing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.401
H-Index - 45
eISSN - 1751-9667
pISSN - 1751-9659
DOI - 10.1049/ipr2.12134
Subject(s) - computer science , artificial intelligence , computer vision , image resolution , superresolution , artificial neural network , pattern recognition , reference frame
Abstract Video super‐resolution (VSR) aims at recovering high‐resolution frames from their low‐resolution counterparts. Over the past few years, deep neural networks have dominated the video super‐resolution task because of their strong non‐linear representational ability. To exploit temporal correlations, most deep neural networks face two challenges: (1) how to align consecutive frames that contain motion, occlusion and blur, and establish accurate temporal correspondences, and (2) how to effectively fuse the aligned frames and balance their contributions. In this work, a novel video super‐resolution network, named NLVSR, is proposed to solve the above problems in an efficient and effective manner. For alignment, a temporal‐spatial non‐local operation is employed to align each frame to the reference frame. Compared with existing alignment approaches, the proposed temporal‐spatial non‐local operation integrates the global information of each frame through a weighted sum, leading to better alignment. For fusion, an attention‐based progressive fusion framework is designed to integrate the aligned frames gradually. To penalize low‐quality points in the aligned features, an attention mechanism is employed for robust reconstruction. Experimental results demonstrate the superiority of the proposed network in both quantitative and qualitative evaluation; it surpasses other state‐of‐the‐art methods by at least 0.33 dB.
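
The abstract describes the temporal‐spatial non‐local alignment only at a high level (each reference position aggregates global information from a neighbouring frame via a weighted sum). The PyTorch sketch below is a minimal, hypothetical reading of that idea in the style of a standard non‐local/attention block, not the authors' actual NLVSR module; the class name NonLocalAlign and parameters such as inner_channels are invented for illustration.

import torch
import torch.nn as nn

class NonLocalAlign(nn.Module):
    """Hypothetical temporal-spatial non-local alignment block (sketch).

    Every position of the reference-frame features attends over all
    positions of a neighbouring frame's features; the aligned output is
    the softmax-weighted sum of the neighbour's values.
    """
    def __init__(self, channels, inner_channels=None):
        super().__init__()
        c = inner_channels or channels // 2
        self.query = nn.Conv2d(channels, c, 1)        # embeds the reference frame
        self.key = nn.Conv2d(channels, c, 1)          # embeds the neighbouring frame
        self.value = nn.Conv2d(channels, channels, 1)
        self.out = nn.Conv2d(channels, channels, 1)

    def forward(self, ref_feat, nbr_feat):
        # ref_feat, nbr_feat: (B, C, H, W) feature maps
        b, c, h, w = ref_feat.shape
        q = self.query(ref_feat).flatten(2).transpose(1, 2)   # (B, HW, C')
        k = self.key(nbr_feat).flatten(2)                      # (B, C', HW)
        v = self.value(nbr_feat).flatten(2).transpose(1, 2)    # (B, HW, C)

        attn = torch.softmax(q @ k / (q.shape[-1] ** 0.5), dim=-1)   # (B, HW, HW)
        aligned = (attn @ v).transpose(1, 2).reshape(b, c, h, w)     # weighted sum over the neighbour
        return self.out(aligned) + ref_feat   # residual connection (an assumption, not stated in the abstract)

In practice one such block per neighbouring frame would produce the aligned features that the fusion stage consumes; note that the full HW-by-HW attention map is quadratic in spatial size, which is why practical variants typically restrict or subsample the search region.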
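
Similarly, the attention‐based progressive fusion is only characterised as merging aligned frames gradually while down‐weighting low‐quality points. The following sketch is one plausible realisation under those assumptions; the class AttentiveProgressiveFusion, its gating scheme and layer sizes are all hypothetical.

import torch
import torch.nn as nn

class AttentiveProgressiveFusion(nn.Module):
    """Hypothetical attention-based progressive fusion (sketch).

    Aligned neighbour features are merged into the running estimate one
    at a time; a per-pixel sigmoid gate suppresses low-quality points
    before each fusion step.
    """
    def __init__(self, channels):
        super().__init__()
        self.attn = nn.Conv2d(2 * channels, channels, 3, padding=1)   # predicts the per-pixel gate
        self.fuse = nn.Conv2d(2 * channels, channels, 3, padding=1)   # merges gated neighbour into the estimate

    def forward(self, ref_feat, aligned_feats):
        # ref_feat: (B, C, H, W); aligned_feats: list of (B, C, H, W) aligned neighbour features
        fused = ref_feat
        for nbr in aligned_feats:
            gate = torch.sigmoid(self.attn(torch.cat([fused, nbr], dim=1)))   # 0..1 attention map
            fused = self.fuse(torch.cat([fused, gate * nbr], dim=1))          # progressive, one frame at a time
        return fused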