DenSE SwinHDR: SDRTV to HDRTV Conversion using Densely Connected Swin Transformer with Squeeze and Excitation Module | Zendy

Joon-ki Bae | Zendy; Subin Yang | Zendy; Sung-Ho Bae | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

DenSE SwinHDR: SDRTV to HDRTV Conversion using Densely Connected Swin Transformer with Squeeze and Excitation Module

Author(s) -

Joon-ki Bae,

Subin Yang,

Sung-Ho Bae

Publication year - 2022

Publication title -

ieee access

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.587

H-Index - 127

ISSN - 2169-3536

DOI - 10.1109/access.2022.3231339

Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation

Modern displays have the capability of rendering video contents encoded with high dynamic range (HDR) standards. HDR contents deliver more realistic visual experiences by wider color gamut and luminance compared to standard dynamic range (SDR) contents. However, most of the available contents are encoded with SDR standards. To provide HDR contents, the technology converts existing SDR contents to HDR contents, what we call SDRTV-to-HDRTV , is highly demanded by providers such as IPTV or broadcasting services. In this paper, we divide SDRTV-to-HDRTV conversion problem into global and local mapping problems. Transformers recently achieve significant performance and are known for conducting global mapping effectively. Convolution Neural Networks (CNN) are specialized for extracting and converting local features. In this regard, we introduce a combined model with a transformer for global mapping and CNN for local mapping, which solves the SDRTD-to-HDRTV problem in a complementary manner. We intensively explore the best combination strategy for transformers and CNNs. Through comprehensive objective/subjective experiments, we verified that the proposed method achieves the highest performance compared to the existing models in both fidelity and visual quality perspectives. To the best of our knowledge, we are the first to utilize Vision Transformer for the SDRTV-to-HDRTV conversion problem. To boost the performance, we combined Vision Transformer with architectural strategies which are previously applied on convolutional neural networks such as residual connection, dense connection, and squeeze-and-excitation module. We introduce a new Vision Transformer architecture denoted as DenSE-SwinHDR . Our method outperforms in terms of objective scores and visual quality compared to the state-of-the-art methods. Specifically, DenSE-SwinHDR achieved 0.79 dB PSNR, 0.93 dB PU-PSNR gain over HDRTVNet. Also, our proposed method achieve best performance on subjective quality assessment.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore