Efficient Region-Wise Packing of Stereoscopic ERP Videos Based on Information Loss Minimization

Hossein Pejman; Stephane Coulombe; Carlos Vazquez; Ahmad Vakili

Open Access

Author(s) - Hossein Pejman, Stephane Coulombe, Carlos Vazquez, Ahmad Vakili

Publication year - 2025

Publication title - IEEE Access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3588095

Subject(s) - aerospace; bioengineering; communication, networking and broadcast technologies; components, circuits, devices and systems; computing and processing; engineered materials, dielectrics and plasmas; engineering profession; fields, waves and electromagnetics; general topics for engineers; geoscience; nuclear engineering; photonics and electrooptics; power, energy and industry applications; robotics and control systems; signal processing and analysis; transportation

Frame-compatible (FC) formats for packing stereoscopic videos are challenging to use because they require higher transmission bandwidth and larger decoder memory buffers than single-view videos. These requirements become even more demanding for stereoscopic 360° videos, the primary content consumed by virtual reality (VR) applications, which call for ultra-high-resolution formats at high frame rates (e.g., 6K, 8K, or 12K at 100 frames per second). To address these challenges, sub-sampled versions of the left and right views are usually used to form the spatial FC format, which degrades visual quality. In this paper, we propose an efficient region-wise packing method for equirectangular projection (ERP) videos that minimizes information loss by exploiting the uneven sampling characteristic of ERP. Moreover, we propose a content-adaptive (CA) packing method for ERP videos in which the sizes of the partitions, each with its own horizontal downsampling factor, are determined adaptively based on spatial complexity. We then use a low-complexity frequency-domain approach to estimate the optimal partition sizes for CA packing, and we apply the proposed methods to determine the optimal packing of stereoscopic ERP videos in the FC format. Experimental results using the VVenC Versatile Video Coding (VVC) encoder show that, compared with the standard side-by-side (SbS) format with uniform horizontal half-downsampling (UHHDS), the proposed CA packing method achieves average Bjøntegaard-Delta bitrate (BD-BR) reductions of 13.84% and 12.02% for the Random Access (RA) and Low Delay B (LDB) configurations, respectively, with an average encoding time comparable to that of SbS. In addition, when performance is measured based on user attention probability using the Laplacian distribution model, our proposed packing methods outperform the state-of-the-art packing method in coding performance while having significantly lower computational complexity.
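
To make the idea of "uneven sampling" in ERP concrete, the following is a minimal Python sketch, not the authors' algorithm. It assumes that a pixel row at latitude φ carries horizontal detail roughly proportional to cos(φ), so a row can be downsampled horizontally by a factor f with little loss when cos(φ) ≤ 1/f. The function names, the candidate factors (1, 2, 4, 8), and this threshold rule are all illustrative assumptions. The content-adaptive partition sizing and the frequency-domain estimation described in the abstract are not modeled here.

```python
import math


def row_latitude(row, height):
    """Latitude in radians of the centre of an ERP pixel row (row 0 is the top)."""
    return (0.5 - (row + 0.5) / height) * math.pi


def max_factor_for_row(row, height, factors=(1, 2, 4, 8)):
    """Largest candidate factor f with cos(latitude) <= 1/f (assumed criterion)."""
    c = math.cos(row_latitude(row, height))
    best = 1
    for f in sorted(factors):
        if c <= 1.0 / f:
            best = f
    return best


def partition_rows(height, factors=(1, 2, 4, 8)):
    """Group consecutive rows with the same factor into (first_row, last_row, factor)."""
    bands = []
    for row in range(height):
        f = max_factor_for_row(row, height, factors)
        if bands and bands[-1][2] == f:
            # Extend the current band by one row.
            bands[-1] = (bands[-1][0], row, f)
        else:
            bands.append((row, row, f))
    return bands


def packed_pixel_count(width, height, factors=(1, 2, 4, 8)):
    """Pixels kept per view after downsampling each band horizontally by its factor."""
    return sum(
        (last - first + 1) * math.ceil(width / f)
        for first, last, f in partition_rows(height, factors)
    )
```

Calling `partition_rows(height)` for a given frame height yields latitude bands whose downsampling factor grows toward the poles, and `packed_pixel_count(width, height)` gives the resulting per-view sample count, which can be compared with the `width * height / 2` samples kept by uniform horizontal half-downsampling.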
