
An Improved Low-Bit-Rate Image Compression Framework Based on Semantic-aware Model and Neighborhood Attention
Author(s) -
Chengbin Zeng,
Liang Zhang
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3596711
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
In recent years, image compression techniques based on deep neural networks have achieved significant advancements, outperforming traditional methods in delivering higher compression efficiency at lower bit rates. However, existing approaches often result in degraded image quality at lowbit rates, leading to distortions in critical regions such as faces and text during decoding. To address these limitations, we propose a robust image compression framework based on a semantic-aware model and a hyper-prior encoder with neighborhood attention. First, we utilize the encoder of the semantic-aware model to transform the input image into a latent space Z. To further improve the information representation within the latent space Z, we design a hyper-prior encoder, which leverages neighborhood attention to perform feature enhancement and transformation. This process can minimize quantization errors and facilitate efficient vector quantization. The refined latent space is then quantized using a residual vector quantization technique to ensure efficient and compact representation. Finally, entropy coding is applied to the quantized latent space, enabling high compression efficiency. Experimental results on public benchmark datasets show that our proposed framework achieves comparable results to current mainstream image compression methods. In addition, our proposed algorithm effectively mitigates distortions in facial and textual regions while preserving the structural integrity and visual fidelity. In future work, we plan to improve the model’s robustness under low-light conditions and enhance compression efficiency through lightweight optimization techniques, enabling broader real-world deployment.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom