An Improved Low-Bit-Rate Image Compression Framework Based on Semantic-aware Model and Neighborhood Attention | Zendy

Chengbin Zeng | Zendy; Liang Zhang | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

An Improved Low-Bit-Rate Image Compression Framework Based on Semantic-aware Model and Neighborhood Attention

Author(s) -

Chengbin Zeng,

Liang Zhang

Publication year - 2025

Publication title -

ieee access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3596711

Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation

In recent years, image compression techniques based on deep neural networks have achieved significant advancements, outperforming traditional methods in delivering higher compression efficiency at lower bit rates. However, existing approaches often result in degraded image quality at lowbit rates, leading to distortions in critical regions such as faces and text during decoding. To address these limitations, we propose a robust image compression framework based on a semantic-aware model and a hyper-prior encoder with neighborhood attention. First, we utilize the encoder of the semantic-aware model to transform the input image into a latent space Z. To further improve the information representation within the latent space Z, we design a hyper-prior encoder, which leverages neighborhood attention to perform feature enhancement and transformation. This process can minimize quantization errors and facilitate efficient vector quantization. The refined latent space is then quantized using a residual vector quantization technique to ensure efficient and compact representation. Finally, entropy coding is applied to the quantized latent space, enabling high compression efficiency. Experimental results on public benchmark datasets show that our proposed framework achieves comparable results to current mainstream image compression methods. In addition, our proposed algorithm effectively mitigates distortions in facial and textual regions while preserving the structural integrity and visual fidelity. In future work, we plan to improve the model’s robustness under low-light conditions and enhance compression efficiency through lightweight optimization techniques, enabling broader real-world deployment.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research