z-logo
open-access-imgOpen Access
SQ-GAN: Semantic Image Communications Using Masked Vector Quantization
Author(s) -
Francesco Pezone,
Sergio Barbarossa,
Giuseppe Caire
Publication year - 2025
Publication title -
ieee transactions on cognitive communications and networking
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 1.421
H-Index - 25
eISSN - 2332-7731
DOI - 10.1109/tccn.2025.3620819
Subject(s) - communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing
This work introduces Semantically Masked Vector Quantized Generative Adversarial Network (SQ-GAN), a novel approach integrating semantically driven image coding and vector quantization to optimize image compression for semantic/task-oriented communications. The method only acts on source coding and is fully compliant with legacy systems. The semantics is extracted from the image computing its semantic segmentation map using off-the-shelf software. A new specifically developed semantic-conditioned adaptive mask module (SAMM) selectively encodes semantically relevant features of the image. The relevance of the different semantic classes is task-specific, and it is incorporated in the training phase by introducing appropriate weights in the loss function. SQ-GAN outperforms state-of-the-art image compression schemes such as JPEG2000, BPG, and deep-learning based methods across multiple metrics, including perceptual quality and semantic segmentation accuracy on the reconstructed image, at extremely low compression rates. The code is available at https://github.com/frapez1/SQ-GAN .

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom