SQ-GAN: Semantic Image Communications Using Masked Vector Quantization
Author(s) -
Francesco Pezone,
Sergio Barbarossa,
Giuseppe Caire
Publication year - 2025
Publication title -
ieee transactions on cognitive communications and networking
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 1.421
H-Index - 25
eISSN - 2332-7731
DOI - 10.1109/tccn.2025.3620819
Subject(s) - communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing
This work introduces Semantically Masked Vector Quantized Generative Adversarial Network (SQ-GAN), a novel approach integrating semantically driven image coding and vector quantization to optimize image compression for semantic/task-oriented communications. The method only acts on source coding and is fully compliant with legacy systems. The semantics is extracted from the image computing its semantic segmentation map using off-the-shelf software. A new specifically developed semantic-conditioned adaptive mask module (SAMM) selectively encodes semantically relevant features of the image. The relevance of the different semantic classes is task-specific, and it is incorporated in the training phase by introducing appropriate weights in the loss function. SQ-GAN outperforms state-of-the-art image compression schemes such as JPEG2000, BPG, and deep-learning based methods across multiple metrics, including perceptual quality and semantic segmentation accuracy on the reconstructed image, at extremely low compression rates. The code is available at https://github.com/frapez1/SQ-GAN .
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom