Global Structural Knowledge Distillation for Semantic Segmentation

Open Access


Author(s) - Hyejin Park, Keonhee Ahn, Hyesong Choi, Dongbo Min

Publication year - 2025

Publication title - IEEE Access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3575066

Subject(s) - aerospace; bioengineering; communication, networking and broadcast technologies; components, circuits, devices and systems; computing and processing; engineered materials, dielectrics and plasmas; engineering profession; fields, waves and electromagnetics; general topics for engineers; geoscience; nuclear engineering; photonics and electrooptics; power, energy and industry applications; robotics and control systems; signal processing and analysis; transportation

Knowledge distillation (KD) has become a cornerstone for compressing deep neural networks, allowing a smaller student model to learn from a larger teacher model. In semantic segmentation, traditional KD methods primarily focus on pixel-level feature alignment, where the student model is trained to match the teacher’s features at each pixel. Although this improves performance, pixel-level alignment can introduce noise and redundant information, particularly in complex scenes, and often overlooks the global structural context that is crucial for robust segmentation. To overcome these limitations, we propose Global Structural Knowledge Distillation (GSKD), a novel approach that moves beyond dense pixel-level alignment. Instead of aligning features pixel by pixel, we focus on capturing and transferring global structural information within an image. Our method begins with Class-Balanced Sampling (CBS), which ensures that representative features from various classes are sampled evenly from the teacher’s feature maps. This helps the model better represent both common and rare classes, addressing class imbalance. Next, we construct a Global Structural Similarity Map (GSSM) for both the teacher and student models. This map encodes the key structural patterns of the image by calculating pairwise similarities between the sampled points, providing the structural information of the scene. To enhance the knowledge transfer process, we generate Sub-Image Descriptors (SID) through row-wise shuffling and column-wise grouping of the GSSM. These descriptors allow the student model to capture high-level semantic relationships and structural patterns, overcoming the limitations of traditional pixel-level feature alignment. The proposed method is designed to be flexible: it can be used both as a standalone method and as a plug-and-play module for integration with existing KD techniques. Our extensive experiments demonstrate that GSKD consistently outperforms or matches recent KD methods in standalone settings and significantly enhances the performance of state-of-the-art KD methods when incorporated as a plug-and-play module.
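
The three steps named in the abstract (CBS, GSSM, SID) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not specify the sampling budget, the similarity measure, how descriptors are formed from the column groups, or the loss. The sketch assumes cosine similarity, an equal per-class sampling quota, a descriptor obtained by averaging each column group, and a mean-squared-error loss. All function names and parameters (`class_balanced_sample`, `similarity_map`, `sub_image_descriptors`, `n_points`, `n_groups`) are hypothetical.

```python
import numpy as np

def class_balanced_sample(labels, n_points, rng):
    """Pick pixel indices with an equal quota per class present (assumed CBS variant)."""
    flat = labels.ravel()
    classes = np.unique(flat)
    per_class = n_points // len(classes)
    picked = []
    for c in classes:
        idx = np.flatnonzero(flat == c)
        # Sample with replacement only if the class has fewer pixels than its quota.
        picked.append(rng.choice(idx, size=per_class, replace=len(idx) < per_class))
    picked = np.concatenate(picked)
    remainder = n_points - len(picked)
    if remainder > 0:
        # Fill any leftover budget with uniformly random pixels.
        picked = np.concatenate([picked, rng.choice(flat.size, size=remainder)])
    return picked

def similarity_map(features, idx):
    """Pairwise cosine similarity of sampled feature vectors; features is (C, H, W)."""
    channels = features.shape[0]
    sampled = features.reshape(channels, -1)[:, idx].T            # (N, C)
    sampled = sampled / (np.linalg.norm(sampled, axis=1, keepdims=True) + 1e-8)
    return sampled @ sampled.T                                    # (N, N)

def sub_image_descriptors(sim, perm, n_groups):
    """Shuffle rows, split columns into groups, average each group (assumed SID variant)."""
    shuffled = sim[perm]                                          # row-wise shuffling
    groups = np.array_split(shuffled, n_groups, axis=1)           # column-wise grouping
    return np.stack([g.mean(axis=1) for g in groups], axis=1)     # (N, n_groups)

# Toy data: 4 classes on a 16x16 grid; teacher and student have different channel widths.
rng = np.random.default_rng(0)
labels = rng.integers(0, 4, size=(16, 16))
teacher = rng.standard_normal((32, 16, 16))
student = rng.standard_normal((8, 16, 16))

idx = class_balanced_sample(labels, 64, rng)   # same locations for teacher and student
perm = rng.permutation(len(idx))               # same row shuffle for both maps
t_sim = similarity_map(teacher, idx)
s_sim = similarity_map(student, idx)
t_sid = sub_image_descriptors(t_sim, perm, 8)
s_sid = sub_image_descriptors(s_sim, perm, 8)

# Assumed distillation loss: mean squared error between descriptors.
loss = np.mean((t_sid - s_sid) ** 2)
```

In this sketch the similarity map has shape N x N regardless of channel width, so teacher and student features can be compared without a projection layer. It also assumes the two feature maps share the same spatial resolution, so one set of sampled indices applies to both.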
