Squeeze-SwinFormer: Spectral Squeeze and Excitation Swin Transformer Network for Hyperspectral Image Classification | Zendy

Farhan Ullah | Zendy; Irfan Ullah | Zendy; Khalil Khan | Zendy; Salabat Khan | Zendy; Quan Wang | Zendy; Shabbab Ali Algamdi | Zendy; Haya Aldossary | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Squeeze-SwinFormer: Spectral Squeeze and Excitation Swin Transformer Network for Hyperspectral Image Classification

Author(s) -

Farhan Ullah,

Irfan Ullah,

Khalil Khan,

Salabat Khan,

Quan Wang,

Shabbab Ali Algamdi,

Haya Aldossary

Publication year - 2025

Publication title -

ieee journal of selected topics in applied earth observations and remote sensing

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 1.246

H-Index - 88

eISSN - 2151-1535

pISSN - 1939-1404

DOI - 10.1109/jstars.2025.3595434

Subject(s) - geoscience , signal processing and analysis , power, energy and industry applications

Hyperspectral Images (HSIs) are highly complex, containing a richer spectral dimension compared to conventional images. Deep learning methods are increasingly being applied to process this three-dimensional data for Hyperspectral Image Classification (HSIC). The Vision Transformer model is steadily gaining prominence in computer vision, emerging as a potential alternative to traditional convolutional neural network (CNN) architectures. Although transformers are powerful due to their self-attention mechanisms, they face challenges related to scalability and efficiency, particularly when processing high-resolution images. In order to solve these problems, in this paper, we propose a novel Squeeze-SwinFormer network for HSIC. This novel approach integrates Spectral Squeeze and Excitation (SE) blocks into the Swin Transformer architecture to enhance feature extraction and attention mechanisms in the model. The SE block dynamically recalibrates channel-wise feature responses, improving the model's focus on significant features. The Squeeze-SwinFormer, equipped with window-based multi-head self-attention, efficiently captures long-range dependencies with reduced computational complexity by partitioning input data into non-overlapping windows. Additionally, the SE block's integration within the Swin Transformer enables improved global contextual understanding by selectively weighting the feature maps. We further refine the architecture using various components, such as patch extraction and embedding layers, and a patch merging strategy, ensuring efficient multi-scale feature extraction. The proposed architecture demonstrates superior performance in tasks requiring attention to intricate details in HSIC. The comprehensive experimental results demonstrate that our proposed model achieved higher test accuracies against the state-of-the-art method, with results of 99.93%, 98.31%, 99.50%, and 97.75% on the SA, IP, PU, and KSC public benchmark HSIC datasets, respectively. Our approach also shows good generalization ability when applied to new datasets. Overall, our proposed approach represents a promising direction for HSIC.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research