z-logo
open-access-imgOpen Access
A Recent Survey of Vision Transformers for Medical Image Segmentation
Author(s) -
Asifullah Khan,
Zunaira Rauf,
Abdul Rehman Khan,
Saima Rathore,
Saddam Hussain Khan,
NajamusSaher Shah,
Umair Farooq,
Hifsa Asif,
Aqsa Asif,
Umme Zahoora,
Rafi Ullah Khalil,
Suleman Qamar,
Umme Hani Tayyab,
Faiza Babar Khan,
Abdul Majid,
Jeonghwan Gwak
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3618215
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Medical image segmentation plays a crucial role in various healthcare applications, enabling accurate diagnosis, treatment planning, and disease monitoring. Convolutional Neural Networks (CNNs) demonstrated exceptional performance in this domain for their proficiency in learning complex patterns from raw data. In recent years, Vision Transformers (ViTs) have gained significant attention as an effective approach for various challenges in image analysis. However, they may lack image-related inductive bias and translational invariance that may affect their performance. To address this, Hybrid Vision Transformers (HVTs) have been introduced, combining CNNs and Transformer layers to effectively analyze features at both local and global scales. Building on the success of ViTs and HVTs, this paper reviews recent advancements in these architectures for medical image segmentation. We classify approaches based on architectural design and review state-of-the-art models for different imaging modalities, analyzing their limitations and potential solutions. Additionally, we highlight key challenges, discuss current trends and propose future research directions in the field. This review aims to provide valuable insights for researchers and professionals working on ViT-based medical image segmentation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom