
StackTrans–Multimodal Heart Disease Detection Using Stacked Transformer Fusion Framework
Author(s) -
Muhammad Adnan,
Yang Yi,
Enci Wang,
Md Nasir Imtiaz
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3574310
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Cardiac disease diagnosis demands precise interpretation of complex physiological signals, yet existing systems often rely on unimodal data and lack adaptive fusion strategies. Most conventional frameworks fall short in capturing intermodal dependencies and adjusting to performance variations across heterogeneous inputs. This study introduces StackTrans, a transformer-based multimodal classification framework designed to improve diagnostic accuracy through ECG and PCG signal integration. The architecture comprises modality-specific transformer encoders, a bidirectional cross-modal fusion transformer that facilitates latent-level attention between modalities, and a stacked ensemble mechanism governed by a meta-learner. Residual learning modules enhance prediction refinement, while entropy-guided adaptive voting improves confidence-weighted decision reliability. The PCG and ECG modules are independently trained using the PhysioNet 2016 and MIT-BIH Arrhythmia datasets, respectively, and integrated through joint inference. Evaluations using TensorFlow on an NVIDIA RTX GPU demonstrate that StackTrans attains a precision of 98.6%, an F1 score of 98.4%, and an AUC of 0.99—outperforming unimodal ECG and PCG models by 2.5% and 7.5%, respectively.