z-logo
open-access-imgOpen Access
ESIFCD: A Dual-Temporal Cross-Domain Fusion Network for Remote Sensing Change Detection
Author(s) -
Russo Ashraf,
Adri Priadana,
Xuan-Thuy Vo,
Ge Cao,
Tien-Dat Tran,
Kang-Hyun Jo
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3612956
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Remote sensing change detection (RSCD) must remain reliable under illumination, seasonal, and viewpoint variability while handling objects that range from narrow roads to block-scale complexes. We present Efficient Spectral Inception Former Change Detector ( ESIFCD ), an efficient dual-temporal network that couples domain-aligned differencing with multiple cross-resolution exchange and lightweight global context encoding. ESIFCD introduces: (i) an Adaptive Difference Fusion Unit (ADFU) that fuses absolute and learned differences over spatial, spectral, and texture features; (ii) a Cross-Resolution Aggregation Module (CRAM) that repeatedly exchanges evidence among four scales to mitigate scale disagreement; (iii) a Patch-Aware Context Encoder (PACE) that models long-range dependencies via tokenized patches with modest cost; and (iv) a fast GLCM-Stats Encoder that supplies texture semantics (ASM/Contrast/Correlation) without analytical histograms. A re-parameterized large-kernel block (RLKB) sharpens local detail, and a late MetaCNNGate learns pixel-wise weights over one primary and four auxiliary heads to consolidate their complementary expertise (small vs. large structures). Across six public benchmarks, ESIFCD attains the best F1 on each dataset: 93.19% (LEVIR-CD), 87.89% (LEVIR-CD+), 70.10% (S2Looking), 95.95% (CDD), 83.48% (SYSU-CD), and 95.30% (WHU-CD). The model is efficient and fast (12.33 GFLOPs; 82.07 images/s at 2562), and with automatic mixed precision reaches over 165 images/s while maintaining accuracy, surpassing recent strong baselines under the same setting. Ablations confirm that ADFU improves robustness to radiometric and registration noise, CRAM preserves thin structures and prevents fragmentation on large objects, PACE improves cluttered scenes, and MetaCNNGate consistently lifts F1 on all benchmarks.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom