
Pedestrian Trajectory Prediction via Window Attention and Spatial Graph Interaction Network
Author(s) -
Xiang Gu,
Chao Li,
Jie Yang,
Jing Wang,
Qiwei Huang
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3573782
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
The accuracy of pedestrian trajectory prediction is crucial for the safety of autonomous driving systems. However, the task still faces challenges in modeling long-term dependencies, complex spatial interactions, and multi-scale feature fusion. To address these issues, this paper proposes the WAGIN(Windowed Attention Graph Interaction Network) model. First, in the temporal dimension, a window mask mechanism is designed to adjust the attention receptive field at each time step, effectively capturing temporal dependencies. In the spatial dimension, a hierarchical heterogeneous GCN(graph convolutional network) is constructed, combining pedestrian dynamic interaction graphs and scene semantic static graphs. Additionally, an interaction kernel function based on motion consistency is proposed to model the interactions between individual pedestrians. Finally, a multi-scale dilated convolution network is employed for future trajectory generation, capturing multi-scale spatiotemporal features through dilated convolutions to enhance prediction accuracy and robustness. The model is experimentally validated on the public ETH/UCY dataset, and the results demonstrate its effectiveness, achieving improvements of 23% in average displacement error (ADE) and 21% in final displacement error (FDE) over baseline methods. Moreover, qualitative analysis reveals the model’s excellent generalization ability in handling different scenarios.