Pedestrian Trajectory Prediction via Window Attention and Spatial Graph Interaction Network

Xiang Gu; Chao Li; Jie Yang; Jing Wang; Qiwei Huang


Open Access


Publication year - 2025

Publication title - IEEE Access

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.587

H-Index - 127

eISSN - 2169-3536

DOI - 10.1109/access.2025.3573782

Subject(s) - aerospace; bioengineering; communication, networking and broadcast technologies; components, circuits, devices and systems; computing and processing; engineered materials, dielectrics and plasmas; engineering profession; fields, waves and electromagnetics; general topics for engineers; geoscience; nuclear engineering; photonics and electrooptics; power, energy and industry applications; robotics and control systems; signal processing and analysis; transportation

Accurate pedestrian trajectory prediction is crucial for the safety of autonomous driving systems. However, the task still faces challenges in modeling long-term dependencies, complex spatial interactions, and multi-scale feature fusion. To address these issues, this paper proposes WAGIN (Windowed Attention Graph Interaction Network). In the temporal dimension, a window mask mechanism adjusts the attention receptive field at each time step to capture temporal dependencies. In the spatial dimension, a hierarchical heterogeneous graph convolutional network (GCN) combines dynamic pedestrian interaction graphs with static scene-semantic graphs, and an interaction kernel function based on motion consistency models the interactions between individual pedestrians. Finally, a multi-scale dilated convolution network generates future trajectories, capturing multi-scale spatiotemporal features to improve prediction accuracy and robustness. Experiments on the public ETH/UCY datasets show improvements of 23% in average displacement error (ADE) and 21% in final displacement error (FDE) over baseline methods. Qualitative analysis further indicates that the model generalizes well across different scenarios.
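To make the abstract's evaluation metrics and the window-mask idea concrete, here is a minimal sketch in plain Python. ADE and FDE follow their standard definitions (mean and final-step Euclidean distance between predicted and ground-truth positions). The mask is only an illustration: the abstract does not specify how WAGIN sizes or shapes its window, so the fixed-width causal band and the names `displacement_errors`, `window_mask`, and `masked_softmax` are assumptions, not the authors' implementation.

```python
import math


def displacement_errors(pred, truth):
    """Return (ADE, FDE) for one trajectory given as lists of (x, y) points."""
    if len(pred) != len(truth) or not pred:
        raise ValueError("trajectories must be non-empty and equally long")
    dists = [math.dist(p, t) for p, t in zip(pred, truth)]
    # ADE averages the per-step error; FDE is the error at the last step.
    return sum(dists) / len(dists), dists[-1]


def window_mask(num_steps, window):
    """Assumed mask shape: step i may attend to steps i-window+1 .. i."""
    return [
        [0 <= i - j < window for j in range(num_steps)]
        for i in range(num_steps)
    ]


def masked_softmax(scores, mask_row):
    """Softmax over allowed positions; masked positions get weight 0."""
    allowed = [s for s, m in zip(scores, mask_row) if m]
    peak = max(allowed)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) if m else 0.0 for s, m in zip(scores, mask_row)]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    predicted = [(0, 0), (1, 0), (2, 0)]
    observed = [(0, 0), (1, 1), (2, 2)]
    print(displacement_errors(predicted, observed))  # (1.0, 2.0)

    mask = window_mask(num_steps=4, window=2)
    # Step 2 can see only steps 1 and 2, so equal scores split evenly.
    print(masked_softmax([1.0, 1.0, 1.0, 1.0], mask[2]))  # [0.0, 0.5, 0.5, 0.0]
```

Widening `window` enlarges the receptive field of each time step. A per-step window, as the abstract's "at each time step" wording suggests, could be modeled by passing a list of widths instead of a single integer.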
