Premium
Predictive models for influence of primary delays using high‐speed train operation records
Author(s) -
Li Zhongcan,
Huang Ping,
Wen Chao,
Tang Yixiong,
Jiang Xi
Publication year - 2020
Publication title -
journal of forecasting
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.543
H-Index - 59
eISSN - 1099-131X
pISSN - 0277-6693
DOI - 10.1002/for.2685
Subject(s) - support vector machine , train , computer science , predictive value , predictive modelling , boosting (machine learning) , nat , data mining , machine learning , algorithm , artificial intelligence , geography , medicine , computer network , cartography
Primary delays are the driving force behind delay propagation, and predicting the number of affected trains (NAT) and the total time of affected trains (TTAT) due to primary delay (PD) can provide reliable decision support for real‐time train dispatching. In this paper, based on real operation data from 2015 to 2016 at several stations along the Wuhan–Guangzhou high‐speed railway, NAT and TTAT influencing factors were determined after analyzing the PD propagation mechanism. The eXtreme Gradient BOOSTing (XGBOOST) algorithm was used to establish a NAT predictive model, and several machine learning methods were compared. The importance of different delayinfluencing factors was investigated. Then, the TTAT predictive model (using support vector regression (SVR) algorithms) was established based on the NAT predictive model. Results indicated that the XGBOOST algorithm performed well with the NAT predictive model, and SVR was the optimal model for TTAT prediction under the verification index (i.e., the ratio of the difference between the actual and predicted value was less than 1/2/3/4/5 min). Real operational data in 2018 were used to test the applicability of the NAT and TTAT models over time, and findings suggest that these models exhibit sound applicability over time based on XGBOOST and SVR, respectively.