Intelligent Train Timetable Generation Technology Based on Monte Carlo Tree Search Algorithm
Downloads
This paper presents an innovative approach to train timetable generation using Monte Carlo tree search (MCTS) integrated with a deep reinforcement learning technique. The generation and adjustment of train timetables for high-speed railways represent a complex optimisation problem with numerous rule-based constraints that traditional mathematical methods struggle to solve efficiently. Therefore, the train timetable generation problem is modelled as a discrete spatiotemporal Markov decision process, and a comprehensive MCTS-based algorithm is developed to effectively balance exploration and exploitation through a structured tree search mechanism. The result of the comparative analysis demonstrates that MCTS-based algorithms significantly outperform state-of-the-art reinforcement learning algorithms, including double deep Q-network (DDQN) and proximal policy optimisation (PPO), achieving optimal solutions 6.5 times faster with superior training stability. To validate the scalability and real-world applicability, a large-scale case study involving 120 pairs of trains on the Beijing-Shanghai High-Speed Rail corridor over an 18-hour period successfully resolved all 45,600 initial conflicts. The optimised timetables yield significant operational improvements, including a 16.4% reduction in average delay time, 22.8% improvement in track utilisation efficiency and 9.7% reduction in energy consumption. This research contributes to the advancement of intelligent railway operations optimisation and demonstrates the potential of MCTS-based approaches to transform complex transportation problems.
Downloads
Xu G, et al. Optimize train capacity allocation for the high-speed railway mixed transportation of passenger and freight. Computers & Industrial Engineering. 2022;174:108788. DOI: 10.1016/j.cie.2022.108788.
Zhang C, et al. Integrated optimization of train scheduling and maintenance planning on high-speed railway corridors. Omega. 2019;87:86-104. DOI: 10.1016/j.omega.2018.08.005.
Zhang C, et al. Joint optimization of train scheduling and maintenance planning in a railway network: A heuristic algorithm using Lagrangian relaxation. Transportation Research Part B: Methodological. 2020;134:64-92. DOI: 10.1016/j.trb.2020.02.008.
Meng L, Zhou X. An integrated train service plan optimization model with variable demand: A team-based scheduling approach with dual cost information in a layered network. Transportation Research Part B: Methodological. 2019;125:1-28. DOI: 10.1016/j.trb.2019.02.017.
Cacchiani V, Qi J, Yang L. Robust optimization models for integrated train stop planning and timetabling with passenger demand uncertainty. Transportation Research Part B: Methodological. 2020;136:1-29. DOI: 10.1016/j.trb.2020.03.009.
Sparing D, Goverde RM. A cycle time optimization model for generating stable periodic railway timetables. Transportation Research Part B: Methodological. 2017;98:198-223. DOI: 10.1016/j.trb.2016.12.020.
Yuan F, Sun H, Kang L, Zhang S. Joint optimization of train scheduling and dynamic passenger flow control strategy with headway-dependent demand. Transportmetrica B: Transport Dynamics. 2022;10(1):627-51. DOI: 10.1080/21680566.2022.2025951.
Peeters LW. Cyclic railway timetable optimization. PhD thesis. Erasmus University Rotterdam; 2003. https://repub.eur.nl/pub/429
Li W, Ni S. Train timetabling with the general learning environment and multi-agent deep reinforcement learning. Transportation Research Part B: Methodological. 2022;157:230-51. DOI: 10.1016/j.trb.2022.02.006.
Yang F, et al. Single-track railway scheduling with a novel gridworld model and scalable deep reinforcement learning. Transportation Research Part C: Emerging Technologies. 2023;154:104237. DOI: 10.1016/j.trc.2023.104237.
Wei Z, et al. Editors. Reinforcement learning to rank with Markov decision process. Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval; 2017. DOI: 10.1145/3077136.3080685.
Wang X, Yang Z, Chen G, Liu Y. A reinforcement learning method of solving Markov decision processes: an adaptive exploration model based on temporal difference error. Electronics. 2023;12(19):4176. DOI: 10.3390/electronics12194176.
Conserva M, Rauber P. Hardness in Markov decision processes: Theory and practice. Advances in Neural Information Processing Systems. 2022;35:14824-38. DOI: 10.48550/arXiv.2210.13075.
Li D, Li T, Li J, Li Y. Integrated optimization of flexible train timetable, stop plan and exogenous passenger assignment under demand responsive condition. Journal of Intelligent Transportation Systems. 2024:1-28. DOI: 10.1080/15472450.2024.2438818.
Zhang X, Nie L, Wu X, Ke Y. How to optimize train stops under diverse passenger demand: a new line planning method for large-scale high-speed rail networks. Networks and Spatial Economics. 2020;20(4):963-88. DOI: 10.1007/s11067-020-09506-5.
Hassannayebi E, Sajedinejad A, Mardani S. Disruption management in urban rail transit system: a simulation based optimization approach. Handbook of research on emerging innovations in rail transportation engineering: IGI Global; 2016. p. 420-50. DOI: 10.4018/978-1-5225-0084-1.ch018.
Amorosi L, Dell’Olmo P, Giacco GL. Mathematical models for on-line train calendars generation. Computers & Operations Research. 2019;102:1-9. DOI: 10.1016/j.cor.2018.09.009.
Arenas D, Chevrier R, Hanafi S, Rodriguez J. Solving the train timetabling problem, a mathematical model and a genetic algorithm solution approach. HAL. 2015;2015. https://hal.science/hal-01338609v1
Cacchiani V, Furini F, Kidd MP. Approaches to a real-world train timetabling problem in a railway node. Omega. 2016;58:97-110. DOI: 10.1016/j.omega.2015.04.006.
Cacchiani V, Caprara A, Toth P. A column generation approach to train timetabling on a corridor. 4or. 2008;6(2):125-42. DOI: 10.1007/s10288-007-0037-5.
Chen KH, Du D, Zhang P. Monte-Carlo tree search and computer Go. Advances in Information and Intelligent Systems: Springer; 2009. p. 201-25. DOI: 10.1007/978-3-642-04141-9_10.
Yan XH, Cai BG, Ning B, ShangGuan W. Moving horizon optimization of dynamic trajectory planning for high-speed train operation. IEEE Transactions on Intelligent Transportation Systems. 2015;17(5):1258-70. DOI: 10.1109/TITS.2015.2499254
Fang Y, Zhang Y. China’s High-Speed Rail Technology: An International Perspective. Springer. 2018. DOI: 10.1007/978-981-10-5610-9.
Mahapatra R, Bhattacharyya S, Nag A. Intelligent communication networks: Research and applications. CRC Press; 2024. DOI: 10.1201/9781003303114.
Zhang X, Nie L. Integrating capacity analysis with high-speed railway timetabling: A minimum cycle time calculation model with flexible overtaking constraints and intelligent enumeration. Transportation Research Part C: Emerging Technologies. 2016;68:509-31. DOI: 10.1016/j.trc.2016.05.005.
Copyright (c) 2026 Junyuan HE, Zhangdui ZHONG, Bo LI, Jiaming FAN, Jianwen DING, Wei CHEN

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.













