Deep Reinforcement Learning for Emergency Resource Allocation in Urban Rail Stations

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

As the lifeline of modern urban transportation, urban rail transit faces complex challenges caused by rapid urbanization and population growth. Especially for subway stations, due to the spatiotemporal complexity of their passenger flow, the operational problems have become increasingly complex, and the challenge lies in the effective scheduling of subway station resources. Currently, the resource scheduling mainly depends on the work experience of the station leaders. However, excessive reliance on experience can easily lead to resource waste and low decision-making efficiency, especially in emergency or unexpected situations that may affect the overall system’s operational quality and passenger safety. To address this issue, this study proposes a deep reinforcement learning scheduling decision model based on a single objective and multiple constraints. This model comprehensively considers factors such as employee skill matching, task urgency, and multi-site linkage. By collecting on-site survey data and resource scheduling plans from subway stations, this model aims to intelligently optimize station resource scheduling. We implemented the Proximal Policy Optimization (PPO) algorithm and constructed two feedforward neural networks as critics and actors. In a reward-based personnel scheduling environment, this model shows higher scores compared to manual experience allocation and traditional heuristic algorithms.

Original languageEnglish
Title of host publicationCICTP 2024
Subtitle of host publicationResilient, Intelligent, Connected, and Lowcarbon Multimodal Transportation - Proceedings of the 24th COTA International Conference of Transportation Professionals
EditorsJianming Ma, Qin Luo, Lijun Sun, Baicheng Li, Jingjing Chen, Guohui Zhang
PublisherAmerican Society of Civil Engineers (ASCE)
Pages1411-1422
Number of pages12
ISBN (Electronic)9780784485484
DOIs
Publication statusPublished - 2024
Event24th COTA International Conference of Transportation Professionals: Resilient, Intelligent, Connected, and Lowcarbon Multimodal Transportation, CICTP 2024 - Shenzhen, China
Duration: 23 Jul 202426 Jul 2024

Publication series

NameCICTP 2024: Resilient, Intelligent, Connected, and Lowcarbon Multimodal Transportation - Proceedings of the 24th COTA International Conference of Transportation Professionals

Conference

Conference24th COTA International Conference of Transportation Professionals: Resilient, Intelligent, Connected, and Lowcarbon Multimodal Transportation, CICTP 2024
Country/TerritoryChina
CityShenzhen
Period23/07/2426/07/24

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Keywords

  • Deep reinforcement learning
  • Resource scheduling
  • Urban rail transit

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning for Emergency Resource Allocation in Urban Rail Stations'. Together they form a unique fingerprint.

Cite this