A Mobile Data-Driven Hierarchical Deep Reinforcement Learning Approach for Real-time Demand-Responsive Railway Rescheduling and Station Overcrowding Mitigation

Enze Liu,Zhiyuan Lin,Judith Y.T. Wang,Hong Chen
2023-11-07
Abstract:Real-time railway rescheduling is an important technique to enable operational recovery in response to unexpected and dynamic conditions in a timely and flexible manner. Current research relies mostly on OD based data and model-based methods for estimating train passenger demands. These approaches primarily focus on averaged disruption patterns, often overlooking the immediate uneven distribution of demand over time. In reality, passenger demand deviates significantly from predictions, especially during a disaster. Disastrous situations such as flood in Zhengzhou, China in 2022 has created not only unprecedented effect on Zhengzhou railway station itself, which is a major railway hub in China, but also other major hubs connected to Zhengzhou, e.g., Xi'an, the closest hub west of Zhengzhou. In this study, we define a real-time demand-responsive (RTDR) railway rescheduling problem focusing two specific aspects, namely, volatility of the demand, and management of station crowdedness. For the first time, we propose a data-driven approach using real-time mobile data (MD) to deal with this RTDR problem. A hierarchical deep reinforcement learning (HDRL) framework is designed to perform real-time rescheduling in a demand-responsive manner. The use of MD has enabled the modelling of passenger dynamics in response to train delays and station crowdedness, and a real-time optimisation for rescheduling of train services in view of the change in demand as a result of passengers' behavioural response to disruption. Results show that the agent can steadily satisfy over 62% of the demand with only 61% of the original rolling stock, ensuring continuous operations without overcrowding. Moreover, the agent exhibits adaptability when transferred to a new environment with increased demand, highlighting its effectiveness in addressing unforeseen disruptions in real-time settings.
Systems and Control,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: when the railway system encounters long - term and complex emergencies (such as natural disasters or man - made accidents), how to deal with the fluctuation of passenger demand and the overcrowding of stations through real - time demand - responsive train rescheduling. Specifically, the paper focuses on two aspects: 1. **Demand Fluctuation Management**: During emergencies, passenger demand may deviate significantly from historical data and model predictions. Especially in disaster situations, passenger behavior is affected by multiple factors. Therefore, how to accurately capture and respond to these real - time changing demands is a challenge. 2. **Station Overcrowding Alleviation**: When trains are delayed, the number of passengers in the station will gradually increase, leading to overcrowding, which in turn will cause a series of secondary risks, such as pedestrian panic, stampede accidents, and disease transmission. Therefore, how to effectively manage and alleviate the overcrowding in the station is also a key issue. To solve these problems, the author proposes a Mobile Data (MD) - driven Hierarchical Deep Reinforcement Learning (HDRL) framework. This framework can use real - time mobile data to model passenger dynamics and perform real - time optimal scheduling of train services according to passenger behavior responses. In this way, the HDRL framework can effectively adjust the train scheduling plan in the face of emergencies to meet the changing demands and avoid overcrowding in stations. ### Main Contributions - **Data - Driven Method**: For the first time, a real - time demand - responsive method based on mobile data is introduced, which solves the limitations of relying on static OD data and model predictions in traditional methods. - **Hierarchical Deep Reinforcement Learning Framework**: A HDRL framework is designed, which can make efficient real - time scheduling decisions in complex and changeable environments. - **Experimental Verification**: The effectiveness and adaptability of the proposed method are verified through actual cases (such as railway scheduling during the 2022 Zhengzhou flood), showing its advantages in dealing with unforeseen emergencies. ### Formula Representation Some key formulas involved in the paper can be represented as follows: - **Demand Response Function**: \[ D(t)=f(P_t, T_t, C_t) \] where \(D(t)\) represents the demand at time \(t\), \(P_t\) is the passenger behavior characteristic, \(T_t\) is the train state, and \(C_t\) is the station condition. - **Reward Function**: \[ R(s_t, a_t)=w_1\cdot U_{\text{demand}}+w_2\cdot U_{\text{capacity}}-w_3\cdot P_{\text{overcrowding}}-w_4\cdot P_{\text{delay}} \] where \(s_t\) is the current state, \(a_t\) is the action taken, \(w_i\) are weight coefficients, \(U_{\text{demand}}\) and \(U_{\text{capacity}}\) are demand satisfaction and capacity utilization respectively, and \(P_{\text{overcrowding}}\) and \(P_{\text{delay}}\) are overcrowding penalty and delay penalty respectively. Through these formulas, the HDRL framework can dynamically adjust the scheduling strategy according to real - time data, so as to better deal with complex situations in emergencies.