Visual-information-driven model for crowd simulation using temporal convolutional network

Xuanwen Liang, Eric Wai Ming Lee
DOI: https://doi.org/10.1109/TITS.2024.3375528
2024-04-09
Abstract: Crowd simulations play a pivotal role in building design, influencing both user experience and public safety. While traditional knowledge-driven models have their merits, data-driven crowd simulation models promise to bring a new dimension of realism to these simulations. However, most of the existing data-driven models are designed for specific geometries, leading to poor adaptability and applicability. A promising strategy for enhancing the adaptability and realism of data-driven crowd simulation models is to incorporate visual information, including the scenario geometry and pedestrian locomotion. Consequently, this paper proposes a novel visual-information-driven (VID) crowd simulation model. The VID model predicts the pedestrian velocity at the next time step based on the prior social-visual information and motion data of an individual. A radar-geometry-locomotion method is established to extract the visual information of pedestrians. Moreover, a temporal convolutional network (TCN)-based deep learning model, named social-visual TCN, is developed for velocity prediction. The VID model is tested on three public pedestrian motion datasets with distinct geometries, i.e., corridor, corner, and T-junction. Both qualitative and quantitative metrics are employed to evaluate the VID model, and the results highlight the improved adaptability of the model across all three geometric scenarios. Overall, the proposed method demonstrates effectiveness in enhancing the adaptability of data-driven crowd models.
Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
The main problem this paper attempts to address is improving the adaptability and realism of data-driven pedestrian simulation models. Specifically:

1. **Problems with existing models**:
   - Current data-driven models are usually designed for specific geometries, so they generalize poorly to other scenarios.
   - Most existing data-driven models are trained on pedestrian motion data from the same scenarios they are tested on, which limits their practicality in real-world applications.
2. **Proposed new method**:
   - The paper proposes a new visual-information-driven (VID) pedestrian simulation model, which improves adaptability and realism by combining scene geometry with pedestrian visual information.
   - The model uses the radar-geometry-locomotion (RGL) method to extract visual information and a deep learning model based on a temporal convolutional network (TCN), called social-visual TCN, for velocity prediction.
3. **Objectives**:
   - By introducing visual information, improve the adaptability and realism of data-driven models across different geometries.
   - Validate the model on three public pedestrian motion datasets with distinct geometries: corridor, corner, and T-junction.
4. **Contributions**:
   - Emphasize the importance of visual information in improving the adaptability and realism of data-driven pedestrian simulation models.
   - Propose a VID model that includes a data processing (DP) module, a velocity prediction (VP) module, and a rolling forecast (RF) module.
   - Experimental results show that the proposed VID model performs well across the different geometries, both qualitatively and quantitatively.
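The temporal convolutional network mentioned above is built on causal, dilated 1-D convolutions: each output depends only on the current and earlier time steps. The following is a minimal pure-Python sketch of that single building block, not the paper's social-visual TCN. The function names `causal_dilated_conv1d` and `receptive_field`, the zero padding on the left, and the one-convolution-per-layer assumption are illustrative choices that do not come from the source.

```python
from typing import List, Sequence


def causal_dilated_conv1d(
    x: Sequence[float], kernel: Sequence[float], dilation: int = 1
) -> List[float]:
    """Compute y[t] = sum_k kernel[k] * x[t - k * dilation].

    Samples before the start of the sequence are treated as zero, so the
    output has the same length as the input and never looks at the future.
    """
    if dilation < 1:
        raise ValueError("dilation must be a positive integer")
    out: List[float] = []
    for t in range(len(x)):
        acc = 0.0
        for k, weight in enumerate(kernel):
            idx = t - k * dilation  # index of the past sample this tap reads
            if idx >= 0:
                acc += weight * x[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Number of input steps seen by a stack with one causal conv per layer."""
    return 1 + (kernel_size - 1) * sum(dilations)


if __name__ == "__main__":
    speeds = [1.0, 2.0, 3.0, 4.0]  # hypothetical per-step speeds
    print(causal_dilated_conv1d(speeds, [1.0, 1.0], dilation=2))
    print(receptive_field(kernel_size=2, dilations=[1, 2, 4]))
```

Stacking such layers with growing dilations (for example 1, 2, 4) widens the history each prediction can draw on without using any future samples, which is the property a next-step velocity predictor needs.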
In summary, this paper aims to address the issue of insufficient adaptability of existing data-driven pedestrian simulation models under different geometric structures by introducing visual information, thereby improving the realism and practicality of the models.
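The rolling forecast idea, where each predicted velocity is fed back as input for the next step, can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's RF module: the window here holds only past velocities (the VID model also uses social-visual information), positions are advanced with a plain Euler step, and `rolling_forecast`, `mean_velocity`, and the `dt` parameter are hypothetical names introduced for this example.

```python
from collections import deque
from typing import Callable, Deque, List, Sequence, Tuple

Vec = Tuple[float, float]


def rolling_forecast(
    position: Vec,
    history: Sequence[Vec],
    predict_velocity: Callable[[List[Vec]], Vec],
    steps: int,
    dt: float,
) -> List[Vec]:
    """Roll a next-step velocity predictor forward for `steps` time steps.

    `history` is the initial window of observed velocities. After each step
    the predicted velocity replaces the oldest entry in the window.
    """
    if not history:
        raise ValueError("history must contain at least one velocity")
    window: Deque[Vec] = deque(history, maxlen=len(history))
    x, y = position
    trajectory: List[Vec] = [(x, y)]
    for _ in range(steps):
        vx, vy = predict_velocity(list(window))
        # Euler update of the position (an assumption of this sketch).
        x, y = x + vx * dt, y + vy * dt
        trajectory.append((x, y))
        window.append((vx, vy))  # feed the prediction back in
    return trajectory


def mean_velocity(window: List[Vec]) -> Vec:
    """Stand-in predictor: average of the window (placeholder for a model)."""
    n = len(window)
    return (sum(v[0] for v in window) / n, sum(v[1] for v in window) / n)


if __name__ == "__main__":
    path = rolling_forecast(
        (0.0, 0.0), [(1.0, 0.0), (1.0, 0.0)], mean_velocity, steps=3, dt=0.5
    )
    print(path)
```

Because every prediction becomes part of the next input, errors can accumulate over the rollout, which is one reason a simulation of this kind is evaluated on whole trajectories rather than on single-step accuracy alone.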