Abstract: Crowd simulations play a pivotal role in building design, influencing both user experience and public safety. While traditional knowledge-driven models have their merits, data-driven crowd simulation models promise to bring a new dimension of realism to these simulations. However, most of the existing data-driven models are designed for specific geometries, leading to poor adaptability and applicability. A promising strategy for enhancing the adaptability and realism of data-driven crowd simulation models is to incorporate visual information, including the scenario geometry and pedestrian locomotion. Consequently, this paper proposes a novel visual-information-driven (VID) crowd simulation model. The VID model predicts the pedestrian velocity at the next time step based on the prior social-visual information and motion data of an individual. A radar-geometry-locomotion method is established to extract the visual information of pedestrians. Moreover, a temporal convolutional network (TCN)-based deep learning model, named social-visual TCN, is developed for velocity prediction. The VID model is tested on three public pedestrian motion datasets with distinct geometries, i.e., corridor, corner, and T-junction. Both qualitative and quantitative metrics are employed to evaluate the VID model, and the results highlight the improved adaptability of the model across all three geometric scenarios. Overall, the proposed method demonstrates effectiveness in enhancing the adaptability of data-driven crowd models.
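
The abstract does not spell out how the radar-geometry-locomotion method samples the scene, so the following is only a minimal sketch of one plausible "radar" step: casting evenly spaced rays from a pedestrian and recording the distance to the nearest wall along each ray. The function names, the field of view, the number of rays, and the representation of walls as 2D line segments are all assumptions made for illustration and are not taken from the paper.

```python
import math

def ray_segment_distance(origin, angle, seg_a, seg_b):
    """Distance along a ray (origin, angle) to segment a-b, or None if it misses."""
    ox, oy = origin
    dx, dy = math.cos(angle), math.sin(angle)
    ax, ay = seg_a
    ex, ey = seg_b[0] - ax, seg_b[1] - ay
    denom = dx * ey - dy * ex  # 2D cross product of ray and segment directions
    if abs(denom) < 1e-12:
        return None  # ray is parallel to the segment
    # Solve origin + t * d = a + u * e for t (along ray) and u (along segment).
    t = ((ax - ox) * ey - (ay - oy) * ex) / denom
    u = ((ax - ox) * dy - (ay - oy) * dx) / denom
    if t >= 0.0 and 0.0 <= u <= 1.0:
        return t
    return None

def radar_scan(position, heading, walls, n_rays=5, fov=math.pi, max_range=10.0):
    """Hypothetical radar scan: one wall distance per ray, clipped at max_range.

    walls is a list of ((x1, y1), (x2, y2)) segments (an assumed representation).
    """
    distances = []
    for i in range(n_rays):
        # Spread rays evenly across the field of view, centred on the heading.
        offset = 0.0 if n_rays == 1 else -fov / 2 + i * fov / (n_rays - 1)
        best = max_range
        for seg_a, seg_b in walls:
            d = ray_segment_distance(position, heading + offset, seg_a, seg_b)
            if d is not None and d < best:
                best = d
        distances.append(best)
    return distances

# Example: a pedestrian at the origin facing +x, with a wall 2 m ahead.
scan = radar_scan((0.0, 0.0), 0.0, [((2.0, -5.0), (2.0, 5.0))],
                  n_rays=3, fov=math.pi / 2)
```

A vector like `scan` is the kind of geometry feature that could be fed to a velocity predictor alongside motion data. Whether the paper encodes geometry this way is not stated in the abstract.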

What problem does this paper attempt to address?

The main problem this paper attempts to address is improving the adaptability and realism of data-driven pedestrian simulation models. Specifically:

1. **Problems with existing models**:
   - Current data-driven models are usually designed for specific geometric structures, resulting in poor generality and applicability in different scenarios.
   - Most existing data-driven models use pedestrian motion data from the test scenarios during training, which limits the practicality of the models in real-world applications.
2. **Proposed new method**:
   - The paper proposes a new visual-information-driven (VID) pedestrian simulation model, which improves adaptability and realism by combining scene geometry and pedestrian visual information.
   - The model uses the radar-geometry-locomotion (RGL) method to extract visual information and employs a deep learning model based on a temporal convolutional network (TCN), called social-visual TCN, for velocity prediction.
3. **Objectives**:
   - By introducing visual information, improve the adaptability and realism of data-driven models under different geometric structures.
   - Validate the effectiveness of the model on three public pedestrian motion datasets covering different geometric structures: corridors, corners, and T-junctions.
4. **Contributions**:
   - Emphasize the importance of visual information in improving the adaptability and realism of data-driven pedestrian simulation models.
   - Propose a VID model that includes a Data Processing (DP) module, a Velocity Prediction (VP) module, and a Rolling Forecast (RF) module.
   - Experimental results show that the proposed VID model performs well under different geometric structures, both qualitatively and quantitatively.

In summary, this paper aims to address the issue of insufficient adaptability of existing data-driven pedestrian simulation models under different geometric structures by introducing visual information, thereby improving the realism and practicality of the models.
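
The summary names a Velocity Prediction module and a Rolling Forecast module but gives no implementation details. As a rough illustration of how next-step velocity prediction can be rolled forward into a trajectory, here is a generic autoregressive rollout sketch. The state layout `(x, y, vx, vy)`, the function names, and the simple explicit position update are assumptions. `mean_velocity_predictor` is a hypothetical stand-in for a trained model such as the social-visual TCN and is not the paper's predictor.

```python
from collections import deque

def rolling_forecast(history, predict_velocity, n_steps, dt):
    """Roll a next-step velocity predictor forward for n_steps.

    history: list of (x, y, vx, vy) states, oldest first (assumed layout).
    predict_velocity: callable mapping a window of states to (vx, vy).
    """
    # Fixed-length window: appending a new state drops the oldest one.
    window = deque(history, maxlen=len(history))
    trajectory = []
    for _ in range(n_steps):
        vx, vy = predict_velocity(list(window))
        x, y, _, _ = window[-1]
        # Advance the position with the predicted velocity.
        new_state = (x + vx * dt, y + vy * dt, vx, vy)
        window.append(new_state)  # the prediction becomes input for the next step
        trajectory.append(new_state)
    return trajectory

def mean_velocity_predictor(window):
    """Hypothetical placeholder model: average velocity over the window."""
    n = len(window)
    return (sum(s[2] for s in window) / n, sum(s[3] for s in window) / n)

# Example: a pedestrian walking along +x at 1 m/s, forecast for 3 steps of 0.5 s.
past = [(0.0, 0.0, 1.0, 0.0), (1.0, 0.0, 1.0, 0.0)]
future = rolling_forecast(past, mean_velocity_predictor, n_steps=3, dt=0.5)
```

Because each predicted state is fed back as input, prediction errors can accumulate over the rollout, which is one reason rolled-out trajectories are usually evaluated in addition to single-step accuracy.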

Visual-information-driven model for crowd simulation using temporal convolutional network

Visually-Guided Pedestrian Crowd Simulation

A Radar-Nearest-Neighbor Based Data-Driven Approach for Crowd Simulation

A Data-driven Crowd Simulation Framework Integrating Physics-informed Machine Learning with Navigation Potential Fields

Visual information based social force model for crowd evacuation

Modeling social interaction and intention for pedestrian trajectory prediction

SocialCVAE: Predicting Pedestrian Trajectory via Interaction Conditioned Latents

Pedestrian Flow Prediction in Open Public Places Using Graph Convolutional Network

Pedestrian Volume Prediction Using a Diffusion Convolutional Gated Recurrent Unit Model

Top-view Trajectories: A Pedestrian Dataset of Vehicle-Crowd Interaction from Controlled Experiments and Crowded Campus

Deep Fundamental Diagram Network for Real-Time Pedestrian Dynamics Analysis

Multi-level Crowd Simulation Using Social LSTM

The Large-Scale Crowd Behavior Perception Based on Spatio-Temporal Viscous Fluid Field

Enhancing Pedestrian Trajectory Prediction with Crowd Trip Information

A Data-Driven Method for Crowd Simulation in Urban Scenes

Learning to Simulate Crowd Trajectories with Graph Networks

3DGCN: 3-Dimensional Dynamic Graph Convolutional Network for Citywide Crowd Flow Prediction

Dynamic Path Planning of Virtual Pedestrian in a Real Scene Video

Modeling Spatial-Temporal Interactions for Robot Crowd Navigation

MSTCNN: multi-modal spatio-temporal convolutional neural network for pedestrian trajectory prediction

Multi-information-based convolutional neural network with attention mechanism for pedestrian trajectory prediction