Pedestrian Motion Prediction Using Transformer-based Behavior Clustering and Data-Driven Reachability Analysis

Kleio Fragkedaki,Frank J. Jiang,Karl H. Johansson,Jonas Mårtensson
2024-08-09
Abstract:In this work, we present a transformer-based framework for predicting future pedestrian states based on clustered historical trajectory data. In previous studies, researchers propose enhancing pedestrian trajectory predictions by using manually crafted labels to categorize pedestrian behaviors and intentions. However, these approaches often only capture a limited range of pedestrian behaviors and introduce human bias into the predictions. To alleviate the dependency on manually crafted labels, we utilize a transformer encoder coupled with hierarchical density-based clustering to automatically identify diverse behavior patterns, and use these clusters in data-driven reachability analysis. By using a transformer-based approach, we seek to enhance the representation of pedestrian trajectories and uncover characteristics or features that are subsequently used to group trajectories into different "behavior" clusters. We show that these behavior clusters can be used with data-driven reachability analysis, yielding an end-to-end data-driven approach to predicting the future motion of pedestrians. We train and evaluate our approach on a real pedestrian dataset, showcasing its effectiveness in forecasting pedestrian movements.
Computer Vision and Pattern Recognition,Robotics,Systems and Control
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address two main issues in pedestrian motion prediction: 1. **Limitations of Existing Models**: Current research often relies on manually annotated behavior labels to classify pedestrian behavior intentions. These methods typically capture only a limited range of behaviors and introduce human bias. 2. **Conservative Reachability Analysis**: Even with a suitable model, the results of reachability analysis remain overly conservative due to the degrees of freedom in pedestrian decision-making. To mitigate these issues, the authors propose a Transformer-based framework that combines Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) to automatically identify different behavior patterns and use these patterns for data-driven reachability analysis. This approach reduces the conservativeness of predictions while maintaining safety. Specific contributions include: 1. Developing a Transformer-based framework for clustering pedestrian trajectory data. 2. Combining the generated behavior clusters with data-driven reachability analysis. 3. Demonstrating the advantages of Transformer-encoded trajectory clustering in data-driven reachability analysis using real pedestrian trajectory data. Through this method, the system can automatically identify pedestrian behavior patterns without the need for manual intervention and use them to predict future states.