Deciphering Human Mobility: Inferring Semantics of Trajectories with Large Language Models

Yuxiao Luo, Zhongcai Cao, Xin Jin, Kang Liu, Ling Yin
2024-05-30
Abstract: Understanding human mobility patterns is essential for various applications, from urban planning to public safety. Individual trajectory data, such as mobile phone location data, is rich in spatio-temporal information but often lacks semantic detail, which limits its utility for in-depth mobility analysis. Existing methods can infer basic routine activity sequences from this data, but they lack depth in understanding complex human behaviors and users' characteristics. Additionally, they depend on hard-to-obtain auxiliary datasets such as travel surveys. To address these limitations, this paper defines trajectory semantic inference through three key dimensions: user occupation category, activity sequence, and trajectory description, and proposes the Trajectory Semantic Inference with Large Language Models (TSI-LLM) framework, which leverages LLMs to infer trajectory semantics comprehensively and deeply. We adopt spatio-temporal attributes enhanced data formatting (STFormat) and design a context-inclusive prompt, enabling LLMs to more effectively interpret and infer the semantics of trajectory data. Experimental validation on real-world trajectory datasets demonstrates the efficacy of TSI-LLM in deciphering complex human mobility patterns. This study explores the potential of LLMs in enhancing the semantic analysis of trajectory data, paving the way for more sophisticated and accessible human mobility research.
Artificial Intelligence
What problem does this paper attempt to address?
The problem this paper attempts to address is how to infer deeper semantic information from individual trajectory data to enhance the understanding and analysis of human mobility patterns. Specifically, existing methods can infer basic daily activity sequences from trajectory data such as mobile phone location data, but they lack a deep understanding of complex human behaviors and user characteristics (such as occupational categories) and rely on auxiliary datasets (such as travel surveys) that are difficult to obtain. To address these issues, the paper proposes a Trajectory Semantic Inference framework based on Large Language Models (TSI-LLM), which aims to infer trajectory semantics comprehensively and deeply along three key dimensions (user occupational categories, activity sequences, and trajectory descriptions).
The main contributions of the paper include:
1. Defining three dimensions of trajectory semantic inference: occupational categories, activity sequences, and trajectory descriptions.
2. Proposing a novel Trajectory Semantic Inference framework based on Large Language Models (TSI-LLM), which combines spatiotemporal attribute-enhanced data formatting (STFormat) and trajectory semantic inference prompts to achieve comprehensive analysis and inference of individual trajectory semantics.
3. Validating the effectiveness of TSI-LLM through experiments on real trajectory datasets.
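To make the idea of spatiotemporal formatting plus a context-inclusive prompt more concrete, the following is a minimal sketch, not the authors' implementation. The summary above does not specify the actual STFormat layout or prompt wording, so everything here is an assumption made for illustration: the `StayPoint` schema, the `format_stay` and `build_prompt` helpers, the derived attributes (weekday or weekend, duration), and the prompt text. The sketch only builds the prompt string; sending it to an LLM is left out.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class StayPoint:
    """One stay in a trajectory (hypothetical schema, not taken from the paper)."""
    start: datetime
    end: datetime
    lat: float
    lon: float
    place_type: str  # assumed context, e.g. a nearby point-of-interest category


def format_stay(stay: StayPoint) -> str:
    """Render a stay as one text line with explicit temporal attributes."""
    day_kind = "weekend" if stay.start.weekday() >= 5 else "weekday"
    minutes = int((stay.end - stay.start).total_seconds() // 60)
    return (
        f"({day_kind}) {stay.start:%H:%M}-{stay.end:%H:%M}, {minutes} min, "
        f"at ({stay.lat:.4f}, {stay.lon:.4f}), near: {stay.place_type}"
    )


def build_prompt(stays: list[StayPoint]) -> str:
    """Assemble an illustrative prompt asking for the three semantic dimensions."""
    lines = "\n".join(f"- {format_stay(s)}" for s in stays)
    return (
        "You are analysing one person's daily trajectory.\n"
        "Stays (time, duration, location, nearby place type):\n"
        f"{lines}\n"
        "Infer: (1) the user's occupation category, "
        "(2) the activity sequence, (3) a short trajectory description."
    )


if __name__ == "__main__":
    demo = [
        StayPoint(datetime(2024, 5, 30, 9, 0), datetime(2024, 5, 30, 18, 0),
                  22.5431, 114.0579, "office building"),
    ]
    print(build_prompt(demo))
```

Deriving attributes such as day type and stay duration before prompting is one plausible way to expose temporal structure that raw timestamps hide; whether it matches the paper's formatting would need to be checked against the full text.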