Abstract: Predicting the distribution of people in the time window approaching a disaster is crucial for post-disaster assistance and can inform evacuation route selection and shelter planning. However, two major limitations have not yet been addressed: (1) Most spatiotemporal prediction models incorporate spatiotemporal features either directly or indirectly, which results in high information redundancy in the model parameters and low computational efficiency. (2) These models usually incorporate a fixed set of basic and external features, and they can neither adjust the features they attend to according to the spatiotemporal context nor update them in real time. As a result, their spatiotemporal feature embedding methods are inflexible and difficult to interpret. To overcome these problems, a lightweight population density distribution prediction framework that considers both basic and external spatiotemporal features is proposed. In this study, an autoencoder extracts spatiotemporal coded information to form a spatiotemporal attention mechanism, and the basic and external spatiotemporal feature attention is merged by a fusion framework with learnable weights. The fused spatiotemporal attention is then combined with a ResNet prediction backbone to predict the population distribution. Comparison and ablation experiments show that, relative to classical and popular spatiotemporal prediction frameworks, the proposed framework improves computational efficiency and the interpretability of spatiotemporal information while making fuller use of the scalability of the model's spatiotemporal features. This study provides a reference solution that can be extended to predicting population distributions in similar regions around the globe.
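The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the attention maps are formed or combined, so everything below is an assumption made for illustration: the sigmoid gating, the softmax normalization of the fusion weights, the grid size, and the names `fuse_attention`, `basic_code`, `external_code`, and `fusion_logits` are all hypothetical, and random arrays stand in for the autoencoder codes and the ResNet feature maps.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def fuse_attention(basic_code, external_code, fusion_logits):
    """Fuse two attention maps using softmax-normalized weights.

    Assumption: each autoencoder code is turned into an attention map
    with a sigmoid, and the two maps are combined as a convex sum.
    """
    w = softmax(fusion_logits)             # two weights that sum to 1
    basic_att = sigmoid(basic_code)        # values in (0, 1)
    external_att = sigmoid(external_code)  # values in (0, 1)
    return w[0] * basic_att + w[1] * external_att, w

rng = np.random.default_rng(0)
H, W = 4, 4                                # hypothetical spatial grid
basic_code = rng.normal(size=(H, W))       # stand-in for basic-feature code
external_code = rng.normal(size=(H, W))    # stand-in for external-feature code
fusion_logits = np.zeros(2)                # would be learned; zeros give equal weights
features = rng.normal(size=(8, H, W))      # stand-in for backbone feature maps

attention, weights = fuse_attention(basic_code, external_code, fusion_logits)
modulated = features * attention           # attention broadcasts over the channel axis
```

In a trained model the two entries of `fusion_logits` would be updated by gradient descent along with the backbone, and inspecting the resulting weights is one plausible way to read off how much the basic and external features each contribute.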

MSTEM: Masked Spatiotemporal Event Series Modeling for Urban Undisciplined Events Forecasting

Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos

Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal Forecasting

Revealing the Power of Masked Autoencoders in Traffic Forecasting

Meteorology-Assisted Spatio-Temporal Graph Network for Uncivilized Urban Event Prediction

Graph Masked Autoencoder for Spatio-Temporal Graph Learning

Spatial-Temporal Meta-path Guided Explainable Crime Prediction

STORM: A Spatio-Temporal Context-Aware Model for Predicting Event-Triggered Abnormal Crowd Traffic

W-MAE: Pre-trained weather model with masked autoencoder for multi-variable weather forecasting

Unsupervised Representation Learning of Player Behavioral Data with Confidence Guided Masking

SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification

EMIT: Event-Based Masked Auto Encoding for Irregular Time Series

MSTAD: A masked subspace-like transformer for multi-class anomaly detection

UniM$^2$AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving

GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training

A Case Study of Lujiazui Financial District

Masked Autoencoders As Spatiotemporal Learners

Traj-MAE: Masked Autoencoders for Trajectory Prediction

HiMTM: Hierarchical Multi-Scale Masked Time Series Modeling with Self-Distillation for Long-Term Forecasting