SERT: A Transfomer Based Model for Spatio-Temporal Sensor Data with Missing Values for Environmental Monitoring

Amin Shoari Nejad,Rocío Alaiz-Rodríguez,Gerard D. McCarthy,Brian Kelleher,Anthony Grey,Andrew Parnell

2023-06-09

Abstract:Environmental monitoring is crucial to our understanding of climate change, biodiversity loss and pollution. The availability of large-scale spatio-temporal data from sources such as sensors and satellites allows us to develop sophisticated models for forecasting and understanding key drivers. However, the data collected from sensors often contain missing values due to faulty equipment or maintenance issues. The missing values rarely occur simultaneously leading to data that are multivariate misaligned sparse time series. We propose two models that are capable of performing multivariate spatio-temporal forecasting while handling missing data naturally without the need for imputation. The first model is a transformer-based model, which we name SERT (Spatio-temporal Encoder Representations from Transformers). The second is a simpler model named SST-ANN (Sparse Spatio-Temporal Artificial Neural Network) which is capable of providing interpretable results. We conduct extensive experiments on two different datasets for multivariate spatio-temporal forecasting and show that our models have competitive or superior performance to those at the state-of-the-art.

Machine Learning,Artificial Intelligence

What problem does this paper attempt to address?

The paper primarily focuses on addressing the issue of spatiotemporal data prediction in environmental monitoring, particularly dealing with sensor data that contains missing values. Specifically, the paper proposes two models to handle such data: 1. **SERT (Spatiotemporal Encoder Representation from Transformer)**: This is a model based on the Transformer architecture, capable of multivariate spatiotemporal prediction and can naturally handle missing data without the need for data imputation. 2. **SST-ANN (Sparse Spatiotemporal Artificial Neural Network)**: This is a simpler, more interpretable model that can also handle missing data. Although it may be slightly less accurate than SERT, it has the advantage of faster computation speed and can provide insights into how the prediction results are derived. Both models aim to address the key challenge in spatiotemporal data prediction—how to effectively handle data missing due to sensor failures or maintenance issues. Additionally, the paper proposes a method for handling positional information in the input data and designs a mask loss function for training the models to address the issue of missing values in the output data. Through experimental evaluation on both simulated and real-world datasets, the study shows that these models can effectively cope with different levels of sparsity and demonstrate good performance in practical applications (such as environmental monitoring data from Dublin Bay). Notably, the SERT model performs best in experiments with 7-hour ahead predictions, while the SST-ANN model provides interpretability of the prediction results.

SERT: A Transfomer Based Model for Spatio-Temporal Sensor Data with Missing Values for Environmental Monitoring

SERT: A transfomer based model for multivariate temporal sensor data with missing values for environmental monitoring

Spatiotemporal Transformer for Imputing Sparse Data: A Deep Learning Approach

Using Temporal Correlation and Time Series to Detect Missing Activity-Driven Sensor Events.

SiET: Spatial information enhanced transformer for multivariate time series anomaly detection

ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data

Missing Value Imputation of Wireless Sensor Data for Environmental Monitoring

ImputeFormer: Low Rankness-Induced Transformers for Generalizable Spatiotemporal Imputation

Graph Transformer Network Incorporating Sparse Representation for Multivariate Time Series Anomaly Detection

TENT: Tensorized Encoder Transformer for Temperature Forecasting

Unsupervised Spatio-Temporal State Estimation for Fine-grained Adaptive Anomaly Diagnosis of Industrial Cyber-physical Systems

Unsupervised Anomaly Detection in Spatio‐Temporal Stream Network Sensor Data

A spatio-temporal LSTM model to forecast across multiple temporal and spatial scales

Generalizability Under Sensor Failure: Tokenization + Transformers Enable More Robust Latent Spaces

Spatial-temporal Forecasting for Regions without Observations

Time Series Representation Models

Recover Missing Sensor Data with Iterative Imputing Network

STARS: Sensor-agnostic Transformer Architecture for Remote Sensing

Increasing the Robustness of Model Predictions to Missing Sensors in Earth Observation

Decoupling Long-and Short-Term Patterns in Spatiotemporal Inference

A Cost-Sensitive Transformer Model for Prognostics Under Highly Imbalanced Industrial Data