A Benchmark Dataset for Tornado Detection and Prediction using Full-Resolution Polarimetric Weather Radar Data

Mark S. Veillette,James M. Kurdzo,Phillip M. Stepanian,John Y. N. Cho,Siddharth Samsi,Joseph McDonald
2024-01-27
Abstract:Weather radar is the primary tool used by forecasters to detect and warn for tornadoes in near-real time. In order to assist forecasters in warning the public, several algorithms have been developed to automatically detect tornadic signatures in weather radar observations. Recently, Machine Learning (ML) algorithms, which learn directly from large amounts of labeled data, have been shown to be highly effective for this purpose. Since tornadoes are extremely rare events within the corpus of all available radar observations, the selection and design of training datasets for ML applications is critical for the performance, robustness, and ultimate acceptance of ML algorithms. This study introduces a new benchmark dataset, TorNet to support development of ML algorithms in tornado detection and prediction. TorNet contains full-resolution, polarimetric, Level-II WSR-88D data sampled from 10 years of reported storm events. A number of ML baselines for tornado detection are developed and compared, including a novel deep learning (DL) architecture capable of processing raw radar imagery without the need for manual feature extraction required for existing ML algorithms. Despite not benefiting from manual feature engineering or other preprocessing, the DL model shows increased detection performance compared to non-DL and operational baselines. The TorNet dataset, as well as source code and model weights of the DL baseline trained in this work, are made freely available.
Atmospheric and Oceanic Physics,Machine Learning
What problem does this paper attempt to address?
The paper aims to address the challenges of meteorological radar in tornado detection and prediction by introducing a new benchmark dataset to facilitate the development of machine learning (ML) algorithms. Specifically, the main objectives of the study include: 1. **Proposing the TorNet benchmark dataset**: This dataset includes full-resolution polarimetric WSR-88D meteorological radar data sampled from 10 years of reported storm events. These data are designed to support the development of ML algorithms in the field of tornado detection and prediction. 2. **Evaluating existing algorithms**: The study compares several existing ML baseline algorithms for tornado detection, including traditional non-deep learning methods and a novel deep learning architecture. The latter can directly process raw radar images without the need for manual feature extraction. 3. **Validating the effectiveness of the deep learning model**: Despite not utilizing manual feature engineering or preprocessing steps, the proposed deep learning model shows higher detection performance compared to non-deep learning algorithms and current operational baselines. 4. **Promoting research community collaboration**: By publicly releasing the TorNet dataset, source code, and trained deep learning model weights, the authors hope to foster collaboration and advancement within the research community in this important field. In summary, the main goal of the paper is to improve the accuracy of tornado detection, reduce false alarm rates, and accelerate related research progress by introducing the TorNet dataset and deep learning model.