A Database Engineered System for Big Data Analytics on Tornado Climatology

Fengfan Bian,Carson K. Leung,Piers Grenier,Harry Pu,Samuel Ning,Alfredo Cuzzocrea
2024-09-26
Abstract:Recognizing the challenges with current tornado warning systems, we investigate alternative approaches. In particular, we present a database engi-neered system that integrates information from heterogeneous rich data sources, including climatology data for tornadoes and data just before a tornado warning. The system aids in predicting tornado occurrences by identifying the data points that form the basis of a tornado warning. Evaluation on US data highlights the advantages of using a classification forecasting recurrent neural network (RNN) model. The results highlight the effectiveness of our database engineered system for big data analytics on tornado climatology-especially, in accurately predict-ing tornado lead-time, magnitude, and location, contributing to the development of sustainable cities.
Databases
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the limited warning time and insufficient accuracy provided by the current tornado warning systems. Specifically: 1. **Insufficient warning time**: The existing tornado warning systems can only provide limited preparation time for residents and authorities, and are unable to effectively deal with this natural disaster. 2. **Low prediction accuracy**: The existing warning systems have limitations in predicting the occurrence time, intensity, and location of tornadoes, resulting in inaccurate warnings. For this reason, the author proposes a solution based on a database engineering system. This system improves tornado prediction by integrating data from multiple heterogeneous data sources (such as climate data and data before tornado warnings). The system utilizes classification prediction models, especially the Long - Short - Term Memory (LSTM) model based on Recurrent Neural Networks (RNN), to improve the accuracy of tornado prediction and the early warning time. ### Specific objectives include: - **Improve prediction accuracy**: By identifying and analyzing data points related to tornado warnings, improve the prediction accuracy of the occurrence time, intensity, and location of tornadoes. - **Extend warning time**: By identifying meteorological conditions that may lead to tornadoes earlier, provide more preparation time for residents and authorities. - **Support sustainable urban development**: By accurately predicting tornadoes, reduce the impact of disasters on infrastructure and socio - economic aspects, and promote the sustainable development of cities. ### Main methods: - **Data integration**: Integrate rich data from multiple data sources, including climate data and real - time data before warnings. - **Machine learning model**: Use the RNN model, especially the LSTM model, to process time - series data and capture the time - dependence and patterns of meteorological conditions. - **Evaluation and verification**: Evaluate the model performance through 10 - fold cross - validation to ensure the reliability and generalization ability of the results. Through these methods, this research aims to develop a more intelligent and efficient tornado prediction system, thereby better protecting public safety and reducing disaster losses.