FloodDamageCast: Building Flood Damage Nowcasting with Machine Learning and Data Augmentation

Chia-Fu Liu,Lipai Huang,Kai Yin,Sam Brody,Ali Mostafavi
2024-05-24
Abstract:Near-real time estimation of damage to buildings and infrastructure, referred to as damage nowcasting in this study, is crucial for empowering emergency responders to make informed decisions regarding evacuation orders and infrastructure repair priorities during disaster response and recovery. Here, we introduce FloodDamageCast, a machine learning framework tailored for property flood damage nowcasting. The framework leverages heterogeneous data to predict residential flood damage at a resolution of 500 meters by 500 meters within Harris County, Texas, during the 2017 Hurricane Harvey. To deal with data imbalance, FloodDamageCast incorporates a generative adversarial networks-based data augmentation coupled with an efficient machine learning model. The results demonstrate the model's ability to identify high-damage spatial areas that would be overlooked by baseline models. Insights gleaned from flood damage nowcasting can assist emergency responders to more efficiently identify repair needs, allocate resources, and streamline on-the-ground inspections, thereby saving both time and effort.
Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address the issue of real-time prediction (nowcasting) of building damage during flood disasters. Specifically, the study proposes a machine learning framework named **FloodDamageCast** to quickly assess the extent of flood damage in residential areas during disasters. The framework addresses challenges in existing technologies through the following aspects: 1. **Feature Fusion**: The model considers various features, including hydrological and topographical characteristics, historical flood event data, and current flood features, capturing the nonlinear interactions between these features. 2. **Data Imbalance Problem**: Flood damage datasets typically suffer from severe class imbalance, where the proportion of damaged buildings is much smaller than that of undamaged buildings. To address this, the paper employs Conditional Tabular Generative Adversarial Networks (CTGAN) for data augmentation to balance the sample sizes of different classes. 3. **High-Resolution Prediction**: To achieve fine-grained spatial resolution (500 meters × 500 meters), the paper integrates data from the National Flood Insurance Program (NFIP) and Individual Assistance (IA) programs. This includes damage data for both insured and uninsured buildings, thereby enhancing the completeness of the training and testing datasets. Through these methods, FloodDamageCast can provide reliable and efficient damage assessment tools for emergency responders during disaster response and recovery. This helps them better identify repair needs, allocate resources, and streamline on-site inspection processes, saving time and effort.