FloodGenome: Interpretable Machine Learning for Decoding Features Shaping Property Flood Risk Predisposition in Cities

Chenyue Liu, Ali Mostafavi
2024-03-21
Abstract: Understanding the fundamental characteristics that shape the inherent flood risk disposition of urban areas is critical for integrated urban design strategies for flood risk reduction. Flood risk disposition specifies an inherent and event-independent magnitude of property flood risk and measures the extent to which urban areas are susceptible to property damage if exposed to a weather hazard. This study presents FloodGenome as an interpretable machine learning model for evaluation of the extent to which various hydrological, topographic, and built-environment features and their interactions shape flood risk disposition in urban areas. Using flood damage claims data from the U.S. National Flood Insurance Program covering the period 2003 through 2023 across four metropolitan statistical areas (MSAs), the analysis computes building damage ratios and flood claim counts by employing k-means clustering for classifying census block groups (CBGs) into distinct property flood risk disposition levels. Then a random forest model is created to specify property flood risk levels of CBGs based on various intertwined hydrological, topographic, and built-environment features. The model transferability analysis results show consistent performance across MSAs, revealing the universality of underlying features that shape city property flood risks. The FloodGenome model is then used to: (1) evaluate the extent to which future urban development would exacerbate flood risk disposition of urban areas; and (2) specify property flood risk levels at finer spatial resolution providing critical insights for flood risk management processes. The FloodGenome model and the findings provide novel tools and insights for improving the characterization and understanding of intertwined features that shape flood risk profiles of cities.
Computational Engineering, Finance, and Science
What problem does this paper attempt to address?
The paper addresses property flood risk predisposition in urban areas: the inherent, event-independent susceptibility of an area's properties to damage when exposed to a weather hazard. Its objective is to uncover how hydrological, topographic, and built-environment features, and the interactions among them, shape that predisposition, so that urban planners can better assess the likelihood of property damage in different parts of a city.

To do this, the paper proposes FloodGenome, an interpretable machine learning model. The study uses flood damage claims data from the U.S. National Flood Insurance Program for 2003 through 2023 across four Metropolitan Statistical Areas (Houston, Miami, New Orleans, and New York). Building damage ratios and flood claim counts are clustered with k-means to classify Census Block Groups (CBGs) into distinct flood risk levels, and a random forest model is then trained to specify each CBG's risk level from the hydrological, topographic, and built-environment features and to identify the key determinants of flood risk. The paper also evaluates how well the model transfers across MSAs; the consistent performance it reports suggests that the underlying features shaping flood risk predisposition are common across cities, which supports unified and scalable flood risk management strategies. Finally, the model is applied to evaluate how future urban development would exacerbate flood risk predisposition and to specify property flood risk levels at a finer spatial resolution.
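The labeling step described above (clustering CBGs into risk levels from damage ratios and claim counts) can be illustrated with a minimal sketch. This is not the authors' implementation: the CBG values are made up for illustration, the choice of three risk levels (`k=3`), the min-max scaling, the deterministic centroid initialization, and the helper names `min_max_scale`, `kmeans`, and `risk_levels` are all assumptions introduced here. The sketch uses plain Lloyd's k-means from the standard library only; the subsequent random forest step is not shown.

```python
import math

def min_max_scale(rows):
    """Rescale each column to [0, 1] so claim counts do not dominate damage ratios."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    spans = [(max(c) - min(c)) or 1.0 for c in cols]
    return [tuple((v - lo) / s for v, lo, s in zip(r, lows, spans)) for r in rows]

def kmeans(points, k, n_iter=100):
    """Lloyd's algorithm with a simple deterministic initialization (an assumption)."""
    # Initial centroids: evenly spaced picks from the points sorted by feature sum.
    ordered = sorted(points, key=sum)
    centroids = [ordered[int((j + 0.5) * len(ordered) / k)] for j in range(k)]
    labels = []
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid.
        labels = [min(range(k), key=lambda j: math.dist(p, centroids[j])) for p in points]
        # Update step: each centroid moves to the mean of its members.
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                updated.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                updated.append(centroids[j])  # keep an empty cluster's centroid in place
        if updated == centroids:
            break
        centroids = updated
    return labels, centroids

def risk_levels(rows, k=3):
    """Cluster CBGs, then rank clusters so 0 is the lowest risk and k-1 the highest."""
    labels, centroids = kmeans(min_max_scale(rows), k)
    order = sorted(range(k), key=lambda j: sum(centroids[j]))
    rank = {cluster: level for level, cluster in enumerate(order)}
    return [rank[lab] for lab in labels]

# Hypothetical CBGs: (mean building damage ratio, flood claim count).
cbgs = [
    (0.02, 3), (0.03, 5), (0.01, 2),
    (0.15, 40), (0.18, 55), (0.20, 48),
    (0.45, 180), (0.50, 210), (0.40, 160),
]
levels = risk_levels(cbgs, k=3)
print(levels)  # → [0, 0, 0, 1, 1, 1, 2, 2, 2]
```

Ranking clusters by their centroid magnitude turns arbitrary cluster IDs into ordered risk levels, which is the form a downstream classifier such as a random forest would need as its target variable.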