Flood susceptibility mapping through geoinformatics and ensemble learning methods, with an emphasis on the AdaBoost-Decision Tree algorithm, in Mazandaran, Iran
Maryam Jahanbani,Mohammad H. Vahidnia,Hossein Aghamohammadi,Zahra Azizi
DOI: https://doi.org/10.1007/s12145-023-01213-2
2024-01-17
Earth Science Informatics
Abstract:Floods, as natural disasters, impose significant human and financial burdens, necessitating stringent mitigation measures. The recurrent annual incidence of floods precipitates considerable economic setbacks and tragic human casualties. In the realm of disaster management, flood susceptibility mapping has evolved into an indispensable instrument for preemptive intervention. In recent years, the amalgamation of machine learning (ML) methodologies and geographic information systems (GIS) has demonstrated remarkable promise in the realm of flood susceptibility mapping. Nonetheless, the inherent limitations of standalone ML models have constrained their predictive efficacy. Several shortcomings are evident in prior research. These include the failure to utilize contemporary ensemble approaches capable of enhancing performance and the limited exploration of diverse classifier combinations, which are instrumental in augmenting reliability. Simultaneously, there is an absence of current and up-to-date flood susceptibility maps on recent floods within the study area. Hence, this study endeavors to enhance the precision of flood susceptibility mapping, within the Haraz-Neka River basin across Mazandaran province, by harnessing an ensemble of ML models. The research methodology encompassed several pivotal phases. Initially, data about 240 flood sites were meticulously compiled. Subsequently, 70% of this dataset was allocated for training and cartographic elucidations, whereas the remaining 30%, selected at random, served to validate the resultant maps. The analytical framework incorporated a spectrum of influential parameters, encompassing Elevation, Slope, Aspect, Rainfall, land use, Vegetation Differentiation Index (NDVI), Soil Hydrology Groups, Proximity to the River, Distance from Landslides, Topographic Wetness Index (TWI), Stream Power Index (SPI), and Sediment Transport Index (STI) for spatial modeling. The results undeniably highlight the superior performance of the ensemble model compared to its individual counterparts. Validation exercises, leveraging historical flood data, prominently endorsed the AdaBoost algorithm integrated with the Decision Tree classifier as the most efficacious. Garnering an Area Under ROC curve surpassing 0.96, accompanied by an accuracy of 0.93%, a sensitivity of 0.95%, and a specificity of 0.92%, this amalgamation substantiates its prowess. The proposed framework stands poised to empower decision-makers in identifying vulnerable regions and devising efficacious flood risk mitigation strategies.
geosciences, multidisciplinary,computer science, interdisciplinary applications