Flood Data Analysis on SpaceNet 8 Using Apache Sedona

Yanbing Bai, Zihao Yang, Jinze Yu, Rui-Yang Ju, Bin Yang, Erick Mas, Shunichi Koshimura
2024-04-29
Abstract: With the escalating frequency of floods posing persistent threats to human life and property, satellite remote sensing has emerged as an indispensable tool for monitoring flood hazards. SpaceNet8 offers a unique opportunity to leverage cutting-edge artificial intelligence technologies to assess these hazards. A significant contribution of this research is its application of Apache Sedona, an advanced platform specifically designed for the efficient and distributed processing of large-scale geospatial data. This platform aims to enhance the efficiency of error analysis, a critical aspect of improving flood damage detection accuracy. Based on Apache Sedona, we introduce a novel approach that addresses the challenges associated with inaccuracies in flood damage detection. This approach involves the retrieval of cases from historical flood events, the adaptation of these cases to current scenarios, and the revision of the model based on clustering algorithms to refine its performance. Through the replication of both the SpaceNet8 baseline and its top-performing models, we embark on a comprehensive error analysis. This analysis reveals several main sources of inaccuracies. To address these issues, we employ data visual interpretation and histogram equalization techniques, resulting in significant improvements in model metrics. After these enhancements, our indicators show a notable improvement, with precision up by 5%, F1 score by 2.6%, and IoU by 4.5%. This work highlights the importance of advanced geospatial data processing tools, such as Apache Sedona. By improving the accuracy and efficiency of flood detection, this research contributes to safeguarding public safety and strengthening infrastructure resilience in flood-prone areas, making it a valuable addition to the field of remote sensing and disaster management.
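The precision, F1, and IoU figures quoted in the abstract are standard pixel-wise segmentation metrics. As a minimal sketch, assuming binary flood masks flattened to sequences of 0/1 values (the function name `binary_metrics` and the input layout are illustrative assumptions, not taken from the paper), they could be computed as follows:

```python
from typing import Dict, Sequence


def binary_metrics(pred: Sequence[int], truth: Sequence[int]) -> Dict[str, float]:
    """Pixel-wise precision, recall, F1, and IoU for two binary masks.

    Both inputs are flat sequences of 0/1 values of equal length
    (an assumed layout chosen for illustration).
    """
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")

    tp = fp = fn = 0
    for p, t in zip(pred, truth):
        if p and t:
            tp += 1          # predicted flooded, actually flooded
        elif p and not t:
            fp += 1          # predicted flooded, actually dry
        elif t and not p:
            fn += 1          # missed flooded pixel

    # Each ratio falls back to 0.0 when its denominator is zero.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0
    iou = tp / (tp + fp + fn) if (tp + fp + fn) else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "iou": iou}


if __name__ == "__main__":
    scores = binary_metrics([1, 1, 1, 0, 0, 0], [1, 1, 0, 1, 0, 0])
    print(scores)
```

In the toy example there are two true positives, one false positive, and one false negative, so precision and F1 both come out at 2/3 and IoU at 0.5.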
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper primarily addresses the following key issues:

1. **Improving flood detection accuracy**:
   - The paper proposes a new method that utilizes the Apache Sedona platform to process large-scale geospatial data, aiming to improve the accuracy and efficiency of flood damage detection. By retrieving historical flood event cases, adapting to current scenarios, and refining the model based on clustering algorithms, the model's performance is enhanced.
2. **Systematic analysis of error sources**:
   - The research focuses not only on improving detection algorithms but also on systematically analyzing the sources of inaccuracies in flood damage detection, such as data annotation errors, low image contrast, and model capability limitations. Based on this analysis, targeted model optimizations are conducted.
3. **Enhancing decision quality and model interpretability**:
   - By introducing clustering algorithms to systematically analyze error cases, patterns and clusters of errors are identified, allowing for targeted corrections and adjustments. This helps improve the decision quality and interpretability of the flood detection model, thereby enhancing its accuracy and robustness in practical applications.

Overall, this study aims to develop a novel method by leveraging the advantages of Apache Sedona to enhance the application of satellite remote sensing technology in flood monitoring, contributing to the improvement of public safety and disaster resilience of infrastructure.
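Low image contrast is one of the error sources listed above, and the abstract names histogram equalization as a remedy. The text here does not say which variant is used, so the following is only a sketch of plain global histogram equalization on a single 8-bit band, with the image held as a list of rows. The name `equalize_histogram` and its parameters are hypothetical, chosen for illustration.

```python
from typing import List


def equalize_histogram(image: List[List[int]], levels: int = 256) -> List[List[int]]:
    """Global histogram equalization for one band of integer pixels.

    Pixel values are assumed to lie in [0, levels - 1]. This is the
    textbook CDF-based method, not necessarily the paper's exact variant.
    """
    # Count how many pixels fall on each intensity level.
    hist = [0] * levels
    for row in image:
        for value in row:
            hist[value] += 1

    # Build the cumulative distribution over intensity levels.
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)

    total = cdf[-1]
    if total == 0:
        return []  # empty image

    # Smallest non-zero CDF value, i.e. the count at the darkest level present.
    cdf_min = next(c for c in cdf if c > 0)
    if total == cdf_min:
        # Constant image: there is no contrast to stretch.
        return [row[:] for row in image]

    # Lookup table mapping each input level to its equalized level,
    # rounded to the nearest integer.
    scale = (levels - 1) / (total - cdf_min)
    lut = [round((c - cdf_min) * scale) if c > 0 else 0 for c in cdf]
    return [[lut[value] for value in row] for row in image]


if __name__ == "__main__":
    low_contrast = [[100, 101], [102, 103]]
    print(equalize_histogram(low_contrast))
```

On the toy input, the four nearly identical grey levels are spread across the full 0 to 255 range. For real satellite tiles, a library routine (for example a locally adaptive variant) would likely be preferable to this pure-Python loop.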