Prioritizing Potential Wetland Areas via Region-to-Region Knowledge Transfer and Adaptive Propagation

Yoonhyuk Choi,Reepal Shah,John Sabo,K. Selcuk Candan,Huan Liu
2024-06-09
Abstract:Wetlands are important to communities, offering benefits ranging from water purification, and flood protection to recreation and tourism. Therefore, identifying and prioritizing potential wetland areas is a critical decision problem. While data-driven solutions are feasible, this is complicated by significant data sparsity due to the low proportion of wetlands (3-6\%) in many areas of interest in the southwestern US. This makes it hard to develop data-driven models that can help guide the identification of additional wetland areas. To solve this limitation, we propose two strategies: (1) The first of these is knowledge transfer from regions with rich wetlands (such as the Eastern US) to sparser regions (such as the Southwestern area with few wetlands). Recognizing that these regions are likely to be very different from each other in terms of soil characteristics, population distribution, and land use, we propose a domain disentanglement strategy that identifies and transfers only the applicable aspects of the learned model. (2) We complement this with a spatial data enrichment strategy that relies on an adaptive propagation mechanism. This mechanism differentiates between node pairs that have positive and negative impacts on each other for Graph Neural Networks (GNNs). To summarize, given two spatial cells belonging to different regions, we identify domain-specific and domain-shareable features, and, for each region, we rely on adaptive propagation to enrich features with the features of surrounding cells. We conduct rigorous experiments to substantiate our proposed method's effectiveness, robustness, and scalability compared to state-of-the-art baselines. Additionally, an ablation study demonstrates that each module is essential in prioritizing potential wetlands, which justifies our assumption.
Information Retrieval
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address the issue of data sparsity in wetland identification and prioritization. Specifically, the main challenge faced by researchers is the low wetland coverage in certain areas (such as the southwestern United States), which only accounts for 3%-6% of the land area. This makes it difficult to develop data-driven methods to identify new wetlands. To overcome this limitation, the research proposes two strategies: 1. **Cross-Regional Knowledge Transfer**: Transfer knowledge from wetland-rich areas (such as the eastern United States) to wetland-scarce areas (such as the southwestern United States). Considering that soil characteristics, population distribution, and land use may vary significantly between different regions, the research proposes a domain decoupling strategy to identify and transfer applicable model aspects. 2. **Spatial Data Enrichment**: Utilize an adaptive propagation mechanism within the same region to distinguish the positive and negative impacts between node pairs, thereby enhancing the information transmission effect in Graph Neural Networks (GNNs). This mechanism can identify the positive and negative impacts between different nodes and enrich features through adaptive propagation. Through these methods, the researchers aim to develop a new model that can effectively identify potential wetland areas. Additionally, experimental results validate the effectiveness, robustness, and scalability of the proposed methods, and ablation studies demonstrate the importance of each module.