Partial Label Learning with Focal Loss for Sea Ice Classification Based on Ice Charts

Behzad Vahedi,Benjamin Lucas,Farnoush Banaei-Kashani,Andrew P. Barrett,Walter N. Meier,Siri Jodha Khalsa,Morteza Karimzadeh
DOI: https://doi.org/10.1109/JSTARS.2024.3413003
2024-06-10
Abstract:Sea ice, crucial to the Arctic and Earth's climate, requires consistent monitoring and high-resolution mapping. Manual sea ice mapping, however, is time-consuming and subjective, prompting the need for automated deep learning-based classification approaches. However, training these algorithms is challenging because expert-generated ice charts, commonly used as training data, do not map single ice types but instead map polygons with multiple ice types. Moreover, the distribution of various ice types in these charts is frequently imbalanced, resulting in a performance bias towards the dominant class. In this paper, we present a novel GeoAI approach to training sea ice classification by formalizing it as a partial label learning task with explicit confidence scores to address multiple labels and class imbalance. We treat the polygon-level labels as candidate partial labels, assign the corresponding ice concentrations as confidence scores to each candidate label, and integrate them with focal loss to train a Convolutional Neural Network (CNN). Our proposed approach leads to enhanced performance for sea ice classification in Sentinel-1 dual-polarized SAR images, improving classification accuracy (from 87% to 92%) and weighted average F-1 score (from 90% to 93%) compared to the conventional training approach of using one-hot encoded labels and Categorical Cross-Entropy loss. It also improves the F-1 score in 4 out of the 6 sea ice classes.
Computer Vision and Pattern Recognition,Image and Video Processing
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are two major challenges in automated sea - ice classification on training data: the multi - label problem and the class - imbalance problem. Specifically: 1. **Multi - label problem**: Sea - ice maps generated by experts (as training data) are usually not of a single type of sea - ice, but rather polygon areas containing multiple sea - ice types. The traditional one - hot encoding method cannot effectively handle these mixed labels, resulting in information loss and potential mislabeling. 2. **Class - imbalance problem**: The distribution of different types of sea - ice in the data is usually unbalanced, and some types of sea - ice occur far less frequently than others. This will cause the model to be biased towards frequently occurring classes during the training process, and have poor performance on rare classes. To solve these problems, the author proposes a new GeoAI method, formulating the sea - ice classification task as a Partial Label Learning (PLL) problem and introducing Focal Loss to deal with class - imbalance. Specific methods include: - **Partial Label Learning (PLL)**: Consider polygon - level labels as candidate partial labels and assign confidence scores according to the concentration of each sea - ice type. These confidence scores are integrated into the training process to better reflect the existence probability of each sea - ice type. - **Focal Loss**: By adjusting the modulation factor and the focusing parameter in the loss function, reduce the influence of common classes and increase the attention to rare classes, thereby improving the overall classification performance. Through this method, the author has successfully improved the accuracy of sea - ice classification based on Sentinel - 1 dual - polarization SAR images from 87% to 92%, and the weighted average F1 - score from 90% to 93%. In addition, the F1 - score has also been improved in 4 out of 6 sea - ice categories. In summary, this paper aims to overcome the multi - label and class - imbalance problems in automated sea - ice classification through innovative PLL and Focal Loss methods, thereby improving classification accuracy and reliability.