Plankton Classification on Imbalanced Dataset Via Hybrid Resample Method with LightBGM

Yiran Liu,Xu Qiao,Rui Gao
DOI: https://doi.org/10.1109/icivc52351.2021.9526988
2021-01-01
Abstract:Plankton monitoring plays an essential role in marine ecological environment protection, effective identification of its species and quantity can assess the health of the marine ecosystem. Thus, it is valuable to build an automatic classification system for plankton. However, the data of plankton naturally exhibit an imbalance in their class distribution. As a result, we need to take the class-imbalance problem into account for plankton classification. In this paper, we propose a classification model based on a hybrid resample method with LightBGM classifier. Our hybrid resample method combines borderline-SMOTE oversampling and Fuzzy C-means cluster-based undersampling (BSFCM), which is available for handling both within-class and between-class imbalance. In addition, to eliminate the irrelevant factors, dataset preprocessing and feature dimension reduction are employed for the in situ plankton images. The F1-measure and G-mean are used as the evaluation criterion to assess the classification performance. The experimental results show that our BSFCM method using LightBGM classifier is superior to the compared benchmark methods, and achieves good performance on the imbalanced plankton dataset.
What problem does this paper attempt to address?