Classification of freshwater snails of the genus Radomaniola with multimodal triplet networks

Dennis Vetter,Muhammad Ahsan,Diana Delicado,Thomas A. Neubauer,Thomas Wilke,Gemma Roig
2024-07-30
Abstract:In this paper, we present our first proposal of a machine learning system for the classification of freshwater snails of the genus Radomaniola. We elaborate on the specific challenges encountered during system design, and how we tackled them; namely a small, very imbalanced dataset with a high number of classes and high visual similarity between classes. We then show how we employed triplet networks and the multiple input modalities of images, measurements, and genetic information to overcome these challenges and reach a performance comparable to that of a trained domain expert.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The paper aims to address the issue of species classification within the freshwater snail genus Radomaniola. Due to the small size of these snails (2-4 mm in length) and the inconspicuous shell characteristics, traditional classification methods require genetic and anatomical data, and the process is complex and time-consuming. Even experts find it difficult to identify subtle differences with the naked eye. Therefore, the paper proposes a machine learning system based on multimodal triplet networks to overcome the following challenges: 1. **Small-scale and highly imbalanced dataset**: The dataset contains a small number of samples, and the distribution of samples among different categories is extremely uneven. 2. **High visual similarity**: The differences in appearance between different species are very subtle, making the classification task extremely difficult. To address these challenges, the research team utilized three input modalities: images, measurement data, and genetic information, and employed triplet networks to learn an intermediate representation that can distinguish between different categories. This approach not only helps improve classification accuracy but also achieves performance comparable to domain experts with limited data. Additionally, this system has the potential to serve as an auxiliary tool in the future, helping taxonomists quickly and accurately identify species, thereby reducing workload and increasing efficiency.