Effects of input image size on the accuracy of fish identification using deep learning

Yuka Iwahara,Yasutoki Shibata,Masahiro Manano,Tomoya Nishino,Ryosuke Kariya,Hiroki Yaemori
DOI: https://doi.org/10.1101/2024.03.01.582886
2024-03-04
Abstract:The length composition of catches by species is important for stock assessment. However, length measurement is performed manually, jeopardizing the future of continuous measurement because of likely labor shortages. We focused on applying deep learning to estimate length composition by species from images of fish caught for sustainable management. In this study, input image sizes were varied to evaluate the effect of input image size on detection and classification accuracy, as a method for improving the accuracy. The images (43,226 fish of 85 classes) were captured on conveyor belts to sort set-net catches. Fish detection and classification were performed using Mask R-CNN. The effect of input image size on accuracy was examined using three image sizes of 1333×888, 2000×1333, and 2666×1777 pixels, achieving an mAP50-95 of 0.580 or higher. The accuracy improved with increasing image size, attaining a maximum improvement of 4.3% compared to the smallest size. However, increasing the image size too far from the default size may not improve the accuracy of models with fine-tuning. Improvements in accuracy were primarily observed for the species with low accuracy at the smallest image size. Increasing image size would be a useful and simple way to improve accuracy for these species.
Ecology
What problem does this paper attempt to address?
The problem this paper attempts to address is: evaluating the impact of changing the input image size on the accuracy of fish detection and classification, in order to improve the precision of image-based fish length estimation, thereby supporting the sustainable management of fishery resources. Specifically, the paper focuses on how to use deep learning techniques to estimate the length composition of fish from captured fish images, as a replacement for traditional manual measurement methods. Since manual measurement relies on skilled researchers and is time-consuming and labor-intensive, there may be a future labor shortage issue. Therefore, developing a method that does not depend on the number of researchers is crucial for the assessment and sustainable utilization of fishery resources. The paper explores the impact of different image sizes (1333×888, 2000×1333, and 2666×1777 pixels) on the accuracy of fish detection and classification by adjusting the input image size, and evaluates the improvement effect of these changes on model performance.