A multitask model for realtime fish detection and segmentation based on YOLOv5

QinLi Liu,Xinyao Gong,Jiao Li,Hongjie Wang,Ran Liu,Dan Liu,Ruoran Zhou,Tianyu Xie,Ruijie Fu,Xuliang Duan
DOI: https://doi.org/10.7717/peerj-cs.1262
2023-03-10
Abstract:The accuracy of fish farming and real-time monitoring are essential to the development of "intelligent" fish farming. Although the existing instance segmentation networks (such as Maskrcnn) can detect and segment the fish, most of them are not effective in real-time monitoring. In order to improve the accuracy of fish image segmentation and promote the accurate and intelligent development of fish farming industry, this article uses YOLOv5 as the backbone network and object detection branch, combined with semantic segmentation head for real-time fish detection and segmentation. The experiments show that the object detection precision can reach 95.4% and the semantic segmentation accuracy can reach 98.5% with the algorithm structure proposed in this article, based on the golden crucian carp dataset, and 116.6 FPS can be achieved on RTX3060. On the publicly available dataset PASCAL VOC 2007, the object detection precision is 73.8%, the semantic segmentation accuracy is 84.3%, and the speed is up to 120 FPS on RTX3060.
What problem does this paper attempt to address?