A lightweight underwater fish image semantic segmentation model based on U‐Net
Zhenkai Zhang,Wanghua Li,Boon‐Chong Seet
DOI: https://doi.org/10.1049/ipr2.13161
IF: 2.3
2024-06-27
IET Image Processing
Abstract:A lightweight underwater fish image semantic segmentation model is proposed based on U‐Net. The multi‐input mode, MRS module, MSC structure module and CBAM attention mechanism are introduced into U‐Net network. The results show that the proposed model has the advantages of high accuracy and a small amount of parameters, and achieves the balance between accuracy and speed, which is more suitable for underwater image segmentation. Semantic segmentation of underwater fish images is vital for monitoring fish stocks, assessing marine resources, and sustaining fisheries. To tackle challenges such as low segmentation accuracy, inadequate real‐time performance, and imprecise location segmentation in current methods, a novel lightweight U‐Net model is proposed. The proposed model acquires more segmentation details by applying a multiple‐input approach at the first four encoder levels. To achieve both lightweight and high accuracy, a multi‐scale residual structure (MRS) module is proposed to reduce parameters and compensate for the accuracy loss caused by the reduction of channels. To improve segmentation accuracy, a multi‐scale skip connection (MSC) structure is further proposed, and the convolution block attention mechanism (CBAM) is introduced at the end of each decoder level for weight adjustment. Experimental results demonstrate a notable reduction in model volume, parameters, and floating‐point operations by 94.20%, 94.39%, and 51.52% respectively, compared to the original model. The proposed model achieves a high mean intersection over union (mIOU) of 94.44%, mean pixel accuracy (mPA) of 97.03%, and a frame rate of 43.62 frames per second (FPS). With its high precision and minimal parameters, the model strikes a balance between accuracy and speed, making it particularly suitable for underwater image segmentation.
computer science, artificial intelligence,engineering, electrical & electronic,imaging science & photographic technology