Chicken Image Segmentation via Multi-Scale Attention-Based Deep Convolutional Neural Network

Wei Li,Yang Xiao,Xibin Song,Na Lv,Xinbo Jiang,Yan Huang,Jingliang Peng
DOI: https://doi.org/10.1109/access.2021.3074297
IF: 3.9
2021-01-01
IEEE Access
Abstract:Accurate segmentation and analysis for each animal in surveillance video images will help poultry farmers to monitor and promote animal welfare. However, it is challenging to accurately segment each animal due to the similar appearance, different scales, rapid growth and adhesive areas of group animals. Meanwhile, lacking of useful training data also limits the effectiveness of animal segmentation algorithms. To address these problems, we first construct a chicken image segmentation dataset to study the behavior of chickens for intelligent monitoring and analysis. Then, we propose an effective end-to-end framework for chicken image segmentation, which can also be used for other animal image segmentation. An end-to-end multi-scale based encoder-decoder network is first utilized to extract multi-scale features. Then, an attention-based module is employed to extract and intensify effective features, thus better segmentation results can be obtained. Finally, a multi-output combined loss function is proposed to make effective supervision for better segmentation. Experimental results demonstrate the promising performance of the proposed framework for chicken image segmentation.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: in surveillance video images, how to accurately segment and analyze each chicken to help poultry farmers monitor and promote animal welfare. Specifically, the paper mainly addresses the following challenges: 1. **Similar Appearance and Different Scales**: Due to the similar appearance and different sizes of individuals in the chicken flock, it is difficult to accurately segment each individual. 2. **Rapid Growth and Adhesive Areas**: The rapid growth of chickens and their group aggregation make it difficult to separate the adhesive areas between them. 3. **Lack of Effective Training Data**: The existing chicken image segmentation data sets are insufficient, which limits the effectiveness of the segmentation algorithm. To solve these problems, the authors took the following measures: - **Constructing a Chicken Image Segmentation Data Set**: The authors created a new chicken image segmentation data set for studying the behavior of chickens for intelligent monitoring and analysis. - **Proposing a Deep Convolutional Neural Network Framework with Multi - scale Attention Mechanism (MSAnet)**: This framework can extract multi - scale features and strengthen effective features through the attention mechanism to obtain better segmentation results. - **Introducing a Multi - output Combined Loss Function**: In order to better supervise network training, a multi - output combined loss function was proposed. Through these methods, the paper aims to improve the accuracy of chicken image segmentation, thereby helping farmers better monitor the behavior of chickens and ensure their health and welfare. ### Formula Summary 1. **Multi - scale Module Feature Extraction Formula**: \[ F_1 = CR(Cat(CR(I), CR^2(I))), \quad i = 1 \] \[ F_i = CR(Cat(CR(I_i), CR^2(Cat(CR(I_i), F_{i - 1}^v)))), \quad i\in[2, n] \] where \(I\) represents the original chicken image, \(I_i\) represents the image down - sampled at the \(i\)-th layer, \(n\) is the total number of layers, \(Cat(\cdot)\) represents the concatenation operation in the channel direction, and \(CR(\cdot)\) and \(CR^2(\cdot)\) represent the one - layer and two - layer convolution plus ReLU operations respectively. 2. **Channel Attention Mechanism Formula**: \[ M = Y_f'\odot Z_f' \] \[ \theta = softmax(M) \] \[ f^{CA} = \theta\odot X_f \] where \(\odot\) represents the dot - product operation, and \(X_f, Y_f, Z_f\) are three components extracted from the input feature map \(f\) through convolution operations. 3. **Edge Attention Mechanism Formula**: \[ \hat{F}_i=\frac{1}{N_k}\sum_{k\in\Omega_i}w_kF_l'+\frac{1}{N_k}\sum_{k\in\Omega_i}b_k \] \[ \hat{F}_i = W_l*F_l'+B_l \] \[ F_h' = W_h*F_h + B_h \] where \(\Omega_i\) represents the set of all windows containing position \(i\), \(W_l, B_l\) are the coefficients of the low - resolution feature map, and after bilinear up - sampling, the coefficients of the high - resolution feature map \(W_h, B_h\) are obtained. 4. **Combined Loss Function Formula**: \[ loss(i)=-y\log(y_i) \] \[ L=\sum_{i = 1}^n\omega_i\cdot loss(i) \] where \(y\) represents the ground - truth label, \(y_i\) represents the mask predicted at the \(i\)-th layer, and \(\omega_i\) is the loss weight of each side - output layer. Through these methods, the paper effectively improves chicken image segmentation.