The Application Research of Deep Neural Networks in Colonic Polyp Segmentation

Jun Li, Fang Wang, Haima Yang, Lihua Qiu, Jin Liu, Le Fu, Dawei Zhang, Hong Wang, Yeye Song
DOI: https://doi.org/10.21203/rs.3.rs-2093277/v1
2022-01-01
Abstract: Background: In medical image segmentation, U-Net is a common choice with strong performance, but it can suffer from gradient problems and information loss. Methods: In this paper, we introduce a new model based on the same encoder-decoder structure. At each layer, we combine a full connection with the encoder output and add axial attention to alleviate the gradient problem and preserve connections between distant pixels. After pooling, we add a module consisting of three atrous convolutions with different dilation rates and a non-local self-attention block to enlarge the receptive field and mitigate the loss of spatial information. In the decoder, we add a dual-channel gate that combines channel attention and spatial attention to indicate 'what' and 'where' the needed features are. Results: We use four public datasets: CVC-ClinicDB, Kvasir-SEG, CVC-ColonDB and CVC-T. We compare our model with more than 10 other models on these four datasets and report evaluation metrics such as mIoU and mDice. We also perform ablation experiments to verify that our model structure is reasonable and effective. Conclusions: Comparative training of 12 models on four different datasets verifies the feasibility and effectiveness of the proposed ideas. The proposed design substantially improves on the basic encoder-decoder structure for semantic segmentation.
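The mIoU and mDice metrics named in the abstract can be illustrated with a minimal sketch. It assumes binary masks flattened to lists of 0/1 labels and a per-image average; the abstract does not state how the authors average their scores, so the averaging scheme, the function names `dice_and_iou` and `mean_scores`, and the smoothing term `eps` are all assumptions made for illustration.

```python
def dice_and_iou(pred, target, eps=1e-8):
    """Return (Dice, IoU) for two equal-length sequences of 0/1 labels."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    # Pixels labeled as polyp in both the prediction and the ground truth.
    inter = sum(p & t for p, t in zip(pred, target))
    pred_sum = sum(pred)
    target_sum = sum(target)
    union = pred_sum + target_sum - inter
    # eps (an assumption) avoids division by zero when both masks are empty.
    dice = (2 * inter + eps) / (pred_sum + target_sum + eps)
    iou = (inter + eps) / (union + eps)
    return dice, iou


def mean_scores(pairs):
    """Average Dice and IoU over (pred, target) pairs (per-image mean, assumed)."""
    scores = [dice_and_iou(p, t) for p, t in pairs]
    n = len(scores)
    mdice = sum(d for d, _ in scores) / n
    miou = sum(i for _, i in scores) / n
    return mdice, miou


if __name__ == "__main__":
    # Toy masks, not data from the paper.
    pairs = [([1, 1, 0, 0], [1, 0, 0, 0]), ([1, 0], [1, 0])]
    mdice, miou = mean_scores(pairs)
    print(f"mDice={mdice:.4f} mIoU={miou:.4f}")
```

For a single image the two scores are related by Dice = 2·IoU / (1 + IoU), so Dice is never lower than IoU on the same prediction.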