A Pyramid Input Augmented Multi-Scale CNN for GGO Detection in 3D Lung CT Images

Weihua Liu,Xiabi Liu,Xiongbiao Luo,Murong Wang,Guanghui Han,Xinming Zhao,Zheng Zhu
DOI: https://doi.org/10.1016/j.patcog.2022.109261
IF: 8
2023-01-01
Pattern Recognition
Abstract:This paper proposes a new convolutional neural network (CNN) with multi-scale processing for detecting ground-glass opacity nodules (GGO) in 3D computed tomography (CT) images, which is referred to as Pi-aNet for short. PiaNet consists of a feature-extraction module and a prediction module. The former mod-ule is constructed by introducing pyramid multi-scale source connections into a contracting-expanding structure. Besides, a new multi-receptive-field convolution block (MRCB) is presented to fuse the convo-lutions with multiple kernels of varying sizes for capturing features in each scale of information better. The latter module includes a bounding-box regressor and a classifier that are employed to simultane-ously recognize GGO nodules and estimate bounding boxes at multiple scales. To train the proposed Pi-aNet, a two-stage transfer learning strategy is developed. In the first stage, the feature-extraction module is embedded into a classifier network that is trained on a large data set of GGO and non-GGO patches, which are generated by performing data augmentation from a small number of annotated CT scans. In the second stage, the pretrained feature-extraction module is loaded into PiaNet, and then PiaNet is fine-tuned using the annotated CT scans. We evaluate the proposed PiaNet with the LIDC-IDRI dataset. The experimental results demonstrate that our method outperforms state-of-the-art counterparts, including the Subsolid CAD and Aidence systems and CPM-Net and S4ND and GA-SSD methods. PiaNet achieves a sensitivity of 93.6% with only one false positive per scan.(c) 2022 Elsevier Ltd. All rights reserved.
What problem does this paper attempt to address?