Abstract: In medical-data-driven learning, 3D convolutional neural networks (CNNs) have started to show superior performance to 2D CNNs in numerous deep learning tasks, demonstrating the added value of 3D spatial information in feature representation. However, 3D CNNs need more training samples to converge, more computational resources, and longer execution time, which limits their adoption. Also, applying transfer learning to 3D CNNs is challenging due to a lack of publicly available pre-trained 3D models. To tackle these issues, we propose a novel strategic 2D representation of volumetric data, namely 2.75D. In this work, the spatial information of 3D images is captured in a single 2D view by a spiral-spinning technique. As a result, 2D CNNs can also be used to learn volumetric information. Moreover, we can fully leverage pre-trained 2D CNNs for downstream vision problems. We also explore a multi-view 2.75D strategy, 2.75D 3 channels (2.75Dx3), to boost the advantage of 2.75D. We evaluated the proposed methods on three public datasets with different modalities or organs (lung CT, breast MRI, and prostate MRI) against their 2D, 2.5D, and 3D counterparts in classification tasks. Results show that the proposed methods significantly outperform their counterparts when all methods are trained from scratch on the lung dataset. This performance gain is more pronounced with transfer learning or with limited training data. Our methods also achieved comparable performance on the other datasets. In addition, our methods achieved a substantial reduction in training and inference time compared with the 2.5D or 3D methods.

What problem does this paper attempt to address?

### Problems the paper attempts to solve

The paper addresses three obstacles that 3D convolutional neural networks (3D CNNs) face in medical image analysis: the need for more training samples, high demand for computing resources, and long training time. Although 3D CNNs have started to outperform 2D CNNs in feature representation, these obstacles limit their use. In addition, the lack of publicly available pre-trained 3D models makes transfer learning on 3D CNNs difficult.

To overcome these problems, the authors propose a new 2D representation, 2.75D, which converts 3D volumetric data into a single 2D view through a spiral-spinning technique. This allows pre-trained 2D CNNs to be used for downstream vision tasks and substantially reduces training and inference time.

Specifically, the 2.75D method targets the following aspects:

1. **Data representation**: The spiral-spinning technique captures the spatial information of a 3D volume in a single 2D view, so that a 2D CNN can also learn volumetric information.
2. **Transfer learning**: Pre-trained 2D CNNs can be fully leveraged, which improves performance on small datasets.
3. **Computing efficiency**: Compared with the 2.5D and 3D methods, the 2.75D method substantially reduces training and inference time while maintaining high performance.

### Experimental verification

The authors evaluated the 2.75D methods on three public datasets covering different modalities and organs (lung CT, breast MRI, and prostate MRI), against 2D, 2.5D, and 3D counterparts in classification tasks. On the lung dataset, the proposed methods significantly outperform the counterparts when all methods are trained from scratch, and the gain is more pronounced with transfer learning or with limited training data. On the other datasets, the proposed methods achieve comparable performance. Training and inference are also substantially faster than with the 2.5D or 3D methods.

### Main contributions

1. **2.75D strategy**: A spiral-spinning technique converts 3D volumetric data into a 2D representation that serves as the input of a 2D classification CNN.
2. **Multi-view 2.75D strategy**: A multi-view, three-channel variant (2.75Dx3) is explored to boost the advantage of 2.75D.
3. **Transfer learning**: Models pre-trained on 2D images are used for transfer learning, which makes the performance gain more pronounced.
4. **Systematic study**: The methods are compared systematically under different amounts of training data.

### Conclusion

The 2.75D method offers an efficient and effective way to improve 3D medical image analysis when training data and computing resources are limited. By converting 3D volumetric data into a 2D representation, it can make full use of pre-trained 2D CNNs and substantially reduce training and inference time.
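
The summary describes the spiral-spinning idea only at a high level. As a rough illustration, the dependency-free Python sketch below assumes one plausible reading: points are placed along a spiral running from one pole of a sphere to the other, and the voxel intensities sampled along the radial line from the volume center to each spiral point form one column of the 2D image. The function names (`spiral_directions`, `spiral_to_2d`), the parameters (`n_points`, `n_radial`, `turns`), and the nearest-neighbor sampling are assumptions made for this example, not details taken from the paper.

```python
import math


def spiral_directions(n_points, turns):
    """Unit vectors (dz, dy, dx) along a spiral from the north to the south pole.

    Hypothetical helper: the spiral parameterization is an assumption.
    Requires n_points >= 2.
    """
    directions = []
    for i in range(n_points):
        t = i / (n_points - 1)               # 0 at the north pole, 1 at the south pole
        polar = math.pi * t                  # polar angle sweeps from 0 to pi
        azimuth = 2.0 * math.pi * turns * t  # azimuth winds `turns` times around the axis
        directions.append((
            math.cos(polar),
            math.sin(polar) * math.sin(azimuth),
            math.sin(polar) * math.cos(azimuth),
        ))
    return directions


def spiral_to_2d(volume, n_points=32, n_radial=8, turns=4):
    """Sample a volume indexed as volume[z][y][x] into an n_radial x n_points image.

    Column j holds the intensities along the radial line from the volume
    center to the j-th spiral point (nearest-neighbor sampling, an assumption).
    Requires n_radial >= 2.
    """
    depth, height, width = len(volume), len(volume[0]), len(volume[0][0])
    cz, cy, cx = (depth - 1) / 2, (height - 1) / 2, (width - 1) / 2
    radius = min(cz, cy, cx)  # largest sphere that fits inside the volume

    image = [[0.0] * n_points for _ in range(n_radial)]
    for col, (dz, dy, dx) in enumerate(spiral_directions(n_points, turns)):
        for row in range(n_radial):
            r = radius * row / (n_radial - 1)  # row 0 is the center, last row the surface
            z = int(round(cz + r * dz))
            y = int(round(cy + r * dy))
            x = int(round(cx + r * dx))
            image[row][col] = volume[z][y][x]
    return image


# Usage: a synthetic 9x9x9 volume whose voxel value encodes its own coordinates.
demo_volume = [[[z * 100 + y * 10 + x for x in range(9)]
                for y in range(9)]
               for z in range(9)]
demo_image = spiral_to_2d(demo_volume)
print(len(demo_image), len(demo_image[0]))  # prints "8 32"
```

The resulting 2D array can be fed to any 2D CNN in place of an ordinary image. A three-channel 2.75Dx3 input could be assembled by stacking three such images as channels; how the three views are chosen is not specified in this summary. A practical implementation would likely vectorize the sampling and use interpolation instead of nearest-neighbor lookup.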

2.75D: Boosting learning by representing 3D Medical imaging to 2D features for small data

3D Multiple-Contextual ROI-Attention Network for Efficient and Accurate Volumetric Medical Image Segmentation.

Super Images -- A New 2D Perspective on 3D Medical Imaging Analysis

Bridging 2D and 3D Segmentation Networks for Computation Efficient Volumetric Medical Image Segmentation: An Empirical Study of 2.5D Solutions

Learning 3D Features with 2D CNNs Via Surface Projection for CT Volume Segmentation.

Reinventing 2D Convolutions for 3D Images

A Spatial Mapping Algorithm with Applications in Deep Learning-Based Structure Classification

Enhancement and evaluation for deep learning-based classification of volumetric neuroimaging with 3D-to-2D knowledge distillation

Leveraging 2D Deep Learning ImageNet-trained models for Native 3D Medical Image Analysis

3D Self-Supervised Methods for Medical Imaging

Anisotropic Hybrid Network For Cross-Dimension Transferable Feature Learning In 3d Medical Images

Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

Self-supervised Feature Learning for 3D Medical Images by Playing a Rubik's Cube

3D Tiled Convolution for Effective Segmentation of Volumetric Medical Images

A 3D Convolutional Neural Network for Volumetric Image Semantic Segmentation

3D Dense Separated Convolution Module for Volumetric Medical Image Analysis

3D Anisotropic Hybrid Network: Transferring Convolutional Features from 2D Images to 3D Anisotropic Volumes.

Contextual Embedding Learning to Enhance 2D Networks for Volumetric Image Segmentation

Pretrained Deep 2.5D Models for Efficient Predictive Modeling from Retinal OCT

3D ConvNet+: A lightweight adaptive network for 3D medical image segmentation