Abstract: Deep learning-based super-resolution (SR) techniques have achieved excellent performance in computer vision. Recently, three-dimensional (3D) SR for medical volumetric data has been shown to deliver better visual results than conventional two-dimensional (2D) processing. However, deepening and widening 3D networks makes training significantly harder because of the large number of parameters and the small number of training samples. We therefore propose ParallelNet, a 3D convolutional neural network (CNN) for SR of magnetic resonance (MR) and computed tomography (CT) volumetric data that uses parallel connections. The parallel connection structure is based on group convolution and feature aggregation, which lets us build a 3D CNN that is as wide as possible with few parameters. As a result, the model learns more feature maps with larger receptive fields. To further improve accuracy, we also present an efficient version of ParallelNet, called VolumeNet, which reduces the number of parameters and deepens ParallelNet using a proposed lightweight building block called the Queue module. Unlike most lightweight CNNs, which are based on depthwise convolutions, the Queue module is constructed primarily from separable 2D cross-channel convolutions. This significantly reduces the number of network parameters and the computational complexity while maintaining accuracy, because all channels are still fused. Experimental results demonstrate that the proposed VolumeNet significantly reduces the number of model parameters and achieves high-precision results compared with state-of-the-art methods on brain MR image SR, abdomen CT image SR, and reconstruction of super-resolution 7T-like images from their 3T counterparts.
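The parameter savings the abstract attributes to group convolution and to separable 2D cross-channel convolutions can be illustrated with a small counting sketch. This is not the authors' ParallelNet or Queue module: the helper `conv3d_params`, the channel width of 64, the group count of 4, and the choice to factorize a 3×3×3 kernel into a 1×3×3 convolution followed by a 3×1×1 convolution are all assumptions made for illustration only.

```python
def conv3d_params(c_in, c_out, kernel, groups=1):
    """Weight count of a bias-free 3D convolution (hypothetical helper)."""
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    kd, kh, kw = kernel
    # Each output channel only sees c_in / groups input channels.
    return (c_in // groups) * c_out * kd * kh * kw


channels = 64  # assumed width, not taken from the paper

# Standard 3x3x3 convolution with full channel mixing.
standard = conv3d_params(channels, channels, (3, 3, 3))

# Group convolution: channels are split into 4 independent groups (assumed).
grouped = conv3d_params(channels, channels, (3, 3, 3), groups=4)

# Depthwise-separable baseline: per-channel 3x3x3, then a 1x1x1 pointwise mix.
depthwise_separable = (
    conv3d_params(channels, channels, (3, 3, 3), groups=channels)
    + conv3d_params(channels, channels, (1, 1, 1))
)

# Assumed factorization into cross-channel convolutions with flat kernels:
# an in-plane 1x3x3 convolution followed by a 3x1x1 convolution along depth.
# Both layers mix all channels, unlike the depthwise layer above.
factorized = (
    conv3d_params(channels, channels, (1, 3, 3))
    + conv3d_params(channels, channels, (3, 1, 1))
)

for name, count in [
    ("standard 3x3x3", standard),
    ("grouped (g=4)", grouped),
    ("depthwise separable", depthwise_separable),
    ("factorized cross-channel", factorized),
]:
    print(f"{name:26s} {count:7d} weights ({count / standard:.2%} of standard)")
```

Under these assumptions the factorized variant uses fewer than half the weights of the standard layer while every layer still mixes all channels, which is the trade-off the abstract describes. The depthwise-separable baseline is smaller still, but its spatial filtering is done per channel.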

LVNet: A lightweight volumetric convolutional neural network for real-time and high-performance recognition of 3D objects

RINet: Efficient 3D Lidar-Based Place Recognition Using Rotation Invariant Neural Network

Lightweight Image Super-Resolution Network Using 3D Convolutional Neural Networks

SparseVoxNet: 3-D Object Recognition With Sparsely Aggregation of 3-D Dense Blocks

3D LVCN: A Lightweight Volumetric ConvNet

VoxNet: A 3D Convolutional Neural Network for real-time object recognition

Vehicle Behavior Recognition using Multi-Stream 3D Convolutional Neural Network

PVConvNet: Pixel-Voxel Sparse Convolution for multimodal 3D object detection

OVPT: Optimal Viewset Pooling Transformer for 3D Object Recognition

MV-C3D: A Spatial Correlated Multi-View 3D Convolutional Neural Networks

VolumeNet: A Lightweight Parallel Network for Super-Resolution of MR and CT Volumetric Data

Design Light-weight 3D Convolutional Networks for Video Recognition: Temporal Residual, Fully Separable Block, and Fast Algorithm

DVFENet: Dual-branch voxel feature extraction network for 3D object detection

Virtual Sparse Convolution for Multimodal 3D Object Detection

VPFNet: Voxel-Pixel Fusion Network for Multi-class 3D Object Detection

Latent-MVCNN: 3D Shape Recognition Using Multiple Views from Pre-defined or Random Viewpoints

Point-Voxel CNN for Efficient 3D Deep Learning

LDCNet: A Lightweight Multi-Scale Convolutional Neural Network Using Local Dense Connectivity for Image Recognition

LLR-MVSNet: a lightweight network for low-texture scene reconstruction

Multi-channel Deep 3D Face Recognition

Ghost-dil-NetVLAD: A Lightweight Neural Network for Visual Place Recognition