Multi-View Spatial Aggregation Framework for Joint Localization and Segmentation of Organs at Risk in Head and Neck CT Images

Shujun Liang, Kim-Han Thung, Dong Nie, Yu Zhang, Dinggang Shen
DOI: https://doi.org/10.1109/tmi.2020.2975853
IF: 10.6
2020-09-01
IEEE Transactions on Medical Imaging
Abstract: Accurate segmentation of organs at risk (OARs) from head and neck (H&N) CT images is crucial for effective H&N cancer radiotherapy. However, the existing deep learning methods are often not trained in an end-to-end fashion, i.e., they independently predetermine the regions of target organs before organ segmentation, causing limited information sharing between related tasks and thus leading to suboptimal segmentation results. Furthermore, when a conventional segmentation network is used to segment all the OARs simultaneously, the results often favor big OARs over small OARs. Thus, the existing methods often train a specific model for each OAR, ignoring the correlation between different segmentation tasks. To address these issues, we propose a new multi-view spatial aggregation framework for joint localization and segmentation of multiple OARs using H&N CT images. The core of our framework is a proposed region-of-interest (ROI)-based fine-grained representation convolutional neural network (CNN), which is used to generate multi-OAR probability maps from each 2D view (i.e., axial, coronal, and sagittal view) of CT images. Specifically, our ROI-based fine-grained representation CNN (1) unifies the OAR localization and segmentation tasks and trains them in an end-to-end fashion, and (2) improves the segmentation results of various-sized OARs via a novel ROI-based fine-grained representation. Our multi-view spatial aggregation framework then spatially aggregates and assembles the generated multi-view multi-OAR probability maps to segment all the OARs simultaneously. We evaluate our framework using two sets of H&N CT images and achieve competitive and highly robust segmentation performance for OARs of various sizes.
engineering, biomedical, imaging science & photographic technology, electrical & electronic, computer science, interdisciplinary applications, radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?
This paper addresses the accurate segmentation of organs at risk (OARs) in head and neck (H&N) CT images, which is crucial for effective H&N cancer radiotherapy. It targets two limitations of existing deep learning methods. First, they are often not trained end to end: the regions of the target organs are determined independently before segmentation, which limits information sharing between the localization and segmentation tasks and leads to suboptimal results. Second, a conventional network that segments all OARs simultaneously tends to favor large OARs over small ones, so existing methods often train a separate model for each OAR and ignore the correlation between the segmentation tasks. To address these issues, the authors propose a multi-view spatial aggregation framework built on an ROI-based fine-grained representation CNN. The CNN unifies OAR localization and segmentation in a single end-to-end model and generates multi-OAR probability maps from the axial, coronal, and sagittal 2D views of the CT images. The framework then spatially aggregates and assembles these maps to segment all OARs simultaneously. Evaluated on two sets of H&N CT images, the framework is reported to achieve competitive and highly robust segmentation performance for OARs of various sizes.
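The multi-view aggregation step described in the abstract can be illustrated with a minimal sketch. It assumes each 2D view (axial, coronal, sagittal) has already produced per-slice multi-OAR probability maps, and it fuses them by simple voxel-wise averaging followed by an argmax. Both the averaging rule and the helper names `volume_from_slices` and `aggregate_views` are assumptions made for illustration; the abstract does not specify the paper's actual aggregation scheme, which may differ.

```python
import numpy as np


def volume_from_slices(slices, spatial_axis):
    """Reassemble per-slice probability maps into a (C, D, H, W) volume.

    slices: list of arrays shaped (C, A, B), one per slice along the view axis.
    spatial_axis: 0 for axial (slices along D), 1 for coronal (along H),
                  2 for sagittal (along W).
    """
    # The class axis stays first, so the slicing axis is offset by one.
    return np.stack(slices, axis=spatial_axis + 1)


def aggregate_views(axial, coronal, sagittal):
    """Fuse three (C, D, H, W) probability volumes into one label volume.

    Assumption: plain averaging across views. This is an illustrative
    stand-in, not the fusion rule used in the paper.
    """
    stacked = np.stack([axial, coronal, sagittal], axis=0)  # (3, C, D, H, W)
    fused = stacked.mean(axis=0)                            # (C, D, H, W)
    labels = fused.argmax(axis=0)                           # (D, H, W)
    return fused, labels
```

In this sketch a voxel is assigned to the class with the highest mean probability across the three views, so a single view's error on a small organ can be outvoted by the other two.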