Unified Model for Interpreting Multi-View Echocardiographic Sequences Without Temporal Information

Ming Li, Shizhou Dong, Zhifan Gao, Cheng Feng, Huahua Xiong, Wei Zheng, Dhanjoo Ghista, Heye Zhang, Victor Hugo C. de Albuquerque
DOI: https://doi.org/10.1016/j.asoc.2019.106049
IF: 8.7
2020-01-01
Applied Soft Computing
Abstract: Robust, fully automatic interpretation of multi-view echocardiographic sequences across multiple vendors and centers is challenging because of abundant artifacts, low signal-to-noise ratio, large shape variations among views, and large gaps between centers and vendors. In this paper, a dense pyramid and deep supervision network (DPSN) is proposed to tackle this task. DPSN combines the advantages of the densely connected network, the feature pyramid network, and the deeply supervised network, which help it extract and fuse multi-level, multi-scale holistic semantic information. This capability gives DPSN strong generalization and robustness, enabling precise interpretation. To reduce computational complexity and avoid the information loss that frequently occurs in temporal modeling, DPSN processes all frames independently (i.e., without using temporal information) yet still achieves stable and coherent performance across a sequence. Extensive experiments on a heterogeneous (multi-view, multi-center, and multi-vendor) dataset of 10,858 labeled images show that DPSN achieves not only superior segmentation results but also high computational efficiency and stable performance. The estimated ejection fraction also correlates well with clinical measurements, indicating the clinical potential of DPSN.
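For readers unfamiliar with the ejection fraction mentioned above, the following is a minimal sketch of the standard definition, EF = (EDV - ESV) / EDV, applied to a list of per-frame left-ventricular volumes. The abstract does not say how DPSN derives volumes or selects frames, so the function name `ejection_fraction`, the input format, and the choice of the largest and smallest per-frame volumes as end-diastole and end-systole are assumptions made for illustration only.

```python
from typing import Sequence


def ejection_fraction(volumes: Sequence[float]) -> float:
    """Return the ejection fraction in percent.

    `volumes` holds one left-ventricular volume per frame, in any
    consistent unit. Hypothetical helper: the paper's actual volume
    estimation and frame selection are not described in the abstract.
    """
    if not volumes:
        raise ValueError("need at least one frame volume")
    # Assumption: end-diastole is the frame with the largest cavity,
    # end-systole the frame with the smallest. No temporal model is used.
    edv = max(volumes)
    esv = min(volumes)
    if edv <= 0:
        raise ValueError("end-diastolic volume must be positive")
    return 100.0 * (edv - esv) / edv
```

Because each frame is handled on its own, the only sequence-level step in this sketch is taking the maximum and minimum over the per-frame results.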