Vision Intelligence Assisted Lung Function Estimation Based on Transformer Encoder-Decoder Network With Invertible Modeling.

Liuyin Chen,Di Lu,Jianxue Zhai,Kaican Cai,Long Wang ,Zijun Zhang
DOI: https://doi.org/10.1109/TAI.2023.3348428
2024-01-01
Abstract:Lung function evaluation is important to many medical applications, but conducting pulmonary function tests is constrained by different conditions. This paper presents a pioneer study of an integrated invertible deep learning method for lung function estimation via using computed tomography (CT) images. First, the projection method is proposed to flatten the 3D image onto a 2D plane, with preserving location information in 3D. Next, the MBConv Transformer-based encoder-decoder structure is developed to extract latent features. Finally, we develop an invertible Normalizing Flow model to infer lung function based on the extracted features and design two loss functions for two directions. The method enables both estimating the lung function based on CT images and metadata as well as generating the corresponding simulated CT image according to the lung function. Computational studies show that the proposed regression model outperforms all state-of-the-art image regression models. A comprehensive comparative analysis also demonstrates the effectiveness of using generated images and confirms the superiority of the proposed method. To the best of our knowledge, this work is the first of its kind in combining encoder-decoder network with Normalizing Flows to ensure the effectiveness of the fully invertible framework, especially in lung CT image analysis.
What problem does this paper attempt to address?