TanrsColour: Transformer‐based medical image colourization with content and structure preservation

Qinghai Liu, Dengping Zhao, Lun Tang, Limin Xu
DOI: https://doi.org/10.1049/ipr2.13129
IF: 2.3
2024-06-16
IET Image Processing
Abstract: A novel Transformer‐based medical image colourization method is proposed, which consists of content and style Transformer encoders that capture domain‐specific telematic information for long‐term dependency in the colourization task. Then, a Transformer decoder is designed to translate content sequences while referring to style sequences, which achieves better fusion of content and style features. Additionally, a content‐aware positional encoding scheme and a style‐aware positional encoding scheme are employed for scale‐invariant visual generation tasks. Medical image colouring techniques make it possible to colourize grey‐scale medical images to assist doctors in diagnosis. Benefiting from the non‐linear fitting ability of deep neural networks, deep medical image colouring techniques have achieved remarkable results. However, existing methods still suffer from content and structure feature leakage, unrealistic colouring and poor scale invariance. Thus, this paper proposes a Transformer‐based medical image colouring algorithm with long‐term dependency to avoid feature leakage in coloured images. Specifically, the method employs two different Transformer encoders to generate and encode feature sequences for grey‐scale medical images and real human colour slice images, respectively. Then, a novel multi‐layer Transformer decoder is used to stylize the grey‐scale image features based on the real physical colour feature sequences. For colouring images at different scales, we implement content‐aware positional encoding with scale invariance and propose a style‐aware positional encoding strategy to take realistic physical colour priors into account. Extensive experimental results indicate that our method achieves better colourization effects than recent state‐of‐the‐art medical image colourization methods.
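As a rough illustration of how a decoder can translate content sequences while referring to style sequences, the sketch below implements single-head scaled dot-product cross-attention in plain Python. It is not the authors' implementation: the names `softmax` and `cross_attention`, the identity projections (no learned weights), and the toy two-dimensional tokens are all assumptions made for this example, and positional encodings are omitted.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(content_tokens, style_tokens):
    """Single-head cross-attention with identity projections (assumption).

    content_tokens: query vectors, e.g. patches of a grey-scale image.
    style_tokens:   key/value vectors, e.g. patches of a colour reference.
    Returns one output vector per content token plus the attention weights.
    """
    dim = len(content_tokens[0])
    scale = 1.0 / math.sqrt(dim)
    outputs, weights = [], []
    for q in content_tokens:
        # Similarity of this content token to every style token.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k))
                  for k in style_tokens]
        w = softmax(scores)
        # Weighted mix of style tokens: the "stylized" content token.
        out = [sum(wj * v[d] for wj, v in zip(w, style_tokens))
               for d in range(dim)]
        outputs.append(out)
        weights.append(w)
    return outputs, weights


# Toy example: three content tokens attend over two style tokens.
content = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
style = [[2.0, 0.0], [0.0, 2.0]]
stylized, attn = cross_attention(content, style)
```

Each content token receives a mixture of style tokens weighted by similarity, so the output keeps one vector per content position (preserving spatial structure) while its values come from the style sequence. A full model would add learned query/key/value projections, multiple heads, and feed-forward layers.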
computer science, artificial intelligence; engineering, electrical & electronic; imaging science & photographic technology