Fine-Grained Classification of Colposcopic Images with Multi-Modality

Yiming Wang,Yi Chen,Jialiang Lu,Song Li,Dan Wu
DOI: https://doi.org/10.1145/3502871.3502877
2022-01-01
Abstract:Computer aided diagnosis (CAD) has gained popularity in medical image processing over the last decade. As with image based manual diagnosis, CAD system should base its results on specific lesions. In the case of colposcopic images, two modalities are collected for diagnosis, namely Lugol’s Iodine (LI) and Acetic Acid (A). In practice, classification performance may suffer from barely distinguishable lesions in mono-modality. To tackle this problem, we present in this paper a framework with fine-grained strategy and further boost its performance with multi-modality. Specifically, our proposed framework consists of two branches, Local Lesion Branch and Global Branch, to implement a fine-grained strategy. Local Lesion Branch indicates existence and severity of lesion in different modalities on patch level. Global Branch extracts features on whole image level, and uses our proposed Cross Stage Modality Fusion (CSMF) Blocks to enable feature exchange and fusion between two modalities. Features from two branches are then concatenated for final prediction. We conduct experiments on clinical dataset of colposcopic images. The result shows that our method achieves an accuracy of 83.39%, outperforming mono-modal baseline model by over 7.24%.
What problem does this paper attempt to address?