Masked Latent Transformer with the Random Masking Ratio to Advance the Diagnosis of Dental Fluorosis

Yun Wu,Hao Xu,Maohua Gu,Zhongchuan Jiang,Jun Xu,Youliang Tian
2024-04-21
Abstract:Dental fluorosis is a chronic disease caused by long-term overconsumption of fluoride, which leads to changes in the appearance of tooth enamel. It is an important basis for early non-invasive diagnosis of endemic fluorosis. However, even dental professionals may not be able to accurately distinguish dental fluorosis and its severity based on tooth images. Currently, there is still a gap in research on applying deep learning to diagnosing dental fluorosis. Therefore, we construct the first open-source dental fluorosis image dataset (DFID), laying the foundation for deep learning research in this field. To advance the diagnosis of dental fluorosis, we propose a pioneering deep learning model called masked latent transformer with the random masking ratio (MLTrMR). MLTrMR introduces a mask latent modeling scheme based on Vision Transformer to enhance contextual learning of dental fluorosis lesion characteristics. Consisting of a latent embedder, encoder, and decoder, MLTrMR employs the latent embedder to extract latent tokens from the original image, whereas the encoder and decoder comprising the latent transformer (LT) block are used to process unmasked tokens and predict masked tokens, respectively. To mitigate the lack of inductive bias in Vision Transformer, which may result in performance degradation, the LT block introduces latent tokens to enhance the learning capacity of latent lesion features. Furthermore, we design an auxiliary loss function to constrain the parameter update direction of the model. MLTrMR achieves 80.19% accuracy, 75.79% F1, and 81.28% quadratic weighted kappa on DFID, making it state-of-the-art (SOTA).
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address several key issues in the diagnosis of dental fluorosis: 1. **Lack of Public Datasets**: Currently, there are no publicly available image datasets for dental fluorosis that can be used to train deep learning models. 2. **Limitations of Traditional Methods**: Existing research mainly relies on traditional machine learning techniques, such as fuzzy C-means clustering and enhanced unsupervised clustering algorithms. These methods have limitations in feature extraction and classification performance. 3. **Shortage of Professionals**: In high-prevalence areas, there is a lack of experienced dentists to conduct large-scale screenings, leading to increased risks of misdiagnosis and missed diagnosis. To address these issues, the paper proposes the following contributions: - Constructed the first open-source dental fluorosis image dataset (DFID), providing a foundation for research in this field. - Proposed a novel deep learning model based on Vision Transformer—Masked Latent Transformer with Random Mask Ratio (MLTrMR)—to enhance the automated intelligent diagnosis of dental fluorosis. - Designed an auxiliary loss function to constrain the direction of model parameter updates, significantly improving model performance. Through these efforts, the paper aims to advance the automation of dental fluorosis diagnosis and provide a new approach to addressing current diagnostic challenges.