Abstract:Anatomical Landmark Detection is the process of identifying key areas of an image for clinical measurements. Each landmark is a single ground truth point labelled by a clinician. A machine learning model predicts the locus of a landmark as a probability region represented by a heatmap. Diffusion models have increased in popularity for generative modelling due to their high quality sampling and mode coverage, leading to their adoption in medical image processing for semantic segmentation. Diffusion modelling can be further adapted to learn a distribution over landmarks. The stochastic nature of diffusion models captures fluctuations in the landmark prediction, which we leverage by blurring into meaningful probability regions. In this paper, we reformulate automatic Anatomical Landmark Detection as a precise generative modelling task, producing a few-hot pixel heatmap. Our method achieves state-of-the-art MRE and comparable SDR performance with existing work.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the accuracy and efficiency issues in **Automatic Anatomical Landmark Detection**. Specifically, the author proposes a method based on the diffusion model, aiming to improve the accuracy and robustness of landmark detection while reducing the time for model training and inference. ### Problem Background Anatomical landmark detection refers to identifying key regions in medical images for clinical measurement. Each landmark is labeled as a single ground truth (GT) by a doctor. Traditional methods rely on manual annotation, which is not only time - consuming but also prone to large inter - annotator variability. Automating this process can significantly improve the annotation speed and accuracy and reduce the annotator's variability. ### Limitations of Existing Methods Currently, most studies use convolutional neural networks (CNNs) for landmark detection, most commonly by generating heatmaps to predict the position of landmarks. However, these methods may encounter difficulties when dealing with complex images, especially in capturing the uncertainty of landmark positions. ### Main Contributions of the Paper 1. **Introduction of Diffusion Model**: The author applies the denoising diffusion probability model (DDPMs) to the landmark detection task, taking advantage of its strong sample quality and pattern coverage capabilities. 2. **Single - step and Multi - step Diffusion Models**: A single - step diffusion model is proposed to accelerate the analysis speed, and combined with a multi - step diffusion model to improve accuracy. 3. **Improved Loss Function**: By adjusting the loss function, the model can better learn the probability distribution of landmarks, thereby improving the detection accuracy. 4. **Salt & Pepper Heatmaps**: By gradually reducing Gaussian blur, the activation points in the heatmap are enhanced, making the model output more meaningful. ### Method Overview - **Data Generation**: Generate data step by step through the diffusion model, starting from the standard Gaussian distribution \(N(0, I)\), gradually removing noise until the training data distribution is restored. - **Model Architecture**: Use a time - encoded U - Net architecture, combined with the sinusoidal position embedding of Transformer to encode the time step. - **Optimization Objective**: Optimize the learning process of the model by combining the mean square error (MSE) and negative log - likelihood (NLL) loss functions. ### Experimental Results The experimental results show that this method has achieved state - of - the - art performance on multiple public datasets, especially in terms of the mean radial error (MRE) and the success detection rate (SDR). In particular, the multi - step diffusion model has achieved high accuracy in all radius ranges and has an efficiency comparable to existing methods. ### Conclusion This paper demonstrates the effectiveness of the multi - step diffusion model in automatic landmark detection. By optimizing the model structure and loss function, the training speed and detection accuracy are significantly improved. Future work will further explore alternative formulas and more efficient inference methods to further improve the model performance. --- In summary, this paper aims to solve the accuracy and efficiency problems in automatic anatomical landmark detection by introducing the diffusion model and an improved heatmap generation method, thereby providing a more reliable tool for medical image processing.

Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy

A Heatmap Regression Approach for Right Ventricle Anatomical Landmarks Localization on Echocardiography

Spine Landmark Localization with combining of Heatmap Regression and Direct Coordinate Regression

Synthetic Augmentation for Anatomical Landmark Localization using DDPMs

Learn Fine-Grained Adaptive Loss for Multiple Anatomical Landmark Detection in Medical Images

Anatomic-landmark Detection Using Graphical Context Modelling.

Landmark Localization from Medical Images with Generative Distribution Prior

Spatial landmark detection and tissue registration with deep learning

Super-resolution landmark detection networks for medical images

Multi-task global optimization-based method for vascular landmark detection

Sub-pixel face landmarks using heatmaps and a bag of tricks

Heatmap-based 2D Landmark Detection with a Varying Number of Landmarks

An Automatic Cephalometric Landmark Detection Method Based on Heatmap Regression and Monte Carlo Dropout

Attentive One-Dimensional Heatmap Regression for Facial Landmark Detection and Tracking

Multiple Landmark Detection using Multi-Agent Reinforcement Learning

Adversarial Learning Framework for Cerebrovascular Landmark Detection Using Cross-Modality Information.

A Cascade Regression Model For Anatomical Landmark Detection

What is Healthy? Generative Counterfactual Diffusion for Lesion Localization

Uncertainty-aware U-Net for Medical Landmark Detection

Optimising for the Unknown: Domain Alignment for Cephalometric Landmark Detection

Diffusion Models for Counterfactual Generation and Anomaly Detection in Brain Images