Machine Unlearning for Medical Imaging

Reza Nasirigerdeh,Nader Razmi,Julia A. Schnabel,Daniel Rueckert,Georgios Kaissis
2024-07-10
Abstract:Machine unlearning is the process of removing the impact of a particular set of training samples from a pretrained model. It aims to fulfill the "right to be forgotten", which grants the individuals such as patients the right to reconsider their contribution in models including medical imaging models. In this study, we evaluate the effectiveness (performance) and computational efficiency of different unlearning algorithms in medical imaging domain. Our evaluations demonstrate that the considered unlearning algorithms perform well on the retain set (samples whose influence on the model is allowed to be retained) and forget set (samples whose contribution to the model should be eliminated), and show no bias against male or female samples. They, however, adversely impact the generalization of the model, especially for larger forget set sizes. Moreover, they might be biased against easy or hard samples, and need additional computational overhead for hyper-parameter tuning. In conclusion, machine unlearning seems promising for medical imaging, but the existing unlearning algorithms still needs further improvements to become more practical for medical applications.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the application of Machine Unlearning in the field of medical imaging. Specifically, it focuses on how to effectively remove the influence of specific training samples from pre-trained models to meet the "Right to be Forgotten," which allows patients to reconsider their contributions to medical datasets and the models trained on these datasets. The paper evaluates the effectiveness and computational efficiency of different machine unlearning algorithms in the medical imaging domain, exploring their performance on the retention set (the set of samples whose influence is retained), the forgetting set (the set of samples whose contribution to the model needs to be eliminated), and the test set. It also analyzes the impact of these algorithms on the model's generalization ability and potential bias issues. The main research questions of the paper include: 1. **Effectiveness of the algorithms**: Evaluating the performance of different machine unlearning algorithms on the retention set and the forgetting set, ensuring that the algorithm's performance on the retention set is comparable to the pre-trained model, while significantly lower on the forgetting set. 2. **Computational efficiency of the algorithms**: Comparing the computational overhead of different machine unlearning algorithms, especially their efficiency in handling large-scale datasets. 3. **Model generalization ability**: Analyzing the impact of machine unlearning algorithms on the model's generalization ability, particularly when the size of the forgetting set is large. 4. **Fairness**: Investigating whether the algorithms introduce bias towards specific groups (e.g., male or female samples) and whether there is bias towards easily or difficult-to-classify samples. Through this research, the paper aims to provide theoretical foundations and practical guidance for the application of machine unlearning technology in the medical imaging field, promoting the development and application of related technologies.