How to unlearn a learned Machine Learning model ?

Seifeddine Achour
2024-10-14
Abstract: In contemporary times, machine learning (ML) has sparked a remarkable revolution across numerous domains, surpassing even the loftiest of human expectations. However, despite the astounding progress made by ML, the need to regulate its outputs and capabilities has become imperative. A viable approach to address this concern is by exerting control over the data used for its training, more precisely, by unlearning the model from undesired data. In this article, I will present an elegant algorithm for unlearning a machine learning model and visualize its abilities. Additionally, I will elucidate the underlying mathematical theory and establish specific metrics to evaluate both the unlearned model's performance on desired data and its level of ignorance regarding unwanted data.
Machine Learning, Optimization and Control
What problem does this paper attempt to address?
This paper addresses how to make an already-trained machine learning model "forget" specific unwanted data while maintaining good performance on the desired data. Specifically, the author proposes an algorithm for "unlearning" a machine learning model and introduces new evaluation metrics to measure the model's performance after unlearning.

### Problem Background

With the wide application of machine learning (ML) in various fields, regulating its outputs and capabilities has become crucial. One viable way to do so is to control the data used to train the model, in particular by making the model "forget" unwanted data. Unlearning requires not only that the model forget the unwanted data but also that its performance on the desired data be preserved or even improved.

### Main Objectives

1. **Propose a new unlearning algorithm**: The algorithm defines a new objective function (Ethical MSE, EMSE) so that, during training, the model minimizes the error on the desired data while maximizing the error on the unwanted data.
2. **Establish evaluation metrics**: To evaluate the model after unlearning, the author proposes two new metrics:
   - **Exponential R-squared**: Measures the non-representativeness of the model for the unwanted data.
   - **Fair R-squared**: Jointly evaluates the representativeness of the model on the desired data and its non-representativeness on the unwanted data.

### Mathematical Theory and Formulas

The EMSE objective function proposed by the author is as follows:

\[ \text{EMSE} = \sum_{i \in \text{wanted}} (y_i - \hat{y}_i)^2 - \sum_{i \in \text{unwanted}} \log\left(1 - \frac{1}{\sqrt{\pi}} \exp\left(- (y_i - \hat{y}_i)^2\right)\right) \]

where:

- \( y_i \) is the true value,
- \( \hat{y}_i \) is the predicted value,
- \( \text{wanted} \) represents the index set of the desired data,
- \( \text{unwanted} \) represents the index set of the unwanted data.

The first sum is the usual squared error on the desired data. The second term is largest when an unwanted point is fitted exactly and decays to zero as the error on that point grows, so minimizing EMSE pushes the model away from the unwanted data while keeping a good fit on the desired data.

### Experimental Results

The author verified the effectiveness of this algorithm through a series of experiments. The experiments showed that, after applying the EMSE criterion, the model significantly improves its fit to the desired data (R-squared increased from 0.43 to 0.98) and completely ignores the unwanted data. The algorithm also performs well when the amount of unwanted data is large, which supports its robustness.

### Conclusion

This paper proposes a novel unlearning algorithm and introduces new metrics for evaluating model performance. These methods offer new ideas for the controllability and security of machine learning models, especially in application scenarios where privacy must be protected or sensitive data must be deleted.
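
As an illustration of the EMSE objective given under Mathematical Theory and Formulas, the following is a minimal Python sketch that evaluates it for a set of predictions. It is not the author's implementation: the function name `emse`, the boolean `unwanted` mask, and the toy data are assumptions made for illustration, and the code only evaluates the objective rather than training a model with it.

```python
import math


def emse(y_true, y_pred, unwanted):
    """Evaluate the EMSE objective as written in the formula.

    `unwanted` is a hypothetical boolean mask (True = unwanted sample);
    the source specifies index sets only, not a data layout.
    """
    wanted_term = 0.0
    unwanted_term = 0.0
    for y, y_hat, is_unwanted in zip(y_true, y_pred, unwanted):
        sq_err = (y - y_hat) ** 2
        if is_unwanted:
            # -log(1 - exp(-e^2) / sqrt(pi)): largest when the unwanted
            # point is fitted exactly, decaying to 0 as the error grows.
            unwanted_term -= math.log(
                1.0 - math.exp(-sq_err) / math.sqrt(math.pi)
            )
        else:
            # Ordinary squared error on the desired data.
            wanted_term += sq_err
    return wanted_term + unwanted_term


# Toy data (assumed): three desired points on a line and one unwanted outlier.
y_true = [1.0, 2.0, 3.0, 10.0]
unwanted = [False, False, False, True]

# Predictions that fit every point, including the unwanted one.
fit_all = emse(y_true, [1.0, 2.0, 3.0, 10.0], unwanted)

# Predictions that fit the desired points and miss the unwanted one.
ignore_unwanted = emse(y_true, [1.0, 2.0, 3.0, 4.0], unwanted)

print(fit_all, ignore_unwanted)
```

In this toy case the predictions that miss the unwanted point score lower than the predictions that fit every point, which is the behavior the objective is meant to reward. Because \( 1 - \frac{1}{\sqrt{\pi}} \exp(-e^2) \) is at least \( 1 - \frac{1}{\sqrt{\pi}} > 0 \), the logarithm is always defined.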