Multi-Modal Recommendation Unlearning for Legal, Licensing, and Modality Constraints

Yash Sinha, Murari Mandal, Mohan Kankanhalli
2024-12-17
Abstract: User data spread across multiple modalities has popularized multi-modal recommender systems (MMRS). They recommend diverse content such as products, social media posts, TikTok reels, etc., based on a user-item interaction graph. With rising data privacy demands, recent methods propose unlearning private user data from uni-modal recommender systems (RS). However, methods for unlearning item data related to outdated user preferences, revoked licenses, and legally requested removals are still largely unexplored. Previous RS unlearning methods are unsuitable for MMRS due to the incompatibility of their matrix-based representation with the multi-modal user-item interaction graph. Moreover, their data partitioning step degrades performance on each shard due to poor data heterogeneity and requires costly performance aggregation across shards. This paper introduces MMRecUn, the first approach known to us for unlearning in MMRS and unlearning item data. Given a trained RS model, MMRecUn employs a novel Reverse Bayesian Personalized Ranking (BPR) objective to enable the model to forget marked data. The reverse BPR attenuates the impact of user-item interactions within the forget set, while the forward BPR reinforces the significance of user-item interactions within the retain set. Our experiments demonstrate that MMRecUn outperforms baseline methods across various unlearning requests when evaluated on benchmark MMRS datasets. MMRecUn achieves recall performance improvements of up to 49.85% compared to baseline methods and is up to $\mathbf{1.3}\times$ faster than the Gold model, which is trained on the retain set from scratch. MMRecUn offers significant advantages, including superiority in removing target interactions, preserving retained interactions, and zero overhead costs compared to previous methods. The code will be released after review.
Machine Learning, Information Retrieval
What problem does this paper attempt to address?
This paper addresses unlearning (data forgetting) in multi-modal recommender systems (MMRS), in particular the unlearning of item data tied to outdated user preferences, revoked licenses, and legally requested removals. Existing methods focus mainly on unlearning in uni-modal recommender systems (RS), while unlearning in MMRS has received little attention.

### Main problems

1. **Data Forgetting in Multi-Modal Recommendation Systems**:
   - Existing RS unlearning methods do not carry over to MMRS because they rely on matrix-based representations, whereas MMRS are built on a multi-modal user-item interaction graph.
   - Their data partitioning step degrades performance on each shard and requires costly aggregation across shards.
2. **Item-Data Forgetting**:
   - Requests to forget item data (such as audio or video) have not been fully explored. For example, when a music library is removed from TikTok, the recommender must adapt to the removal while keeping the data in the other modalities usable.
3. **Legal and Privacy Compliance**:
   - As data privacy regulations such as GDPR tighten, the recommender must support the "right to be forgotten" to protect the security and privacy of user data.
   - Litigation and changes to content license agreements require the recommender to adapt dynamically to new contractual obligations and legal requirements.
4. **User Interest Evolution**:
   - Users' interests change over time, and the recommender must adjust its recommendations to their new interests to stay relevant and personalized.

### Solutions

The paper proposes a method named MMRecUn to address these problems. Specifically:

- **Reverse Bayesian Personalized Ranking (BPR) Objective**: Attenuates the influence of the marked user-item interactions (the forget set) so that the model "forgets" those data points.
- **Forward BPR Objective on the Retain Set**: Reinforces the user-item interactions in the retain set so that the model keeps its performance on retained data.
- **Multi-Modal Data Forgetting**: Adds a contrastive loss and regularization terms to reduce the influence of the learned item-item semantic correlations, which extends forgetting to the multi-modal data.
- **Balance Forgetting Specificity and Retention Fidelity**: Tunes the hyper-parameter α to trade off forgetting specificity against retention fidelity, so that the model forgets the target data while still performing well on the retained data.

### Experimental Results

Experiments show that MMRecUn outperforms the baseline methods across unlearning request types, including user, item, and simultaneous user-item forgetting. It improves recall by up to 49.85% over the baselines and is up to 1.3 times faster than the Gold model, which is retrained from scratch on the retain set. The reported advantages also include better removal of target interactions, better preservation of retained interactions, and zero overhead costs compared to previous methods.

In conclusion, the paper proposes a solution to the key challenges of unlearning in multi-modal recommender systems and offers a reference point for future recommender system design.
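
The interplay between the reverse and forward BPR objectives and the hyper-parameter α can be illustrated with a small sketch. This is not the authors' implementation: it assumes that "reverse BPR" swaps the roles of the observed and the sampled negative item on the forget set, and that α forms a convex combination of the two terms. The function names `bpr_loss`, `reverse_bpr_loss`, and `unlearning_loss` are hypothetical, and the contrastive and regularization terms mentioned in the summary are omitted.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def bpr_loss(pos_scores, neg_scores):
    """Standard (forward) BPR: mean of -log sigmoid(pos - neg).

    The loss is small when observed items score above sampled negatives.
    """
    pairs = list(zip(pos_scores, neg_scores))
    return -sum(log_sigmoid(p - n) for p, n in pairs) / len(pairs)


def reverse_bpr_loss(pos_scores, neg_scores):
    """Assumed reading of reverse BPR: the same loss with the roles swapped.

    The loss is small when the interactions to forget score *below* negatives.
    """
    return bpr_loss(neg_scores, pos_scores)


def unlearning_loss(retain, forget, alpha):
    """Assumed combination: alpha weights forgetting, (1 - alpha) retention.

    `retain` and `forget` are (pos_scores, neg_scores) tuples.
    """
    retain_pos, retain_neg = retain
    forget_pos, forget_neg = forget
    forget_term = reverse_bpr_loss(forget_pos, forget_neg)
    retain_term = bpr_loss(retain_pos, retain_neg)
    return alpha * forget_term + (1.0 - alpha) * retain_term


if __name__ == "__main__":
    # Toy scores: the model still ranks the to-be-forgotten items highly.
    retain = ([2.0, 1.5], [0.0, 0.5])
    forget = ([3.0, 2.0], [0.0, 0.0])
    for alpha in (0.0, 0.5, 1.0):
        print(alpha, round(unlearning_loss(retain, forget, alpha), 4))
```

Under these assumptions, a larger α puts more weight on pushing forgotten interactions down the ranking, while a smaller α favors preserving the ranking on the retain set.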