Abstract:The newly emerged machine learning（ML） methods have been widely applied to various applications, and have become a strong driving force to revolutionize a wide range of industries, which have greatly promoted the prosperity and development of artificial intelligence. Meanwhile, the training and inference of the machine learning model are based on a large amount of data, which always contains some private information. And the privacy and security of the ML has faced serious challenges. Membership inference attacks（MIAs） mainly aim to infer whether a data record was used to train a target model or not. MIAs have not only been shown to be effective on various ML models（e.g., classification models and generative models）, but also have been penetrated into the fields of image classification, speech recognition, natural language processing, computer vision and so on, which creates a great security threat to the long-term development of machine learning.Therefore, in order to better improve the security of ML models for membership inference attacks, in this paper, we systematically introduce and analyze the basic principles and characteristics of the MIAs and their defenses from a ML attack-defense perspective. Firstly, we introduce the definitions and threat models of the MIAs, and classify these MIAs from six different perspectives such as attacks’ principles, scenarios, background knowledge, target models, fields and the size of attack datasets, and we compare their advantages and disadvantages. Secondly, we summary the reasons caused the MIAs from three aspects, namely diversity of training data, types of target models and overfitting of target models. Thirdly, we survey defensive techniques for MIAs as well as their characteristics by differential privacy, regularization, data argumentation,model stacking, early stopping, confidence score masking and knowledge distillation. Futhermore, we institute the evaluation metrics and datasets used in MIAs, and the other applications of the MIAs. Finally, by comparing and analyzing the existing MIAs and their defenses, we discuss the challenges and future research directions.

SocInf: Membership Inference Attacks on Social Media Health Data with Machine Learning.

Membership inference attacks on machine learning: A survey

Personal Information Inference in Social Networks

Advancing Membership Inference Attacks: the Present and the Future

Asurvey on membership inference attacks and defenses in Machine Learning

Inference Attacks Based on Neural Networks in Social Networks

Membership inference attacks against synthetic health data

Improved Membership Inference Attacks Against Language Classification Models

Evaluation of Query-Based Membership Inference Attack on the Medical Data

A Method to Facilitate Membership Inference Attacks in Deep Learning Models

Privacy Analysis of Deep Learning in the Wild: Membership Inference Attacks against Transfer Learning

Defenses to Membership Inference Attacks: A Survey

Membership Inference Attacks Against Machine Learning Models Via Prediction Sensitivity.

Systematic Evaluation of Privacy Risks of Machine Learning Models

Membership Inference Attacks Against Recommender Systems

Predicting social media users' indirect aggression through pre-trained models

Privacy-Preserving in Defending Against Membership Inference Attacks

Social Influence‐based Privacy Inference Attacks in Online Social Networks

Efficient Membership Inference Attacks against Federated Learning via Bias Differences