Gradient-Leaks: Enabling Black-Box Membership Inference Attacks Against Machine Learning Models

Gaoyang Liu,Tianlong Xu,Rui Zhang,Zixiong Wang,Chen Wang,Ling Liu
DOI: https://doi.org/10.1109/tifs.2023.3324772
IF: 7.231
2023-11-25
IEEE Transactions on Information Forensics and Security
Abstract:Machine Learning (ML) techniques have been applied to many real-world applications to perform a wide range of tasks. In practice, ML models are typically deployed as the black-box APIs to protect the model owner's benefits and/or defend against various privacy attacks. In this paper, we present Gradient-Leaks as the first evidence showcasing the possibility of performing membership inference attacks (MIAs), with mere black-box access, which aim to determine whether a data record was utilized to train a given target ML model or not. The key idea of Gradient-Leaks is to construct a local ML model around the given record which locally approximates the target model's prediction behavior. By extracting the membership information of the given record from the gradient of the substituted local model using an intentionally modified autoencoder, Gradient-Leaks can thus breach the membership privacy of the target model's training data in an unsupervised manner, without any priori knowledge about the target model's internals or its training data. Extensive experiments on different types of ML models with real-world datasets have shown that Gradient-Leaks can achieve a better performance compared with state-of-the-art attacks.
computer science, theory & methods,engineering, electrical & electronic
What problem does this paper attempt to address?