Safe machine learning model release from Trusted Research Environments: The SACRO-ML package

Jim Smith,Richard J. Preen,Andrew McCarthy,Maha Albashir,Alba Crespi-Boixader,Shahzad Mumtaz,James Liley,Simon Rogers,Yola Jones
2024-10-24
Abstract:We present SACRO-ML, an integrated suite of open source Python tools to facilitate the statistical disclosure control (SDC) of machine learning (ML) models trained on confidential data prior to public release. SACRO-ML combines (i) a SafeModel package that extends commonly used ML models to provide ante-hoc SDC by assessing the vulnerability of disclosure posed by the training regime; and (ii) an Attacks package that provides post-hoc SDC by rigorously assessing the empirical disclosure risk of a model through a variety of simulated attacks after training. The SACRO-ML code and documentation are available under an MIT license at <a class="link-external link-https" href="https://github.com/AI-SDC/SACRO-ML" rel="external noopener nofollow">this https URL</a>
Machine Learning,Cryptography and Security,Information Retrieval
What problem does this paper attempt to address?