Decision Tree Classification on Outsourced Data

Koray Mancuhan,Chris Clifton
DOI: https://doi.org/10.48550/arXiv.1610.05796
IF: 5.414
2016-10-18
Machine Learning
Abstract:This paper proposes a client-server decision tree learning method for outsourced private data. The privacy model is anatomization/fragmentation: the server sees data values, but the link between sensitive and identifying information is encrypted with a key known only to clients. Clients have limited processing and storage capability. Both sensitive and identifying information thus are stored on the server. The approach presented also retains most processing at the server, and client-side processing is amortized over predictions made by the clients. Experiments on various datasets show that the method produces decision trees approaching the accuracy of a non-private decision tree, while substantially reducing the client's computing resource requirements.
What problem does this paper attempt to address?