Privacy Preserving in distributed SVM data mining over horizontally partitioned data

Mohammed Z. Omer, Hui Gao, Nadir Mustafa
DOI: https://doi.org/10.1109/ICCWAMTIP.2016.8079835
2016-01-01
Abstract: Data mining algorithms can process data either centrally or in a distributed fashion. Outsourcing data can ease the burden of processing, storing, and analyzing massive datasets. Because data often resides in several places, and combining it can improve classification results, we propose a solution for distributed data mining that preserves privacy. A critical obstacle to the free sharing of information, however, is confidentiality and security. Classifying new instances is one of the central tasks of data mining and machine learning, and support vector machine (SVM) algorithms are widely used for it across many application areas. We propose a privacy-preserving solution for SVM classification over horizontally partitioned data. Our solution constructs a global SVM classification model from a global Gram matrix, without revealing sensitive data. Our experimental results show that the distributed SVM built on the Gram matrix reaches an accuracy of up to 97% while preserving privacy.
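To make the central idea concrete, the following is a minimal sketch of why a global Gram matrix is enough to train a kernel SVM over horizontally partitioned data (each party holds different rows with the same features). It is not the authors' protocol: the abstract does not say how the cross-party Gram entries are computed privately, so here they are computed in the clear as a stand-in. The function names, the toy data, the linear kernel with a +1 term to absorb the bias, and the simple dual coordinate ascent solver are all assumptions made for illustration.

```python
# Illustrative sketch only. All names and data are hypothetical, not from the paper.

def linear_kernel(u, v):
    # Linear kernel; the +1.0 absorbs the bias term so no separate b is needed.
    return sum(a * b for a, b in zip(u, v)) + 1.0

def global_gram(rows):
    # ASSUMPTION: computed in the clear. In a privacy-preserving setting the
    # cross-party entries would come from a secure protocol (not shown here).
    return [[linear_kernel(u, v) for v in rows] for u in rows]

def train_dual_svm(K, y, C=1.0, epochs=100):
    # Dual coordinate ascent on the bias-free soft-margin SVM dual:
    #   maximize sum(alpha) - 0.5 * sum_ij alpha_i alpha_j y_i y_j K_ij
    #   subject to 0 <= alpha_i <= C.
    # Only the Gram matrix K and the labels y are used, never the raw rows.
    n = len(y)
    alpha = [0.0] * n
    for _ in range(epochs):
        for i in range(n):
            f_i = sum(alpha[j] * y[j] * K[i][j] for j in range(n))
            grad = 1.0 - y[i] * f_i
            alpha[i] = min(C, max(0.0, alpha[i] + grad / K[i][i]))
    return alpha

def decision_values(K, y, alpha):
    # Decision value for each training point, again using only K.
    n = len(y)
    return [sum(alpha[j] * y[j] * K[i][j] for j in range(n)) for i in range(n)]

# Horizontal partition: two parties hold different rows over the same two features.
party_a_rows, party_a_labels = [[2.0, 1.0], [-1.0, -2.0]], [1, -1]
party_b_rows, party_b_labels = [[1.0, 2.0], [-2.0, -1.0]], [1, -1]

all_rows = party_a_rows + party_b_rows
labels = party_a_labels + party_b_labels

K = global_gram(all_rows)
alpha = train_dual_svm(K, labels)
predictions = [1 if v > 0 else -1 for v in decision_values(K, labels, alpha)]
```

Once `K` exists, training and prediction never touch `all_rows` again, which is the property a Gram-matrix-based scheme relies on. The privacy guarantee then depends entirely on how the cross-party blocks of `K` are obtained, which this sketch leaves out.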