Quasi-optimal Data Placement for Secure Multi-tenant Data Federation on the Cloud

Qi Kang,Ji Liu,Sijia Yang,Haoyi Xiong,Haozhe An,Xingjian Li,Zhi Feng,Licheng Wang,Dejing Dou
DOI: https://doi.org/10.1109/bigdata50022.2020.9377953
2020-01-01
Abstract:As it is difficult to directly share data among different organizations, data federation brings new opportunities to the data-related cooperation among different organizations by providing abstract data interfaces. With the development of Cloud computing, organizations store data on the Cloud to achieve elasticity and scalability for data processing. The existing data placement approaches generally only consider one aspect, which is either communication cost or time cost, and do not consider the features of jobs that process the data. In this paper, we propose an approach to enable secure data processing on the Cloud with the data from different organizations. The approach consists of a data federation platform for secure data processing on the Cloud named FedCube and a greedy data placement algorithm that creates a plan to store data on the Cloud in order to achieve multiple objectives based on a cost model. The cost model is composed of two objectives, i.e., reducing both monetary cost and execution time. We present an experimental evaluation by comparing our data placement algorithm with the existing methods based on the data federation platform. The experiments show that our proposed algorithm significantly reduce the total cost (up to 69.8%).
What problem does this paper attempt to address?