EventDB: A Large-Scale Semi-Structured Scientific Data Management System
Wenjia Zhao, Yong Qi, Di Hou, Peijian Wang, Xin Gao, Zirong Du, Yudong Zhang, Yongfang Zong
DOI: https://doi.org/10.1007/978-3-030-28061-1_12
2019-01-01
Abstract: In scientific research, the volume of data collected from experimental devices has reached hundreds of PB per year. How to use these data efficiently to produce scientific findings is an active research problem. Using such scientific big data poses many challenges, including the storage, processing, and sharing of the data. In this paper, we propose EventDB, a data management system for scientific big data. EventDB provides data management functions for massive semi-structured scientific data. Within EventDB, we propose IndexDB to provide faster data retrieval, cross-domain access to provide better data sharing, and operator libraries to provide higher-performance data analysis. Our preliminary experiments show that our system improves data retrieval performance by more than 6 times.