Research on Provenance Collection and Storage Based on Object-Based Storage System

Xuelong LIAO,Yulai XIE,Zhen RONG,Leihua QIN,Jianxi CHEN,Dan FENG
DOI: https://doi.org/10.3778/j.issn.1673-9418.1611041
2018-01-01
Abstract:Provenance is metadata that describes the ancestry or history of a digital object.Provenance enhances the value of the data it describes,as it provides answers to questions such as:How is this object created? What other object does this object depend on? How do the ancestries of these two objects differ? This paper analyzes the advantages of using objected-based storage system to store and manage provenance information,designs and implements how to use object-based storage architecture to collect and store provenance information.The system collects the kernel information by using system-status files,uses the JHOVE application to analyze file formats,and uses the Linux audit to monitor ordinary user applications on object-based storage client,and then encapsulates these provenance information into objects,stores them in Berkeley DB or log files in object-based storage devices.The measurement results show that the provenance-aware system based on object-based storage system has a good performance in terms of provenance collection,storage and query.
What problem does this paper attempt to address?