Time-Series Multi-Level Probabilistic Graphical Model for Representing Lineages over Uncertain Data

ZHU Yunlei,YUE Kun,QIAN Wenhua,YANG Wenjing,LIU Weiyi
DOI: https://doi.org/10.3778/j.issn.1673-9418.1207010
2013-01-01
Abstract:Lineage analysis over uncertain data will trace the origin of uncertainty of data production and evolution with time passing. In order to reflect the inherent time-series property and the process of data evolution, and support probability inferences and uncertainty tracing in lineage analysis, this paper considers the lineages representation of query processing over uncertain data and adopts Bayesian network (BN), an important probabilistic graphical model (PGM), as the framework for uncertainty representation. Specifically, it extends BN by incorporating the time-series and multi-level properties. To provide the basis of lineage analysis models, this paper starts from the Boolean formulasthen gives the corresponding method for constructing BN structures in separate time slices and those between adjacent time slices, as well as the method for computing probability parameters of nodes. Experimental results show that the proposed method for lineage representation is effective and applicable.
What problem does this paper attempt to address?