Efficient Recovery of Missing Events

Jianmin Wang,Shaoxu Song,Xiaochen Zhu,Xuemin Lin,Jiaguang Sun
DOI: https://doi.org/10.1109/tkde.2016.2594785
IF: 9.235
2016-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:For various entering and transmission issues raised by human or system, missing events often occur in event data, which record execution logs of business processes. Without recovering the missing events, applications such as provenance analysis or complex event processing built upon event data are not reliable. Following the minimum change discipline in improving data quality, it is also rational to find a recovery that minimally differs from the original data. Existing recovery approaches fall short of efficiency owing to enumerating and searching over all of the possible sequences of events. In this paper, we study the efficient techniques for recovering missing events. According to our theoretical results, the recovery problem appears to be NP-hard. Nevertheless, advanced indexing, pruning techniques are developed to further improve the recovery efficiency. The experimental results demonstrate that our minimum recovery approach achieves high accuracy, and significantly outperforms the state-of-the-art technique for up to five orders of magnitudes improvement in time performance.
What problem does this paper attempt to address?