Timestamped State Sharing for Stream Analytics

Yunjian Zhao,Zhi Liu,Yidi Wu,Guanxian Jiang,James Cheng,Kunlong Liu,Xiao Yan
DOI: https://doi.org/10.1109/tpds.2021.3073253
IF: 5.3
2021-11-01
IEEE Transactions on Parallel and Distributed Systems
Abstract:State access in existing distributed stream processing systems is restricted locally within each operator. However, in advanced stream analytics such as online learning and dynamic graph analytics, enabling state sharing across different operators makes application development easier and stream processing more efficient. In addition, when stream records are timestamped, proper time semantics should be defined for both state updates and fetches. We propose a new state abstraction to address the limitations of existing systems and develop a distributed stream processing system, Nova, with native support for timestamped state sharing. We validate the expressiveness and efficiency of Nova with extensive experiments.
computer science, theory & methods,engineering, electrical & electronic
What problem does this paper attempt to address?