Hfilter: Hybrid Finite Automaton Based Stream Filtering For Deep And Recursive Xml Data

Weiwei Sun,Yongrui Qin,Ping Yu,Zhuoyao Zhang,Zhenying He
DOI: https://doi.org/10.1007/978-3-540-85654-2_48
2008-01-01
Abstract:XML filtering applications are pining increasing popularity recently. Automata are generally adopted to construct query indexes for evaluating large numbers of XPath queries over XML streams. Usually only shallow data are observed in existing approaches. How to process deep and recursive XML data with low memory limitation efficiently is still a challenging issue. In this paper, we propose HFilter, a Hybrid Finite Automaton (HFA) based stream filtering approach, to solve this problem. We introduce the basic two-tier HFA (lazy DFA tier and NFA tier) first. which realizes data prefix sharing and memory overflow control to improve the filtering throughput. Then an optimized three-tier HFA with an extra pre-expanded DFA tier is Put forward, which significantly reduces the restarting cost of HFA after memory overflow. Experiments show that our approaches work more efficiently than existing ones.
What problem does this paper attempt to address?