Document Fragmentation for XML Streams Based on Markov Table

HUO Huan, WANG Guo-ren, CHEN Qing-kui, PENG Dun-lu
2010-01-01
Abstract: Unlike queries in traditional databases, queries on XML streams are constrained not only by memory but also by real-time processing requirements. Building on the hole-filler model, a Path Frequency Tree (PFT) is first introduced to represent query statistics. With the help of the PFT, a suffix-merging document fragmentation policy based on a Markov table is developed, and a corresponding fragmentation algorithm is proposed. The algorithm effectively improves the utilization and cohesion of XML fragments. A performance study shows that the Markov-table-based document fragmentation algorithm performs well on query cost and other metrics.