Exploring Topical Lead-Lag Across Corpora

Shixia Liu,Yang Chen,Hao Wei,Jing Yang,Kun Zhou,Steven M. Drucker
DOI: https://doi.org/10.1109/tkde.2014.2324581
IF: 9.235
2014-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Identifying which text corpus leads in the context of a topic presents a great challenge of considerable interest to researchers. Recent research into lead-lag analysis has mainly focused on estimating the overall leads and lags between two corpora. However, real-world applications have a dire need to understand lead-lag patterns both globally and locally. In this paper, we introduce TextPioneer, an interactive visual analytics tool for investigating lead-lag across corpora from the global level to the local level. In particular, we extend an existing lead-lag analysis approach to derive two-level results. To convey multiple perspectives of the results, we have designed two visualizations, a novel hybrid tree visualization that couples a radial space-filling tree with a node-link diagram and a twisted-ladder-like visualization. We have applied our method to several corpora and the evaluation shows promise, especially in support of text comparison at different levels of detail.
What problem does this paper attempt to address?