Large Scale Behavioral Analytics via Topical Interaction

Shih-Chieh Su
DOI: https://doi.org/10.48550/arXiv.1608.07625
2016-08-27
Abstract: We propose the split-diffuse (SD) algorithm, which takes the output of an existing dimension reduction algorithm and distributes the data points uniformly across the visualization space. The result, called the topic grids, is a set of grids on various topics generated from the free-form text content of any domain of interest. The topic grids use the visualization space efficiently to provide visual summaries of massive data. Topical analysis, comparison, and interaction can be performed on the topic grids in a more perceivable way.
Machine Learning
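
The abstract describes the SD algorithm only at a high level: it takes 2-D output from a dimension reduction step and spreads the points uniformly over the visualization space. The sketch below is one possible reading of that idea, not the paper's implementation. It assumes a recursive scheme that splits the points at a rank cut along the longer side of the current grid block and sends each half into its own sub-block until every point has its own cell. The function name `split_diffuse`, the `rows` and `cols` parameters, and the choice of split rule are all hypothetical.

```python
import numpy as np


def split_diffuse(points, rows, cols):
    """Assign each 2-D point to a distinct cell of a rows x cols grid.

    Illustrative sketch only (assumed scheme, not the paper's exact method).
    Returns an int array of shape (n, 2) holding (row, col) for each point.
    """
    points = np.asarray(points, dtype=float)
    n = len(points)
    if n > rows * cols:
        raise ValueError("grid has fewer cells than points")
    cells = np.zeros((n, 2), dtype=int)

    def recurse(idx, r0, r1, c0, c1):
        # Place the points in `idx` inside rows [r0, r1) and cols [c0, c1).
        if len(idx) == 0:
            return
        h, w = r1 - r0, c1 - c0
        if h == 1 and w == 1:
            cells[idx[0]] = (r0, c0)  # at most one point reaches a single cell
            return
        if w >= h:
            # Split step: cut the block into left/right halves, ordering by x.
            mid = c0 + w // 2
            order = idx[np.argsort(points[idx, 0], kind="stable")]
            cap_first, cap_total = h * (mid - c0), h * w
            k = len(order) * cap_first // cap_total  # share proportional to capacity
            # Diffuse step: each half spreads into its own sub-block.
            recurse(order[:k], r0, r1, c0, mid)
            recurse(order[k:], r0, r1, mid, c1)
        else:
            # Same idea along the vertical axis, ordering by y.
            mid = r0 + h // 2
            order = idx[np.argsort(points[idx, 1], kind="stable")]
            cap_first, cap_total = w * (mid - r0), h * w
            k = len(order) * cap_first // cap_total
            recurse(order[:k], r0, mid, c0, c1)
            recurse(order[k:], mid, r1, c0, c1)

    recurse(np.arange(n), 0, rows, 0, cols)
    return cells
```

Under these assumptions, tightly clustered inputs still end up in separate cells while their relative left/right and up/down ordering is roughly preserved, which is the property the abstract attributes to the topic grids.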