Big Data Density Analytics Using Parallel Coordinate Visualization

Jinson Zhang, Wen Bo Wang, Mao Lin Huang, Liang Fu Lu, Zhao-Peng Meng
DOI: https://doi.org/10.1109/cse.2014.219
2014-01-01
Abstract: Parallel coordinates are a popular tool for visualizing high-dimensional data and analyzing multivariate data. With the rapid growth of data size and complexity, data clutter in parallel coordinates has become a major issue for Big Data visualization. This gives rise to three problems: 1) how to rearrange the parallel axes without losing data patterns, 2) how to shrink data attributes on each axis without losing data trends, and 3) how to visualize structured and unstructured data patterns for Big Data analysis. In this paper, we introduce the 5Ws dimensions as the parallel axes and establish the 5Ws sending density and receiving density as additional axes for Big Data visualization. Our model not only demonstrates Big Data attributes and patterns, but also reduces data overlapping by up to 80 percent without the loss of data patterns. Experiments show that this new model can be efficiently used for Big Data analysis and visualization.