Review of Data Visualization Research
LIU Bin,LIU Zengjie,LIU Yu,LI Ziwen,CHEN Li,SUN Zhongxian,WANG Ying,ZHANG Yihui,ZHAO Jiasheng,ZHANG Hongbin,LIU Qing
DOI: https://doi.org/10.7535/hbkd.2021yx06012
2021-01-01
Abstract:Data visualization plays an important role in discovering rules from massive data,enhancing data performance and improving interaction efficiency.At present,the concept of data visualization and related research fields are expanding.In terms of data types,the current visualization research gradually focuses on the fields of multidimensional data,time series data,network data and hierarchical data.Through the analysis of Chinese and foreign literature on CNKI,it can be seen that 2014 and 2015 are "milestone" years in which the research heat in the field of data visualization is upgraded and a large number of theoretical achievements are produced;Data visualization is an important supporting field of rapid development after the formation of the research upsurge in the field of big data in China;The research in the field of data visualization at home and abroad has basically achieved synchronization in time;Wuhan University,Zhejiang University,Beijing University of Posts and telecommunications,University of national defense science and technology and University of Electronic Science and technology research actively in this field in China.In order to obtain good visual effects,help users reduce the difficulty of understanding,efficiently analyze data and insight value,It is usually necessary to pay attention to technical points such as color and semantics,highlighting core data,preventing data overload and preventing excessive divergence of thinking.The existing data visualization technologies are mainly divided into geometry based technology,icon based technology,dimension reduction based technology,pixel oriented technology,time series based technology,network data based technology,hierarchical visualization technology and distribution technology.Visualization methods based on geometric technology,including parallel coordinates,scatter matrix,Andrews curve,etc;The coordinate based visualization method can clearly show the relationship between variables,but limited by the screen size,it is difficult to visually display all dimensions when the data dimensions exceed three.It needs to be displayed in combination with human-computer interaction technology,which is suitable for the correlation between different dimensions,such as the correlation between students' learning behaviors;Icon based visualization method mainly includes star drawing method and Chernoff surface method.Geometric graphics are used as icons to depict multi-dimensional data,which intuitively reflects the visual significance of each work surface.It is suitable for work completion and incentive work progress overview,etc;The visualization method based on dimension reduction technology determines the coordinates of points according to the dimension attributes and maps them to the low-dimensional visual space on the premise of keeping the data relationship unchanged.The dimension reduction technology mainly involves principal component analysis,self-organizing mapping,isometric mapping,etc;The visualization method based on time series is a visualization method to display the relationship and influence degree between data,mainly including linear graph,stacking graph,horizon graph,etc.the corresponding data is collected with the development of time and presented by the above three visualization methods,which is suitable for representing the flow and change state of information data,such as the trend distribution of grades in different time periods and the change of theme concepts,etc;The core of the visualization method based on network data is the automatic layout algorithm,which draws the graph of network structure through automatic layout and calculation.It mainly strongly guides the layout,circular layout and grid layout,etc.It is commonly used to represent the large-scale social network structure,which is suitable for activity analysis,citation relationship,etc;Hierarchical visualization technology mainly includes node connection,space filling and hybrid methods,etc.it represents the data of hierarchical structure by drawing nodes and bounding boxes with different shapes.It is suitable for the discovery and mining of interactive relationships among group members,such as the interaction between online collaborative employees.Based on the analysis of data visualization CNKI research,this paper puts forward some points for attention in the process of data visualization,and points out that data visualization technology needs to focus on color matching and establish a relationship between color and the importance of data content;The visualization scheme shall reasonably combine and apply relevant visualization technologies based on business logic on the basis of meeting business needs;The unified visualization style helps to improve the coherence,consistency and efficiency of people's understanding of data;At the same time,It also takes into account the aesthetic requirements of users and establishes a reasonable matching relationship between style and color;Data visualization should focus on the practical,reasonable and efficient performance of key processes,key objectives and key results.This paper also summarizes the visualization application example Echarts,including the application of Echarts interactive components (markPoint and markLine annotation point components,datazoom area components,legend interactive components) in visualization,dynamic data rendering and so on.Finally,the challenges and future research directions of visualization are analyzed and prospected,and it is pointed out that virtual reality,visualization system and data analysis are the research directions of visualization in the future.Its application also includes statistical visualization,news visualization,thinking visualization,social network visualization and search log visualization.