Bibliometric Study on Big Data Research: an Integration of Topic Model and Citation Network Analysis.

Dong Ke,Wu Jiang,Cheng Ni
2017-01-01
Abstract:Big data has been attracting wide attention due to its great importance. The rapid development of big data in recent years has led to a large amount of publications containing the achieved knowledge of this area. To study the intellectual structure of the related research about big data, a retrospective bibliometric analysis is conducted based on the Web of Science databases. 13673 papers and 8016 citation links are collected. The LDA topic model are used to detect the topic distribution of big data research area. 11 topics about big data technology, application and security are found. Island algorithm is applied to find the most influential 28 research communities from citation network, and these communicates are labelled through topic distribution rather than traditional way of single tag. Labelling each cluster by topic distribution can reveal the research content for sub-structures, which reflects a much more comprehensive picture of big data research area and provide a valuable reference for researchers to understand the overview and present situations in this field.
What problem does this paper attempt to address?