Research on method for extracting large-scale social network based on Mapreduce

SHI Quan, XIAO Yang-hua, WEN Wen-hao, ZHU Qian-qian, WANG Heng-shan
DOI: https://doi.org/10.3969/j.issn.1001-3695.2011.01.041
2011-01-01
Abstract: Extracting large-scale social networks from massive heterogeneous Web data is of both theoretical and practical significance. However, a defining feature of this task is its large-scale computation, which remains a major challenge. Cloud computing platforms offer a new opportunity to overcome this challenge. This paper therefore investigates methods for extracting large social networks from Web data using cloud computing techniques. Specifically, it proposes a MapReduce-based approach to extracting a common interest network from DIGG. The experimental results show that the proposed method has good scalability and extensibility, and is capable of extracting large-scale, high-quality social networks from heterogeneous Web data sources.
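The abstract does not describe the algorithm itself, so the following is only a minimal illustrative sketch of how a common interest network could be built in a MapReduce style. It assumes, hypothetically, that the input is a list of (user, story) records and that two users share an interest when they have both voted for the same story. All function names are invented for illustration, and the phases are simulated in memory rather than run on a real cluster.

```python
from collections import defaultdict
from itertools import combinations


def map_phase(records):
    """Map step: key every (user, story) record by its story."""
    for user, story in records:
        yield story, user


def shuffle(pairs):
    """Simulated shuffle: group the mapped users by story key."""
    groups = defaultdict(set)
    for story, user in pairs:
        groups[story].add(user)
    return groups


def reduce_phase(groups):
    """Reduce step: count, for each pair of users, the stories they share.

    On a real cluster this would typically be a second job keyed by the
    user pair; here the counts are simply accumulated in one dictionary.
    """
    weights = defaultdict(int)
    for users in groups.values():
        for u, v in combinations(sorted(users), 2):
            weights[(u, v)] += 1
    return dict(weights)


def extract_common_interest_network(records):
    """Return weighted edges {(user_a, user_b): shared_story_count}."""
    return reduce_phase(shuffle(map_phase(records)))


if __name__ == "__main__":
    # Hypothetical toy data, not taken from the paper.
    sample = [
        ("alice", "s1"), ("bob", "s1"), ("carol", "s1"),
        ("alice", "s2"), ("bob", "s2"),
    ]
    for edge, weight in sorted(extract_common_interest_network(sample).items()):
        print(edge, weight)
```

Keying by story first keeps each reducer's work local to a single story, which is one plausible reason such a task would suit MapReduce. The edge definition and weighting actually used in the paper may differ.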