Network Public Opinion Monitoring Based on MapReduce

Yunhong Zhao,Suruo Li
DOI: https://doi.org/10.1088/1742-6596/2037/1/012031
2021-09-01
Journal of Physics: Conference Series
Abstract:Public opinion information on the network is abundantly decentralized in Internet, these false ones, easily misleading, not legally binding, it is difficult to deal with. Quickly master information is a prerequisite to deal with the network of public opinion, clustering algorithm in data mining plays an important role in in the collection of information statistics. Disadvantages of classical clustering methods are high resources consumption, low efficiency, high time and space complexity, and difficult to deal with large-scale data processing in massive network. To the network public opinion text as the research object, in-depth study of summarizing the network public opinion monitoring technology based on distributed MapReduce, including parallel technology, improved clustering algorithm, relational database and distributed database construction, which is in order to improve the efficiency of information processing, to reduce the pin.
What problem does this paper attempt to address?