Competitor Mining from Web Encyclopedia: A Graph Embedding Approach

Xin Hong,Peiquan Jin,Lin Mu,Jie Zhao,Shouhong Wan
DOI: https://doi.org/10.1007/978-3-030-62005-9_5
2020-01-01
Abstract:Mining competitors from the web has been a valuable and emerging topic in big data and business analytics. While normal web pages may include incredible information like fake news, in this paper, we aim to extract competitors from web encyclopedia like Wikipedia and DBpedia, which provide more credible information. We notice that the entities in web encyclopedia can form graph structures. Motivated by this observation, we propose to extract competitors by employing a graph embedding approach. We first present a general framework for mining competitors from web encyclopedia. Then, we propose to mine competitors based on the similarity among graph nodes and further present a similarity computation method combing graph-node similarity and textual relevance. We implement the graph-embedding-based algorithm and compare the proposed method with four existing algorithms on the real data sets crawled fromWikipedia and DBpedia. The results in terms of precision, recall, and F1-measure suggest the effectiveness of our proposal.
What problem does this paper attempt to address?