Unsupervised Author Disambiguation Using Dempster–Shafer Theory

Hao Wu, Bo Li, Yijian Pei, Jun He
DOI: https://doi.org/10.1007/s11192-014-1283-x
IF: 3.801
2014-01-01
Scientometrics
Abstract: The name ambiguity problem presents many challenges for scholar finding, citation analysis and other related research fields. To address this issue, various disambiguation methods combined with separate disambiguation features have been put forward. In this paper, we offer an unsupervised hierarchical agglomerative clustering algorithm for author disambiguation based on Dempster–Shafer theory (DST). Distinct from existing methods, we use DST in combination with Shannon’s entropy to fuse various disambiguation features and select a more reliable candidate pair of clusters to merge in each iteration of clustering. We also propose several ways to determine the convergence condition of the clustering process. In experiments, our method outperforms three unsupervised models and achieves performance comparable to a supervised model, while requiring no hand-labelled training data.
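To illustrate the kind of evidence fusion the abstract refers to, the following is a minimal sketch of Dempster's rule of combination applied to two features. It is not the paper's formulation: the two-hypothesis frame ("same" vs. "diff" author), the feature names, and the mass values are all assumptions chosen for illustration, and the entropy-based weighting and the clustering loop are omitted.

```python
from itertools import product


def combine(m1, m2):
    """Combine two mass functions with Dempster's rule.

    Each mass function maps a frozenset of hypotheses to a mass in [0, 1],
    and the masses in each function are assumed to sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (a, wa), (b, wb) in product(m1.items(), m2.items()):
        intersection = a & b
        if intersection:
            combined[intersection] = combined.get(intersection, 0.0) + wa * wb
        else:
            # Mass assigned to contradictory hypotheses counts as conflict.
            conflict += wa * wb
    if conflict >= 1.0 - 1e-12:
        raise ValueError("sources are in total conflict; rule is undefined")
    norm = 1.0 - conflict
    return {k: v / norm for k, v in combined.items()}


# Hypothetical frame of discernment for one pair of clusters.
SAME = frozenset({"same"})
DIFF = frozenset({"diff"})
EITHER = SAME | DIFF  # mass left uncommitted (ignorance)

# Illustrative masses only; these values are not taken from the paper.
coauthor_evidence = {SAME: 0.6, DIFF: 0.1, EITHER: 0.3}
venue_evidence = {SAME: 0.5, DIFF: 0.2, EITHER: 0.3}

fused = combine(coauthor_evidence, venue_evidence)
for hypothesis, mass in fused.items():
    print(sorted(hypothesis), round(mass, 3))
```

In an agglomerative setting, a fused belief of this kind could be computed for every candidate pair of clusters and the pair with the strongest support for "same" merged first; how the paper scores and selects pairs is not specified in the abstract.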