User Identification for Knowledge Graph Construction Across Multiple Online Social Networks

Cuicui Ye,Jing Yang,Yan Mao
DOI: https://doi.org/10.1016/j.aej.2023.04.035
2023-01-01
Abstract:User identification across multiple online social networks is beneficial for building knowledge graphs. Under privacy protection considerations, researchers have shown increasing interest in user identification based on username similarity. However, existing solutions rely on manual features extracted by domain experts and do not exploit the deep semantic features of usernames. Moreover, existing solutions are limited to monolingual user names such as English or Chinese, ignoring other multilingual usernames. This paper proposes a multilingual pre-trained modelbased username similarity method for user identification across multiple online social networks. First, we use many multilingual corpora to enable the model to learn more semantic information and extract deep semantic features of usernames. Then, fine-tuning is performed on our constructed dataset of multilingual usernames across multiple online social networks. Ultimately assess the similarity of user identities across multiple online social networks. Our method facilitates user identification with limited data. Finally, the efficiency of our model is verified on three constructed realworld multilingual username datasets across multiple online social networks and compared with existing state-of-the-art methods. Experimental results show that the proposed algorithm outperforms the compared algorithms.& COPY; 2023 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Alexandria University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).
What problem does this paper attempt to address?