Language-Independent Type Inference of the Instances from Multilingual Wikipedia

Tianxing Wu, Guilin Qi, Bin Luo, Lei Zhang, Haofen Wang
DOI: https://doi.org/10.4018/ijswis.2019040102
2019-01-01
International Journal on Semantic Web and Information Systems
Abstract: Extracting knowledge from Wikipedia has attracted much attention over the past ten years. One of the most valuable kinds of knowledge is type information, which refers to axioms stating that an instance is of a certain type. Current approaches for inferring the types of instances from Wikipedia rely mainly on language-specific rules. Since these rules cannot capture the semantic associations between instances and classes (i.e., candidate types), they may lead to mistakes and omissions in the process of type inference. The authors propose a new approach that leverages attributes to perform language-independent type inference of instances from Wikipedia. The proposed approach is applied to the whole English and Chinese Wikipedia, which results in the first version of MulType (Multilingual Type Information), a knowledge base describing the types of instances from multilingual Wikipedia. Experimental results show not only that the proposed approach outperforms the state-of-the-art comparison methods, but also that MulType contains a large amount of new, high-quality type information.