Subject-action-object-triples-based method for extraction of knowledge gene

Qi Xu, GU Xin-jian
DOI: https://doi.org/10.3785/j.issn.1008-973X.2013.03.001
2013-01-01
Abstract: Taking the patent citation network as the carrier and the basic characteristics of the knowledge gene (stability, heredity, and variability) as the extraction principles, this work proposes a subject-action-object-triple-based method for extracting knowledge genes. First, a connectivity algorithm is applied to analyze the patent citation relationships, mine the knowledge flow of inheritance and development between citing and cited patents, and establish the knowledge evolution trajectory. Next, text parsing is used to extract subject-action-object triples from the patent claims. Finally, semantic processing based on the WordNet semantic repository is carried out to compute semantic similarity, merge synonymous subject-action-object triples, and draw the knowledge genetic map. For validation, 5073 patents related to data mining and granted between 1975 and 1999 were collected from the United States Patent and Trademark Office database, and their geographical and annual distributions were analyzed. Patent citation relations were obtained by querying the National Bureau of Economic Research (NBER) patent data set, and the network analysis software Pajek was used to build the patent citation network. With this patent citation network as experimental data, the proposed knowledge gene extraction method was validated. The experimental results show that the extracted subject-action-object triples possess the basic characteristics of the knowledge gene and can therefore serve as one form of knowledge gene.
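The merging of synonymous subject-action-object triples described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The names SAO, SYNONYM_GROUPS, word_similarity, triple_similarity, and merge_triples are hypothetical, the small synonym table is a toy stand-in for WordNet, and the similarity measure (the mean of per-slot word similarities) and the threshold are assumptions chosen only for illustration.

```python
from typing import List, NamedTuple


class SAO(NamedTuple):
    """A subject-action-object triple (hypothetical container)."""
    subject: str
    action: str
    obj: str


# Toy stand-in for WordNet synsets (assumption, not real WordNet data).
SYNONYM_GROUPS = [
    {"store", "save", "keep"},
    {"database", "repository"},
    {"record", "entry"},
]


def word_similarity(a: str, b: str) -> float:
    """Return 1.0 for identical or synonymous words, otherwise 0.0."""
    a, b = a.lower(), b.lower()
    if a == b:
        return 1.0
    if any(a in group and b in group for group in SYNONYM_GROUPS):
        return 1.0
    return 0.0


def triple_similarity(t1: SAO, t2: SAO) -> float:
    """Average the word similarity over subject, action, and object."""
    return sum(word_similarity(x, y) for x, y in zip(t1, t2)) / 3


def merge_triples(triples: List[SAO], threshold: float = 0.99) -> List[List[SAO]]:
    """Greedily group triples whose similarity to a group's first member
    reaches the threshold; each group stands for one merged triple."""
    groups: List[List[SAO]] = []
    for triple in triples:
        for group in groups:
            if triple_similarity(triple, group[0]) >= threshold:
                group.append(triple)
                break
        else:
            groups.append([triple])
    return groups


# Example input: three triples, two of which are synonymous in every slot.
triples = [
    SAO("database", "store", "record"),
    SAO("repository", "save", "entry"),
    SAO("classifier", "label", "record"),
]
groups = merge_triples(triples)
```

In a fuller pipeline, word_similarity would presumably be replaced by a graded WordNet-based measure, and each resulting group would be tracked along the citation trajectory to check for stability, heredity, and variability.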