Method of Discovering Similar Patents Based on Vector Space Model and Characteristics of Patent Documents

CHEN Ji-xi,GU Xin-jian,CHEN Guo-hai,WEI Jiang
DOI: https://doi.org/10.3785/j.issn.1008-973x.2009.10.018
2009-01-01
Abstract:A method to discover the similarity of patent documents was proposed in order to help enterprises in patent application, protection and utilization. A patent model tree was built based on the characteristics of patent documents. The patent model tree and its nodes were defined. Through analyzing the nodes' attribute values, patent documents were categorized by using the vector space model(VSM) based text categorization technology and the weighted similarities of patent name and patent abstract. According to the categorization, similar patents were discovered by the weighted similarities of patent characteristics in the same category. Several ways to identify the weight of patent characteristics were discussed according to the actual needs in enterprise application. A case study showed that the method can be used in patent categorization and similar patent search.
What problem does this paper attempt to address?