Discovery of factors influencing patent value based on machine learning in patents in the field of nanotechnology

Scott D. Bass,Lukasz A. Kurgan
DOI: https://doi.org/10.1007/s11192-009-0008-z
IF: 3.801
2009-06-09
Scientometrics
Abstract:Patents represent the technological or inventive activity and output across different fields, regions, and time. The analysis of information from patents could be used to help focus efforts in research and the economy; however, the roles of the factors that can be extracted from patent records are still not entirely understood. To better understand the impact of these factors on patent value, machine learning techniques such as feature selection and classification are used to analyze patents in a sample industry, nanotechnology. Each nanotechnology patent was represented by a comprehensive set of numerical features that describe inventors, assignees, patent classification, and outgoing references. After careful design that included selection of the most relevant features, selection and optimization of the accuracy of classification models that aimed at finding most valuable (top-performing) patents, we used the generated models to analyze which factors allow to differentiate between the top-performing and the remaining nanotechnology patents. A few interesting findings surface as important such as the past performance of inventors and assignees, and the count of referenced patents.
information science & library science,computer science, interdisciplinary applications
What problem does this paper attempt to address?