Multi label classification of Artificial Intelligence related patents using Modified D2SBERT and Sentence Attention mechanism

Yongmin Yoo,Tak-Sung Heo,Dongjin Lim,Deaho Seo
2023-03-03
Abstract:Patent classification is an essential task in patent information management and patent knowledge mining. It is very important to classify patents related to artificial intelligence, which is the biggest topic these days. However, artificial intelligence-related patents are very difficult to classify because it is a mixture of complex technologies and legal terms. Moreover, due to the unsatisfactory performance of current algorithms, it is still mostly done manually, wasting a lot of time and money. Therefore, we present a method for classifying artificial intelligence-related patents published by the USPTO using natural language processing technique and deep learning methodology. We use deformed BERT and sentence attention overcome the limitations of BERT. Our experiment result is highest performance compared to other deep learning methods.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The paper attempts to address the issue of multi-label classification of AI-related patents. Specifically: 1. **Patent Classification Challenge**: Due to the complex legal terminology and technical descriptions contained in patent documents, manual classification is time-consuming, labor-intensive, and prone to errors. 2. **Insufficiency of Existing Algorithms**: Current algorithms perform poorly in classifying AI-related patents. Most patents still require manual classification by lawyers or patent agents, which not only consumes a lot of time and money but is also prone to errors. 3. **Proposing a New Method**: The researchers propose a new method that utilizes natural language processing techniques and deep learning methods to perform multi-label classification on AI-related patents published by the United States Patent and Trademark Office (USPTO) in 2019 and 2020. By using an improved D2SBERT model and sentence attention mechanism, the researchers aim to overcome the sequence length limitations of traditional BERT models and improve classification accuracy. Experimental results show that this method has higher performance compared to other deep learning methods.