Enhancing patent text classification with Bi-LSTM technique and alpine skiing optimization for improved diagnostic accuracy
Zhang, Fan
DOI: https://doi.org/10.1007/s11042-024-18806-8
IF: 2.577
2024-05-08
Multimedia Tools and Applications
Abstract: Technological innovation is increasingly valued by companies around the world, and patents, as an important recording medium of innovation, are growing explosively. Patent classification assigns technical documents to predefined categories. However, the sheer number of patent documents makes classification challenging and makes it hard to extract effective knowledge from them. Automatic annotation and classification of patents has therefore intrigued many scholars, and it is the subject of this study. First, we analyze patent text features, including application number, publication number, application date, publication date, classification number, applicant, inventor, title, abstract, claims, full text, and the like. Second, we select patent text fields and propose classifying three tags, namely technical field, technical effect, and technical means, from the title, abstract, specification, background, and full-text fields of the patent, and describe the effectiveness of this approach. Then, we train three different models, Text Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (Bi-LSTM), and ALBERT-Tiny, and choose Bi-LSTM; multiple factors are then employed to validate processing time and accuracy. Moreover, Alpine Skiing Optimization (ASO) is used as an initial search strategy for feature classification, focusing on selecting the relevant features for analysis. Finally, with parameters trained using the Bi-LSTM model, the prediction accuracies for technical field, technical effect, and technical means were 88.39%, 88.67%, and 88.53%, respectively. Compared with existing methods on the performance metrics accuracy, precision, and recall, the proposed method attained higher performance, with 98.01%, 97%, and 98.56%, respectively.
This model can satisfy actual needs, and its accuracy can be improved through further enrichment of features.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering