Abstract: A document's keywords provide a high-level description of its content, summarizing the document's central themes, concepts, ideas, or arguments. These descriptive phrases make it easier for algorithms to find relevant information quickly and efficiently. Keyword extraction therefore plays a vital role in document processing tasks such as indexing, classification, clustering, and summarization. Traditional keyword extraction approaches rely mostly on the statistical distribution of key terms in a document. Recent advances, however, indicate that contextual information is critical in determining the semantics of the text at hand, so context-based features may also benefit keyword extraction. For example, the context of a phrase can be described simply by the word that precedes or follows it. This research presents several experiments to validate that context-based keyword extraction offers a significant advantage over traditional methods. The proposed KeyBERT-based methodology also yields improved results. The proposed work identifies a group of important words or phrases in the document's content that reflect the authors' main ideas, concepts, or arguments, and it uses contextual word embeddings to extract them. Finally, the findings are compared with those obtained using older approaches such as TextRank, RAKE, Gensim, YAKE, and TF-IDF. The Journal of Universal Computer Science (JUCS) dataset was used in our research. Keywords for each research article were produced from its abstract alone, and the KeyBERT model outperformed the traditional approaches in producing keywords similar to those provided by the authors. The average similarity between the keywords produced by our approach and the author-assigned keywords is 51%.
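The simplest notion of context mentioned above, the word immediately before or after a phrase of interest, can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function name `context_features`, the lowercase alphanumeric tokenization, and the use of `None` for a missing neighbour are all assumptions made for illustration.

```python
import re


def context_features(text, phrase):
    """Return a (previous_word, next_word) pair for each occurrence of phrase.

    Hypothetical helper: tokenization is a simple lowercase alphanumeric
    split, which is an assumption and not the paper's preprocessing.
    """
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    target = re.findall(r"[a-z0-9]+", phrase.lower())
    n = len(target)
    features = []
    if n == 0:
        return features
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == target:
            # None marks a phrase at the very start or end of the text.
            prev_word = tokens[i - 1] if i > 0 else None
            next_word = tokens[i + n] if i + n < len(tokens) else None
            features.append((prev_word, next_word))
    return features


sentence = "Contextual word embedding improves keyword extraction in documents."
print(context_features(sentence, "keyword extraction"))  # [('improves', 'in')]
```

Such neighbour pairs are only the crudest form of context. A contextual embedding model would instead encode the whole surrounding passage.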
