Abstract: The assessment of semantic similarity between lexical terms plays a critical part in semantic-oriented applications in natural language processing and cognitive science. Optimizing the calculation models remains a challenging issue for improving the performance of similarity measurement. In this paper, we investigate WordNet-based measures, including distance-based, information-based, feature-based and hybrid measures. Among them, the distance-based measures are considered to have the lowest computational complexity because they rely on a simple distance calculation. However, most existing work determines path distances solely from the hyponymy relation between concepts, and so ignores both the meronymy relation and the non-uniformity of path distances caused by different semantic relations. To solve this problem, we propose a novel model to calculate the path distance between concepts, together with a similarity measure that nonlinearly transforms the distance into semantic similarity. In the proposed model, each edge linking two concepts is assigned a weight according to the type of relation it represents. On the basis of the distance model, we use five structural properties of WordNet for similarity measurement: multiple meanings, multiple inheritance, link type, depth and local density. Our similarity measure is compared against state-of-the-art WordNet-based measures on the M&C, R&G and WS-353 datasets. In the experimental results, the proposed measure outperforms the others in terms of both Pearson and Spearman correlation coefficients, which indicates the effectiveness of our distance model. In addition, we construct six additional benchmarks to show that the proposed measure maintains stable performance.
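The idea of relation-weighted path distance followed by a nonlinear transform can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy concept graph, the weight values in `RELATION_WEIGHTS`, the use of Dijkstra's algorithm for the shortest weighted path, and the exponential decay `exp(-alpha * d)` as the nonlinear transform. The five structural properties named in the abstract are not modeled.

```python
import heapq
import math

# Hypothetical relation weights; the abstract does not state the actual values.
RELATION_WEIGHTS = {"hyponymy": 1.0, "meronymy": 1.5}

# Toy concept graph (illustrative only): (concept_a, concept_b, relation).
EDGES = [
    ("car", "vehicle", "hyponymy"),
    ("truck", "vehicle", "hyponymy"),
    ("wheel", "car", "meronymy"),
    ("wheel", "truck", "meronymy"),
    ("vehicle", "artifact", "hyponymy"),
]


def build_graph(edges, weights):
    """Build an undirected adjacency list whose edge cost depends on relation type."""
    graph = {}
    for a, b, relation in edges:
        w = weights[relation]
        graph.setdefault(a, []).append((b, w))
        graph.setdefault(b, []).append((a, w))
    return graph


def path_distance(graph, source, target):
    """Shortest weighted path length between two concepts (Dijkstra's algorithm)."""
    best = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, node = heapq.heappop(heap)
        if node == target:
            return d
        if d > best.get(node, math.inf):
            continue  # stale heap entry
        for neighbor, w in graph.get(node, []):
            nd = d + w
            if nd < best.get(neighbor, math.inf):
                best[neighbor] = nd
                heapq.heappush(heap, (nd, neighbor))
    return math.inf  # no path between the two concepts


def similarity(distance, alpha=0.5):
    """Assumed nonlinear transform: similarity decays exponentially with distance."""
    return math.exp(-alpha * distance)


graph = build_graph(EDGES, RELATION_WEIGHTS)
d = path_distance(graph, "car", "truck")
print(d, similarity(d))
```

In this toy graph the path from `car` to `truck` through `vehicle` uses two hyponymy edges, while the path through `wheel` uses two meronymy edges; with the assumed weights the hyponymy path is shorter, which shows how relation-dependent weights make path distances non-uniform.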

Evaluation of taxonomic and neural embedding methods for calculating semantic similarity

A Novel Comprehensive Approach for Estimating Concept Semantic Similarity in WordNet

A New Hybrid Improved Method for Measuring Concept Semantic Similarity in WordNet

Measuring Distance-Based Semantic Similarity Using Meronymy and Hyponymy Relations

Bridging the Semantic Latent Space Between Brain and Machine: Similarity is All You Need

A WordNet-based hybrid semantic similarity measurement

A New Measure of Word Semantic Similarity Based on WordNet Hierarchy and DAG Theory

A Hybrid Approach for Measuring Semantic Similarity Based on IC-weighted Path Distance in WordNet

Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy

A Hybrid Semantic Similarity Measurement for Geospatial Entities

Lexical semantics enhanced neural word embeddings

Computing Semantic Similarity Based on Novel Models of Semantic Representation Using Wikipedia.

Quantifying Semantic Similarity of Chinese Words from HowNet

From Ontology to Semantic Similarity: Calculation of Ontology-Based Semantic Similarity

Semantic Similarity Computing Model Based on Multi Model Fine-Grained Nonlinear Fusion

Measuring Semantic Similarity Based on WordNet

A novel model for semantic similarity measurement based on wordnet and word embedding

Using a Chinese Lexicon to Learn Sense Embeddings and Measure Semantic Similarity.

Semantic Similarity Calculation Based on Sememe Set

A Large Probabilistic Semantic Network Based Approach to Compute Term Similarity

A New Model to Compute Semantic Similarity from Multi-ontology.