Abstract:English is now widely used in the world as an international language. As a symbol of the development of human civilization, English characters provide an important medium and tool for mankind. In the current information age, the vocabulary of English words is more quantitative, and it is almost everywhere. Under the background of the multiquantification of English words and the quantification of the relationship between words, the similarity measurement analysis and calculation of English words and the classification of vocabulary measurement calculation are carried out by integrating the characteristics of language. The experimental results are as follows: (1) the development situation of English words is analyzed, the research direction of the experiment is determined, the concept of English character features is proposed, and the similarity calculation method is selected according to different features, in order to simplify the complex and difficult-to-understand word meaning relationship between English words; (2) the text features are extracted through the similarity feature selection of language and text. The extraction of features indirectly affects the effectiveness of classification. The similarity word embedding vector is used to map English words into the vector for analysis and comparison, calculate the distance between the similarity numerical variables between English words and their similarity coefficient, measure the distance between them, and evaluate the similarity between them, including the angle cosine method and correlation coefficient method which are the two main methods for calculating the similarity coefficient.

Similarity Measurement and Classification of English Characters Based on Language Features

An adaptive method for text domain similarity calculation

A Text Similarity Measurement Based on Semantic Fingerprint of Characteristic Phrases

Research on Chinese Semantic Similarity Algorithm

Measurement of Text Similarity: A Survey

Measuring Word Similarity Based on Pattern Vector Space Model

A Combined Measure for Text Semantic Similarity

Improving Word Similarity Computation Accuracy by Multiple Parameter Optimization Based on Ontology Knowledge

Research on Word Similarity and Its Application in English Auxiliary Writing

Chinese Word Similarity Computing Based on Combination Strategy

A Similarity Algorithm Based on the Generality and Individuality of Words

Using Multiple Features and Statistical Model to Calculate Text Units Similarity

Application-Oriented Comparison and Evaluation of Six Semantic Similarity Measures Based on Wordnet

A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams As Features in Chinese Text Categorization

Measuring Domain Similarity for Statistical Machine Translation

Learning similarity measures in s

Research on Hownet-Based Chinese Word Lexical Semantic Similarity Measurement

Short Text Similarity Calculation Using Semantic Information

A Novel Linguistic Phenomenon Description for Text Similarity Computing

Quantifying Semantic Similarity of Chinese Words from HowNet

Semantic Word-formation Based Chinese Word Similarity Computing