Abstract:Chinese text semantic error detection is always the difficult point of Chinese text automatic error detection.In this paper,a semantic error detection model is proposed based on semantic knowledge base and D-S theory.We discuss the building method of the three layers semantic collocation knowledge base and the semantic error detection algorithm based on the three layers semantic collocation knowledge base and D-S theory.Construction of three layers semantic collocation knowledge base is divided into two steps:(1)According to the notional collocation frame in Modern Chinese Dictionary of Notional Words Collocation to construct words collocation rule set,extract collocations from the training corpus based on the rule set,and building the collocation knowledge base through filtering the some collocations by mutual information and co-occurrence frequency;(2)use HowNet to extract the sememe information of word in order to generate the word-sememe and the sememe-sememe knowledge base,and use the polymerization degree model to do the second level filtering.On the basis of the three layers semantic generate knowledge base,a top-down search pattern is used to identify the possible errors firstly,and then the semantic collocation mutual information MI and polymerization degree PD are used as evidences,adopt statistical method to generate basic probability assignment,combining the evidence conflict resolution and the weighted distribution D-S rules to get the relevancy of semantic collocation to determine whether there is a semantic error in Chinese text.The experimental result shows that the F-Score values of the error detecting model and algorithm proposed in this paper improved 14.02% than the best values in the literature.

Study of Semantic Error Detecting Method for Chinese Text

Semantic error checking in automatic proofreading for Chinese texts

A finite-state automata based negation detection algorithm for Chinese clinical documents

CSED: A Chinese Semantic Error Diagnosis Corpus

Winnow-based approach in automatic error detection and correction of Chinese text

Improving Pre-trained Language Models with Syntactic Dependency Prediction Task for Chinese Semantic Error Recognition

Chinese Error Correction of Searching Engine under N-Gram Statistic Model

Error Detection for Text-to-SQL Semantic Parsing

Topic Detection Technology for Chinese Text Based on Statistics and Semantic Information

An Adversarial Multi-Task Learning Method for Chinese Text Correction with Semantic Detection

A New Strategy for Reducing Errors in Scene Text Detection

Research on Chinese Text Error Correction Based on Sequence Model

An Alignment-Agnostic Model for Chinese Text Error Correction

A Study in Dictionary-Based All-word Word Sense Disambiguation for Pre-Qin Chinese

Research on the Application of a Chinese Semantic Knowledge Base in Chinese Phrase Disambiguation

Automatic Detection of Improper Categorization in Semantic Lexicon

Design of Chinese Grammar Recognition and Error Correction Model Based on the Deep Neural Network

Resolving error accumulation of automatically acquiring bilingual lexical knowledge by semantic similarity

A New Evaluation Method: Evaluation Data and Metrics for Chinese Grammar Error Correction

Dynamic Assessment-Based Curriculum Learning Method for Chinese Grammatical Error Correction

Chinese Lexical Sememe Prediction Using CilinE Knowledge