Study of Semantic Error Detecting Method for Chinese Text

Yang-Sen ZHANG,Jia ZHENG
DOI: https://doi.org/10.11897/SP.J.1016.2017.00911
2017-01-01
Abstract:Chinese text semantic error detection is always the difficult point of Chinese text automatic error detection.In this paper,a semantic error detection model is proposed based on semantic knowledge base and D-S theory.We discuss the building method of the three layers semantic collocation knowledge base and the semantic error detection algorithm based on the three layers semantic collocation knowledge base and D-S theory.Construction of three layers semantic collocation knowledge base is divided into two steps:(1)According to the notional collocation frame in Modern Chinese Dictionary of Notional Words Collocation to construct words collocation rule set,extract collocations from the training corpus based on the rule set,and building the collocation knowledge base through filtering the some collocations by mutual information and co-occurrence frequency;(2)use HowNet to extract the sememe information of word in order to generate the word-sememe and the sememe-sememe knowledge base,and use the polymerization degree model to do the second level filtering.On the basis of the three layers semantic generate knowledge base,a top-down search pattern is used to identify the possible errors firstly,and then the semantic collocation mutual information MI and polymerization degree PD are used as evidences,adopt statistical method to generate basic probability assignment,combining the evidence conflict resolution and the weighted distribution D-S rules to get the relevancy of semantic collocation to determine whether there is a semantic error in Chinese text.The experimental result shows that the F-Score values of the error detecting model and algorithm proposed in this paper improved 14.02% than the best values in the literature.
What problem does this paper attempt to address?