Automatic Grammar Inference Based on Sentence Segmentation for Spoken Chinese

Fang Zheng
2009-01-01
Abstract:The grammar for spoken dialogue systems for information enquiry is often manually designed by experts.Automatic grammar inference method based on sentence segmentation was developed based on an enhanced context free grammar for spoken Chinese.The system parses the training sentences with an initial rule set.If the parsed syntactic tree is incomplete,the top-most constituents are used to recursively infer the missing rules after disambiguation and normalization,and then the rule set is updated.The output grammar is improved by adjusting the processing order of the training sentences to refine the process.Evaluations based on weather forecast enquiries gave a parsing accuracy for the output grammar of 64.8% with an empty initial rule set and 86.4% with an initial rule set including only rules for date descriptions.
What problem does this paper attempt to address?