Chinese Prosodic Phrasing Based on Extension Matrix Theory

谌卫军,林福宗,李建民,张钹
DOI: https://doi.org/10.3321/j.issn:0254-4164.2003.01.004
2003-01-01
Jisuanji Xuebao/Chinese Journal of Computers
Abstract:This paper presents a new inductive learning algorithm based on the extension matrix theory, and uses it to solve the prosodic phrasing problem for Chinese Text-to-Speech systems. Authors propose a novel definition of the consistency of a rule and of a set of positive examples, and reveal their relationship using a theorem: By dividing the positive examples of a specific class in a given example set into consistent groups and adopting a simple strategy to find a conjunctive rule for each group which covers all the group's positive examples and none of the negative examples, the algorithm finds a set of consistent rules in the form of variable-valued logic. Authors collect 937 sentences of different genres (about 78 minutes length) from CCTV news program and built a large speech corpus. A group of features for modeling prosody are also proposed, and their effectiveness is measured by the interpretation of the resulting rules. Lastly, a serial of experiments are conducted. The data is divided into two parts: training set and test set, and the experimental results show that authors' method achieves higher accuracy, better interpretation and less rules than other algorithms. And the generated rules are quite similar to hand-crafted ones, which may help us better understand the relationship between Chinese syntax and prosody.
What problem does this paper attempt to address?