The Method of Chinese Marked with Pinyin with Tonality

MA Zhi-qiang
2008-01-01
Abstract:As there is polyphony in Chinese characters,it is surely difficult for us to mark with pinyin with tonality.The method is designed to solve the problem by combining character with word.Firstly,the dictionary of Pinyin with tonality and the lexicon with Pinyin were created.The Pinyin of polyphone in the dictionary was arrayed by the used frequency,and the dictionary of Pinyin was indexed according to the last character of word.Secondly,the improved reverse maximum matching algorithm based on the two lay indexing structure based binary-seek-by-word was implemented.Finally,three experimental schemes were tested,and the results indicated the method makes error rate down from 11% to 0.09%.
What problem does this paper attempt to address?