Real-time translation of discrete Sinhala speech to Unicode text

M.K.H. Gunasekara,R.G.N. Meegama
DOI: https://doi.org/10.1109/icter.2015.7377680
2015-08-01
Abstract:This paper presents a methodology to translate discrete Sinhala speech to Sinhala Unicode text in real time. Initially, the Hidden Markov Model and the associated Hidden Markov Toolkit (HTK) is used as the speech recognizer. While real time decoding is obtained by the Julius decoder a three-states Bakis HMM topology is used to build the acoustic model. The normalized Mel frequency cepstral coefficients with zeroth coefficient as the feature vector is used to recognize speech. Although a single person is used during the training session, an average accuracy of 85% is obtained for both speaker dependent and speaker independent speech recognition. Performance evaluation shows the capabilities of the proposed system to convert discrete Sinhala speech to Sinhala Unicode in both quiet and noisy environments.
What problem does this paper attempt to address?