A Study of Speech Intention Understanding Based on Multimodal Information Integration

Bin-bin ZHENG,JIAJia,Lian-hong CAI
DOI: https://doi.org/10.3969/j.issn.1003-6970.2011.05.002
2011-01-01
Abstract:In order to obtain comprehensive speech intention information containing both the literal meaning and speaker’s affective state,a speech understanding method based on multimodal information integration is proposed.Key algorithms including keywords extraction,command analyzing,text/prosody-based affective state determination and multimodal information integration are designed.The method is able to effectively obtain rich intention information by extracting information of different modality from recognition text and speech signal and merging them together,which is helpful to establish a natural human-computer interaction environment.
What problem does this paper attempt to address?