Objective Assessment of Communication Speech Interference Effect Based on Feature Fusion

LIN Yun, XU Huaitao, WANG Sen, ZHANG Sicheng, ZHUANG Long
DOI: https://doi.org/10.11959/j.issn.1000-436x.2023043
2023-01-01
Abstract: To address the problem of objectively assessing the effect of interference on communication speech, two methods were proposed, one based on multi-measure fusion and one based on multimodal fusion. First, the interfered speech was preprocessed with an endpoint detection algorithm and a time warping algorithm. Then, the speech content was extracted and compared against the standard speech to compute five kinds of measures. After the five measures were fused, a random forest model was used to assess the quality level. Finally, a neural network model based on a residual structure was designed in combination with a multimodal fusion technique; it fused the graph-domain and measure-domain features of the interfered speech data and performed quality level assessment. Experimental results show that the accuracy of both methods exceeds 90%. Of the two, the multimodal assessment method improves accuracy by about 3.269% compared with existing methods, which indicates that it performs better.
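The preprocessing stage described in the abstract can be illustrated with a minimal sketch. The abstract does not say which endpoint detection or time warping algorithms were used, so this example assumes short-time energy thresholding for endpoint detection and classic dynamic time warping (DTW) for alignment. The function names `frame_energies`, `detect_endpoints`, and `dtw_distance`, along with the `frame_len` and `ratio` parameters, are hypothetical and chosen only for illustration.

```python
import math


def frame_energies(signal, frame_len):
    """Short-time energy of consecutive non-overlapping frames."""
    return [
        sum(x * x for x in signal[i:i + frame_len])
        for i in range(0, len(signal) - frame_len + 1, frame_len)
    ]


def detect_endpoints(signal, frame_len=4, ratio=0.1):
    """Return (start, end) sample indices of the active region, or None.

    Assumption: a frame counts as speech when its energy exceeds
    ratio * (maximum frame energy). This is an illustrative rule,
    not necessarily the one used in the paper.
    """
    energies = frame_energies(signal, frame_len)
    if not energies or max(energies) == 0:
        return None
    threshold = ratio * max(energies)
    active = [i for i, e in enumerate(energies) if e > threshold]
    if not active:
        return None
    return active[0] * frame_len, (active[-1] + 1) * frame_len


def dtw_distance(a, b):
    """Classic DTW cost between two sequences, using |x - y| as local cost."""
    n, m = len(a), len(b)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of: insertion, deletion, or match.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


if __name__ == "__main__":
    # Toy signal: silence, a short burst, silence.
    signal = [0.0] * 8 + [1.0, -1.0, 1.0, -1.0] * 2 + [0.0] * 8
    print(detect_endpoints(signal))
    # A time-stretched copy aligns to its reference at zero cost.
    print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
```

In a pipeline like the one the abstract outlines, the trimmed and aligned speech would then be compared against the standard speech to produce the measures that feed the classifier; that later stage is not sketched here because the abstract does not name the five measures.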