Semantic Feature Extraction Based on Subspace Learning with Temporal Constraints for Acoustic Event Recognition

Qiuying Shi, Shiwen Deng, Jiqing Han
DOI: https://doi.org/10.1016/j.dsp.2020.102947
IF: 3.614
2022-01-01
Applied Acoustics
Abstract: For acoustic event recognition (AER), it is important to extract semantic features that jointly capture content information and temporal ordering. Our previous work obtained this type of semantic feature by independently learning a related subspace for each acoustic event. However, because that method uses a different subspace for each event, subspace consistency cannot be guaranteed, so the differences between semantic features arise not only from semantic characteristics but also from unwanted subspace inconsistency. To solve this problem, we propose a common subspace learning (CSL) based method for extracting the semantic features of various events. To obtain the common subspace, CSL minimizes an objective that jointly considers the content information and the temporal orderings of the various events in the subspace, and we provide an efficient algorithm for solving the resulting optimization problem. In the obtained common subspace, the semantic feature of each event is represented by a corresponding projection matrix. Since all of these projection matrices map onto the same shared subspace, subspace consistency is ensured. To evaluate our method, experiments are conducted on the AudioEvent, ESC-50, and ESC-10 databases, and the results indicate that CSL outperforms our previous method and related state-of-the-art methods.