DeepEMO: Deep Learning for Speech Emotion Recognition

Enkhtogtokh Togootogtokh, Christian Klasen
DOI: https://doi.org/10.48550/arXiv.2109.04081
2021-09-09
Abstract: We propose an industry-level deep learning approach for the speech emotion recognition task. In industry, carefully designed deep transfer learning delivers practical results, mainly because training data is scarce, machine training is costly, and models must specialize on dedicated AI tasks. The proposed speech recognition framework, called DeepEMO, consists of two main pipelines: preprocessing to extract efficient main features, and a deep transfer learning model to train and recognize. The main source code is available in the repository at https://github.com/enkhtogtokh/deepemo.
Sound, Machine Learning, Audio and Speech Processing