An Efficient Approach for Audio-Visual Emotion Recognition with Missing Labels and Missing Modalities

Fei Ma,Shao-Lun Huang,Lin Zhang
DOI: https://doi.org/10.1109/icme51207.2021.9428219
2021-01-01
Abstract:Audio-visual emotion recognition is important for human-machine interaction systems by combining the information of audio and visual modalities. Although great progress has been made by previous works using multimodal learning compared with unimodal learning, they still cannot effectively deal with two key challenges. Firstly, it is difficult or expensive to acquire labeled emotional data, which r...
What problem does this paper attempt to address?