The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition

Dimitrios Kollias, Panagiotis Tzirakis, Alan Cowen, Stefanos Zafeiriou, Irene Kotsia, Alice Baird, Chris Gagne, Chunchang Shao, Guanyu Hu
2024-03-13
Abstract: This paper describes the 6th Affective Behavior Analysis in-the-wild (ABAW) Competition, which is part of the respective Workshop held in conjunction with IEEE CVPR 2024. The 6th ABAW Competition addresses contemporary challenges in understanding human emotions and behaviors, crucial for the development of human-centered technologies. In more detail, the Competition focuses on affect-related benchmarking tasks and comprises five sub-challenges: i) Valence-Arousal Estimation (the target is to estimate two continuous affect dimensions, valence and arousal), ii) Expression Recognition (the target is to recognise between the mutually exclusive classes of the 7 basic expressions and 'other'), iii) Action Unit Detection (the target is to detect 12 action units), iv) Compound Expression Recognition (the target is to recognise between the 7 mutually exclusive compound expression classes), and v) Emotional Mimicry Intensity Estimation (the target is to estimate six continuous emotion dimensions). In the paper, we present these Challenges, describe their respective datasets and challenge protocols (we outline the evaluation metrics) and present the baseline systems as well as their obtained performance. More information for the Competition can be found in:
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the problem of understanding and analyzing human emotions and behaviors in unconstrained, real-world ("in-the-wild") settings, a key challenge in the development of human-centered technologies. Specifically, the 6th Affective Behavior Analysis in-the-wild (ABAW) Competition comprises five sub-challenges, each targeting a specific affect analysis task:

1. **Valence-Arousal Estimation**: The goal is to estimate two continuous affect dimensions, valence and arousal, in each frame of the video. Valence describes how negative or positive an emotional state is; arousal describes how passive or active it is.
2. **Expression Recognition**: The goal is to recognize, in each frame of the video, one of the seven basic expressions (neutral, anger, disgust, fear, happiness, sadness, surprise) or the "other" category. The "other" category covers emotional states that do not belong to the basic expressions.
3. **Action Unit Detection**: The goal is to detect 12 Action Units in each frame of the video. Action Units are specific movements or configurations of facial muscles.
4. **Compound Expression Recognition**: The goal is to recognize seven mutually exclusive compound expressions in each frame of the video. Compound expressions combine basic expressions, for example "fearfully surprised" or "happily surprised".
5. **Emotional Mimicry Intensity Estimation**: The goal is to predict the intensities of six emotion dimensions: "admiration", "amusement", "determination", "empathic pain", "excitement" and "joy".

By posing these tasks, the competition aims to advance the field of affective computing, in particular the ability of machines to understand human emotions and behaviors in varied real-world situations, and thereby to support progress in human-machine interaction technologies.
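To make the five output spaces concrete, the following minimal Python sketch records each sub-challenge's output type and size as stated in the abstract (2 continuous dimensions, 8 exclusive classes, 12 action units, 7 compound classes, 6 continuous dimensions) and checks that a prediction has the right shape. The `TASKS` table, the task keys, and `is_valid_output` are hypothetical names introduced here for illustration; they are not part of the competition's tooling, and the sketch does not assume any particular value range or submission file format.

```python
import math

# Output spaces per sub-challenge, taken from the abstract.
# Keys and the (kind, size) encoding are illustrative assumptions.
TASKS = {
    "valence_arousal": ("continuous", 2),        # valence, arousal
    "expression": ("single_label", 8),           # 7 basic expressions + "other"
    "action_units": ("multi_label", 12),         # 12 independent binary detections
    "compound_expression": ("single_label", 7),  # 7 mutually exclusive classes
    "mimicry_intensity": ("continuous", 6),      # 6 emotion intensities
}


def is_valid_output(task, output):
    """Return True if `output` has the shape expected for `task`."""
    kind, size = TASKS[task]
    if kind == "single_label":
        # Exactly one class index per prediction.
        return isinstance(output, int) and 0 <= output < size
    if not isinstance(output, (list, tuple)) or len(output) != size:
        return False
    if kind == "multi_label":
        # One 0/1 flag per action unit; several may be active at once.
        return all(v in (0, 1) for v in output)
    # Continuous targets: one finite number per dimension.
    return all(isinstance(v, (int, float)) and math.isfinite(v) for v in output)


if __name__ == "__main__":
    print(is_valid_output("valence_arousal", [0.3, -0.5]))
    print(is_valid_output("action_units", [0, 1] * 6))
```

The distinction the table encodes is the main design point: expression and compound expression recognition pick exactly one class, action unit detection allows any subset of the 12 units to be active, and the remaining two tasks are regression problems.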