RICA2: Rubric-Informed, Calibrated Assessment of Actions

Abrar Majeedi,Viswanatha Reddy Gajjala,Satya Sai Srinath Namburi GNVV,Yin Li

2024-08-07

Abstract:The ability to quantify how well an action is carried out, also known as action quality assessment (AQA), has attracted recent interest in the vision community. Unfortunately, prior methods often ignore the score rubric used by human experts and fall short of quantifying the uncertainty of the model prediction. To bridge the gap, we present RICA^2 - a deep probabilistic model that integrates score rubric and accounts for prediction uncertainty for AQA. Central to our method lies in stochastic embeddings of action steps, defined on a graph structure that encodes the score rubric. The embeddings spread probabilistic density in the latent space and allow our method to represent model uncertainty. The graph encodes the scoring criteria, based on which the quality scores can be decoded. We demonstrate that our method establishes new state of the art on public benchmarks, including FineDiving, MTL-AQA, and JIGSAWS, with superior performance in score prediction and uncertainty calibration. Our code is available at <a class="link-external link-https" href="https://abrarmajeedi.github.io/rica2_aqa/" rel="external noopener nofollow">this https URL</a>

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

### Problems Addressed by the Paper This paper aims to address two main issues in Action Quality Assessment (AQA): 1. **Ignoring the Scoring Rubric**: Existing methods often ignore the scoring rubric used by human experts, leading to insufficient accuracy in model predictions. 2. **Prediction Uncertainty**: Existing methods fail to effectively quantify the uncertainty of model predictions, i.e., the confidence of the model's predictions. This is particularly important in high-risk scenarios (such as sports competitions or surgical procedures) because understanding prediction uncertainty can help decide whether further evaluation by human experts is needed. To address these shortcomings, the authors propose RICA2 (Rubric-Informed, Calibrated Assessment of Actions), a deeply integrated probabilistic model that incorporates the scoring rubric and quantifies prediction uncertainty. RICA2 encodes the scoring rubric through a graph structure and defines random embeddings on the graph to propagate probability densities in the latent space, representing the model's uncertainty. This approach not only improves prediction accuracy but also provides calibrated uncertainty estimates. ### Main Contributions 1. **Proposing RICA2**: A novel deep probabilistic method that combines scoring rubrics and uncertainty modeling to achieve accurate action scoring and calibrated uncertainty estimation. 2. **Technical Innovations**: - Using Graph Neural Networks (GNN) to model the scoring rubric and defining random embeddings on the graph to quantify prediction uncertainty. - Designing a training scheme under the variational information bottleneck framework. 3. **Experimental Validation**: Conducting experiments on multiple public AQA datasets (such as FineDiving, MTL-AQA, and JIGSAWS), showing that RICA2 significantly outperforms existing methods in both prediction accuracy and uncertainty calibration. ### Experimental Results - **FineDiving Dataset**: RICA2 improves the Spearman Rank Correlation Coefficient (SRCC) and relative L2 distance (Rℓ2) by 0.7% and 1.4%, respectively, compared to the latest method TPT, and by 4.9% and 29.8%, respectively, compared to non-example methods (such as MUSDL and DAE-MT). - **MTL-AQA Dataset**: RICA2 also significantly outperforms existing methods in SRCC and Kendall Tau, particularly excelling in uncertainty calibration. ### Conclusion By integrating the scoring rubric and quantifying prediction uncertainty, RICA2 significantly improves the accuracy and reliability of action quality assessment, providing strong support for decision-making in high-risk scenarios.

RICA2: Rubric-Informed, Calibrated Assessment of Actions

Uncertainty-Driven Action Quality Assessment

Hierarchical NeuroSymbolic Approach for Comprehensive and Explainable Action Quality Assessment

Uncertainty-Aware Score Distribution Learning for Action Quality Assessment

Assessing action quality with semantic-sequence performance regression and densely distributed sample weighting

Interpretable Long-term Action Quality Assessment

GAIA: Rethinking Action Quality Assessment for AI-Generated Videos

MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment

Dual-referenced assistive network for action quality assessment

Automated Essay Scoring and the Deep Learning Black Box: How Are Rubric Scores Determined?

Re-IQA: Unsupervised Learning for Image Quality Assessment in the Wild

Multimodal Action Quality Assessment

AM-BQA: Enhancing blind image quality assessment using attention retractable features and multi-dimensional learning

Automatic Modelling for Interactive Action Assessment

FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment

Stratified Rule-Aware Network for Abstract Visual Reasoning

Group-aware Contrastive Regression for Action Quality Assessment

Procedure-Aware Action Quality Assessment: Datasets and Performance Evaluation

End-to-end Action Quality Assessment with Action Parsing Transformer

CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment

ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints