What problem does this paper attempt to address?

The problem that this paper attempts to solve is: how to accurately analyze the movements in figure skating videos. Specifically, the paper focuses on constructing an algorithm model that can accurately recognize and classify various movements in figure skating competitions. ### Problem Background 1. **Challenge Background**: - This research is to solve the problems in the "1st SkatingVerse Challenge", which is affiliated with the 18th IEEE International Conference on Automatic Face and Gesture Recognition (FG). - The challenge provides a comprehensive dataset containing 1,687 continuous videos, covering 28 different figure - skating movement categories. - The dataset is divided into 19,993 training video segments and 8,586 test video segments. 2. **Objectives**: - Develop an algorithm that can accurately analyze the movements shown in each video. - Improve the recognition accuracy of figure - skating movements, thereby promoting research and technological progress in related fields. ### Solutions To achieve this goal, the author proposes a multi - step method: 1. **Pre - processing Stage**: - Use the DINO framework for Region of Interest (ROI) extraction and precisely crop the original video. - Extract video frames through FFmpeg and use the DINO framework to detect human bounding boxes in each frame. - Combine the bounding box results of all frames to generate the final detection box, and crop the video based on this. 2. **Model Structure**: - Use three different models (Unmasked Teacher, UniformerV2, and InfoGCN) to capture different aspects of the data. - Fine - tune these models to adapt to specific tasks. - Finally, improve the overall performance by integrating the logits of the model prediction results. 3. **Model Integration**: - Adopt two integration strategies: the voting method and the weighted aggregation method. - The integrated model scored 95.73% on the leaderboard, significantly outperforming the performance of a single model. ### Results Through the above methods, the author successfully improved the recognition accuracy of figure - skating movements, achieving a high leaderboard score, proving the effectiveness of this method. ### Formula Representation When evaluating the performance of the model, the following formula is used to calculate the average accuracy: \[ \text{Mean} = \frac{1}{l} \sum_{i = 1}^{l} \frac{M_i}{N_i} \] where: - \( l \) is the number of categories. - \( M_i \) is the number of correctly predicted samples in the \( i \)-th category. - \( N_i \) is the total number of samples in the \( i \)-th category. This formula ensures a fair evaluation of the accuracy of different categories.

1st Place Solution to the 1st SkatingVerse Challenge

1st Place Solutions for the UVO Challenge 2022

The 1st-place Solution for ECCV 2022 Multiple People Tracking in Group Dance Challenge

1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation

The SkatingVerse Workshop Challenge: Methods and Results

1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation

UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution

1st Place Solution for ICCV 2023 OmniObject3D Challenge: Sparse-View Reconstruction

1St Place Solution in Google Universal Images Embedding

Learning Semantics-Guided Representations for Scoring Figure Skating

First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024

1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation

First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024

1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation

First Place Solution to the Multiple-choice Video QA Track of The Second Perception Test Challenge

1st Place Solution for Waymo Open Dataset Challenge -- 3D Detection and Domain Adaptation

Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs

Towards Fine-grained Large Object Segmentation 1st Place Solution to 3D AI Challenge 2020 -- Instance Segmentation Track

1st Place Solutions for OpenImage2019 -- Object Detection and Instance Segmentation

Top-1 Solution of Multi-Moments in Time Challenge 2019

1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation