The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing

Dawit Mureja Argaw, Fabian Caba Heilbron, Joon-Young Lee, Markus Woodson, In So Kweon
DOI: https://doi.org/10.48550/arXiv.2207.09812
2022-07-21
Abstract: Machine learning is transforming the video editing industry. Recent advances in computer vision have leveled-up video editing tasks such as intelligent reframing, rotoscoping, color grading, or applying digital makeups. However, most of the solutions have focused on video manipulation and VFX. This work introduces the Anatomy of Video Editing, a dataset, and benchmark, to foster research in AI-assisted video editing. Our benchmark suite focuses on video editing tasks, beyond visual effects, such as automatic footage organization and assisted video assembling. To enable research on these fronts, we annotate more than 1.5M tags, with relevant concepts to cinematography, from 196,176 shots sampled from movie scenes. We establish competitive baseline methods and detailed analyses for each of the tasks. We hope our work sparks innovative research towards underexplored areas of AI-assisted video editing.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper aims to advance research on AI-assisted video editing. It observes that most existing solutions focus on video manipulation and visual effects (VFX), while other important editing tasks, such as automatic footage organization and assisted video assembly, have received little attention. To fill this gap, the authors introduce the Anatomy of Video Editing (AVE) dataset and benchmark suite. The dataset contains more than 1.5 million cinematography-related tags annotated on 196,176 shots sampled from movie scenes. On top of this data, the benchmark defines several tasks for evaluating AI in video editing: automatic footage organization, shot-property classification, shot-sequence ordering, next-shot selection, and missing-shot-property prediction. Together, the dataset and tasks give researchers and developers a common platform for building and comparing AI-assisted video editing methods.