A Survey of AI-Generated Video Evaluation

Xiao Liu,Xinhao Xiang,Zizhong Li,Yongheng Wang,Zhuoheng Li,Zhuosheng Liu,Weidi Zhang,Weiqi Ye,Jiawei Zhang
2024-10-25
Abstract:The growing capabilities of AI in generating video content have brought forward significant challenges in effectively evaluating these videos. Unlike static images or text, video content involves complex spatial and temporal dynamics which may require a more comprehensive and systematic evaluation of its contents in aspects like video presentation quality, semantic information delivery, alignment with human intentions, and the virtual-reality consistency with our physical world. This survey identifies the emerging field of AI-Generated Video Evaluation (AIGVE), highlighting the importance of assessing how well AI-generated videos align with human perception and meet specific instructions. We provide a structured analysis of existing methodologies that could be potentially used to evaluate AI-generated videos. By outlining the strengths and gaps in current approaches, we advocate for the development of more robust and nuanced evaluation frameworks that can handle the complexities of video content, which include not only the conventional metric-based evaluations, but also the current human-involved evaluations, and the future model-centered evaluations. This survey aims to establish a foundational knowledge base for both researchers from academia and practitioners from the industry, facilitating the future advancement of evaluation methods for AI-generated video content.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper "A Review of AI-Generated Video Evaluation" aims to address the effective evaluation of AI-generated videos. Specifically, as AI's capabilities in generating video content continue to improve, effectively evaluating these videos has become a significant challenge. Unlike static images or text, video content includes complex spatial and temporal dynamics, requiring a more comprehensive and systematic evaluation of its content, including video performance quality, semantic information transmission, consistency with human intentions, and consistency with the virtual reality of the physical world. The main contributions of the paper include: 1. **Emphasizing Emerging Fields**: Proposing and emphasizing the importance of AI-Generated Video Evaluation (AIGVE) as a new research field. 2. **Comprehensive Review of Existing Evaluation Methods**: Systematically reviewing various evaluation methods related to AIGVE from multiple research fields, categorizing and analyzing these methods, and providing a structured overview of existing research. 3. **Guidance for Future Research Directions**: Identifying several potential areas for further research and development, including integrating evaluation frameworks with visual language models, enhancing the interpretability of evaluation scores, and addressing ethical and safety considerations of these frameworks. These guidelines aim to provide foundational resources for researchers and industry practitioners, promoting the development of more effective and comprehensive AI-generated video evaluation methods. Through these contributions, the paper hopes to establish a foundational knowledge base for academic researchers and industry practitioners, facilitating the development of future AI-generated video evaluation methods.