FilmAgent: Automating Virtual Film Production Through a Multi-Agent Collaborative Framework

Zhenran Xu,Jifang Wang,Longyue Wang,Zhouyi Li,Senbao Shi,Baotian Hu,Min Zhang
DOI: https://doi.org/10.1145/3681758.3698014
2024-01-01
Abstract:Virtual film production requires intricate decision-making processes, including scriptwriting, virtual cinematography, and precise actor positioning and actions. Remarkable progress in automated decision-making have utilized agent societies powered by large language models (LLMs). This paper introduces FilmAgent, a novel LLM-based multi-agent collaborative framework designed to automate and streamline the film production process. FilmAgent simulates key crew roles—directors, screenwriters, actors, and cinematographers—within a sandbox environment, integrating efficient human workflows. The process is divided into three stages: planning, scriptwriting, and cinematography. Each stage engages a team of film crews providing iterative feedback, thus verifying intermediate results and reducing errors. Our evaluation of generated videos reveals that collaborative FilmAgent significantly outperforms individual efforts in line consistency, script coherence, character actions, and camera settings. Further analysis highlights the importance of feedback and verification in reducing hallucinations, enhancing script quality, and improving camera choices. We hope that this project lays the groundwork and shows the potential of integrating LLMs into creative multimedia tasks1.
What problem does this paper attempt to address?