Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation

Zikai Huang,Xuemiao Xu,Cheng Xu,Huaidong Zhang,Chenxi Zheng,Jing Qin,Shengfeng He
2024-07-10
Abstract:Dance, as an art form, fundamentally hinges on the precise synchronization with musical beats. However, achieving aesthetically pleasing dance sequences from music is challenging, with existing methods often falling short in controllability and beat alignment. To address these shortcomings, this paper introduces Beat-It, a novel framework for beat-specific, key pose-guided dance generation. Unlike prior approaches, Beat-It uniquely integrates explicit beat awareness and key pose guidance, effectively resolving two main issues: the misalignment of generated dance motions with musical beats, and the inability to map key poses to specific beats, critical for practical choreography. Our approach disentangles beat conditions from music using a nearest beat distance representation and employs a hierarchical multi-condition fusion mechanism. This mechanism seamlessly integrates key poses, beats, and music features, mitigating condition conflicts and offering rich, multi-conditioned guidance for dance generation. Additionally, a specially designed beat alignment loss ensures the generated dance movements remain in sync with the designated beats. Extensive experiments confirm Beat-It's superiority over existing state-of-the-art methods in terms of beat alignment and motion controllability.
Graphics,Sound,Audio and Speech Processing
What problem does this paper attempt to address?
This paper aims to address the problem of synchronizing and controllably generating dance movements with music. Specifically, existing methods fall short in generating dance sequences that are precisely synchronized with music beats and controllable, especially in assigning specific key poses to specific music beats. To solve these issues, the paper proposes a new framework called **Beat-It**. ### Main Contributions 1. **Multi-Condition Dance Generation Framework**: Introduces a multi-condition dance generation framework that achieves beat synchronization and motion controllability. To the best of the authors' knowledge, this is the first attempt to achieve beat-controllable key pose-guided dance generation. 2. **Multi-Level Multi-Condition Fusion Mechanism**: Proposes a multi-level multi-condition fusion mechanism that effectively suppresses conflicts between different conditions and fully utilizes complementary information. 3. **Beat Representation and Alignment Loss**: Delves into beat attributes and represents them in the form of the nearest beat distance. Further, a new beat alignment loss is designed to provide explicit supervision signals, significantly enhancing the synchronization of generated dance movements with given beat conditions. 4. **Experimental Results**: Extensive experiments show that this method outperforms current state-of-the-art methods in terms of beat alignment and motion controllability. Additionally, the framework supports arbitrary beat specification and flexible key pose frame allocation. Through these contributions, the **Beat-It** framework not only generates dance movements that are highly synchronized with music beats but also provides higher generation controllability, making dance choreography more flexible and diverse.