HoloSinger: Semantics and Music Driven Motion Generation with Octahedral Holographic Projection

Zeyu Jin, Zixuan Wang, Qixin Wang, Jia, Ye Bai, Yi Zhao, Hao Li, Xiaorui Wang
DOI: https://doi.org/10.1145/3581783.3612674
2023-01-01
Abstract: Lyrics and music are both significant for a singer performing a song. Singing motion generation should therefore model the semantic and acoustic correlations with motion at the same time. In this paper, we propose HoloSinger, a novel comprehensive system that synthesizes singing motions for a given song. Additionally, we present a singing avatar with octahedral holographic projection. For singing motion generation, we introduce a Transformer-VAE generative model that decomposes lyrics and music, then fuses their impacts to synthesize the singer's motions. Extensive experiments and user studies show that our method automatically generates realistic motions that adhere to musical choreography and reflect the lyric semantics appropriately. Furthermore, we design a desktop-level holographic projection device with an octahedral structure. It achieves high-definition holographic projection effects with a smaller volume, a larger imaging area ratio, and support for real-time AI interaction.