Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Ye Bai,Haonan Chen,Jitong Chen,Zhuo Chen,Yi Deng,Xiaohong Dong,Lamtharn Hantrakul,Weituo Hao,Qingqing Huang,Zhongyi Huang,Dongya Jia,Feihu La,Duc Le,Bochen Li,Chumin Li,Hui Li,Xingxing Li,Shouda Liu,Wei-Tsung Lu,Yiqing Lu,Andrew Shaw,Janne Spijkervet,Yakun Sun,Bo Wang,Ju-Chiang Wang,Yuping Wang,Yuxuan Wang,Ling Xu,Yifeng Yang,Chao Yao,Shuo Zhang,Yang Zhang,Yilin Zhang,Hang Zhao,Ziyi Zhao,Dejian Zhong,Shicen Zhou,Pei Zou
2024-09-19
Abstract:We introduce Seed-Music, a suite of music generation systems capable of producing high-quality music with fine-grained style control. Our unified framework leverages both auto-regressive language modeling and diffusion approaches to support two key music creation workflows: controlled music generation and post-production editing. For controlled music generation, our system enables vocal music generation with performance controls from multi-modal inputs, including style descriptions, audio references, musical scores, and voice prompts. For post-production editing, it offers interactive tools for editing lyrics and vocal melodies directly in the generated audio.
We encourage readers to listen to demo audio examples at <a class="link-external link-https" href="https://team.doubao.com/seed-music" rel="external noopener nofollow">this https URL</a> "<a class="link-external link-https" href='https://team.doubao.com/seed-music"' rel="external noopener nofollow">this https URL</a>.
Sound,Audio and Speech Processing