MELONS: generating melody with long-term structure using transformers and structure graph

Yi Zou, Pei Zou, Yi Zhao, Kaixiang Zhang, Ran Zhang, Xiaorui Wang
DOI: https://doi.org/10.48550/arXiv.2110.05020
2021-11-03
Abstract: The creation of long melody sequences requires effective expression of coherent musical structure. However, there is no clear representation of musical structure. Recent works on music generation have suggested various approaches to deal with the structural information of music, but generating a full-song melody with clear long-term structure remains a challenge. In this paper, we propose MELONS, a melody generation framework based on a graph representation of music structure, which consists of eight types of bar-level relations. MELONS adopts a multi-step generation method with transformer-based networks by factoring melody generation into two sub-problems: structure generation and structure-conditional melody generation. Experimental results show that MELONS can produce structured melodies with high quality and rich content.
Sound, Multimedia, Audio and Speech Processing