Melody Generation from Lyrics Using Three Branch Conditional LSTM-GAN
Abhishek Srivastava, Wei Duan, Rajiv Ratn Shah, Jianming Wu, Suhua Tang, Wei Li, Yi Yu
DOI: https://doi.org/10.1007/978-3-030-98358-1_45
2022-01-01
Abstract: With the availability of paired lyrics-melody datasets and advances in artificial intelligence techniques, research on melody generation conditioned on lyrics has become possible. In this work, we propose a novel architecture for melody generation, the Three Branch Conditional (TBC) LSTM-GAN conditioned on lyrics, which is composed of an LSTM-based generator and discriminator. The generator consists of three branches of identical and independent lyrics-conditioned LSTM-based sub-networks, each responsible for generating one attribute of a melody. For discrete-valued sequence generation, we leverage the Gumbel-Softmax technique to train GANs. Through extensive experiments, we show that our proposed model generates tuneful and plausible melodies from the given lyrics and outperforms the current state-of-the-art models both quantitatively and qualitatively.
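The Gumbel-Softmax technique mentioned in the abstract replaces a non-differentiable categorical sample with a smooth approximation, which is what allows a GAN to be trained on discrete-valued sequences. The following is a minimal standard-library sketch of that relaxation for a single discrete attribute. It is not the authors' implementation: the function name `gumbel_softmax`, the example logits, and the temperature value are assumptions chosen for illustration.

```python
import math
import random


def gumbel_softmax(logits, temperature=1.0, rng=random):
    """Return a relaxed one-hot sample over the categories in `logits`.

    Illustrative sketch only: `logits` stands in for the unnormalized scores
    a generator branch might emit for one discrete melody attribute.
    """
    noisy = []
    for logit in logits:
        # Draw u ~ Uniform(0, 1), clamped away from 0 and 1 so the logs are finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        # Gumbel(0, 1) noise via the inverse-CDF transform.
        g = -math.log(-math.log(u))
        noisy.append((logit + g) / temperature)

    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    peak = max(noisy)
    exps = [math.exp(x - peak) for x in noisy]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    # Hypothetical scores for an attribute with four possible values.
    sample = gumbel_softmax([2.0, 0.5, -1.0, 0.0], temperature=0.5)
    print(sample)
```

Lower temperatures push the output closer to a one-hot vector, while higher temperatures make it smoother. In an automatic-differentiation framework the same expression would remain differentiable with respect to the logits.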