Abstract: The rise of deep learning technologies has quickly advanced many fields, including generative music systems. A number of systems can generate short, musical-sounding snippets, yet these snippets often lack an overarching, longer-term structure. In this work, we propose CM-HRNN: a conditional melody generation model based on a hierarchical recurrent neural network. This model allows us to generate melodies with long-term structure based on given chord accompaniments. We also propose a novel and concise event-based representation to encode musical lead sheets while retaining each melody note's relative position within the bar with respect to the musical meter. With this new data representation, the proposed architecture is able to model both rhythmic and pitch structures simultaneously and effectively. Melodies generated by the proposed model were extensively evaluated in quantitative experiments as well as a user study to assess the musical quality and long-term structure of the output. We also compared the system with the state-of-the-art AttentionRNN [1]. The comparison shows that melodies generated by CM-HRNN contain more repeated patterns (i.e., a higher compression ratio) and exhibit lower tonal tension (i.e., are more tonally concise). Results from our listening test indicate that CM-HRNN outperforms AttentionRNN in terms of long-term structure and overall rating.

Conditioning a Recurrent Neural Network to synthesize musical instrument transients

Exploring Conditioning for Generative Music Systems with Human-Interpretable Controls

Explicitly Conditioned Melody Generation: A Case Study with Interdependent RNNs

A Recurrent Neural Network for Rhythmic Timing

Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning

Recurrent Neural Network Method for Automatic Generation of Music

Automatic Synthesis Technology of Music Teaching Melodies Based on Recurrent Neural Network

Hyper Recurrent Neural Network: Condition Mechanisms for Black-box Audio Effect Modeling

A Spiking Neural Network Model for Sound Recognition

SynthNet: Learning to Synthesize Music End-to-End

A Novel Method of Music Generation Based on Three Different Recurrent Neural Networks

Generating Sample-Based Musical Instruments Using Neural Audio Codec Language Models

Conditioning Deep Generative Raw Audio Models for Structured Automatic Music

Exploring how a generative AI interprets music

Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure

An Interactive Musical Prediction System with Mixture Density Recurrent Neural Networks

Neural Percussive Synthesis Parameterised by High-Level Timbral Features

Sine, Transient, Noise Neural Modeling of Piano Notes

Rethinking Recurrent Latent Variable Model for Music Composition

Algorithmic Composition of Melodies with Deep Recurrent Neural Networks

DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks