Text-based LSTM networks for Automatic Music Composition

Keunwoo Choi, George Fazekas, Mark Sandler
DOI: https://doi.org/10.48550/arXiv.1604.05358
2016-04-19
Abstract: In this paper, we introduce new methods and discuss results of text-based LSTM (Long Short-Term Memory) networks for automatic music composition. The proposed network is designed to learn relationships within text documents that represent chord progressions and drum tracks in two case studies. In the experiments, word-RNNs (Recurrent Neural Networks) show good results for both cases, while character-based RNNs (char-RNNs) only succeed in learning chord progressions. The proposed system can be used for fully automatic composition or as a semi-automatic system that helps humans compose music by controlling a diversity parameter of the model.
Artificial Intelligence, Multimedia
What problem does this paper attempt to address?
This paper explores the application of text-based Long Short-Term Memory (LSTM) networks to automatic music composition. The authors introduce new methods and discuss experimental results for networks that learn relationships within text documents representing chord progressions and drum tracks. Two case studies test the approach: one generates jazz chord progressions, and the other generates rock drum tracks. The experiments show that word-level Recurrent Neural Networks (word-RNNs) perform well in both cases, while character-level Recurrent Neural Networks (char-RNNs) succeed only in learning chord progressions. The diversity of the output can be controlled by adjusting a diversity parameter of the model, so the system can be used for fully automatic composition or as a semi-automatic tool that assists human composers.
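To make the role of the diversity parameter concrete, the sketch below shows one common way such a parameter can act when sampling the next token from a trained network: the predicted distribution is re-weighted in log space before sampling. This is an illustrative assumption, not the authors' confirmed implementation. The function names `reweight` and `sample_token`, the chord labels, and the probability values are all hypothetical.

```python
import math
import random


def reweight(probs, diversity):
    """Rescale a probability distribution by a diversity (temperature-like) value.

    Assumption: diversity < 1 sharpens the distribution (safer, more repetitive
    output); diversity > 1 flattens it (more varied, riskier output).
    """
    if diversity <= 0:
        raise ValueError("diversity must be positive")
    # Divide log-probabilities by the diversity; the epsilon avoids log(0).
    logits = [math.log(p + 1e-12) / diversity for p in probs]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(vocab, probs, diversity, rng=random):
    """Draw one token from vocab after re-weighting probs by diversity."""
    weights = reweight(probs, diversity)
    return rng.choices(vocab, weights=weights, k=1)[0]


# Hypothetical word-level vocabulary and network output for the next chord.
vocab = ["C:maj", "A:min", "F:maj", "G:7"]
probs = [0.6, 0.2, 0.15, 0.05]

low = reweight(probs, 0.5)   # concentrates mass on the most likely chord
high = reweight(probs, 2.0)  # spreads mass toward less likely chords
next_chord = sample_token(vocab, probs, diversity=1.0)
```

In a semi-automatic workflow of the kind the summary describes, a composer could lower the value to get conventional continuations or raise it to get less predictable ones. A word-level model would use whole symbols such as the chord labels above as its vocabulary, whereas a character-level model would sample one character at a time.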