Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey

Dinh-Viet-Toan Le,Louis Bigo,Mikaela Keller,Dorien Herremans
2024-02-27
Abstract:Several adaptations of Transformers models have been developed in various domains since its breakthrough in Natural Language Processing (NLP). This trend has spread into the field of Music Information Retrieval (MIR), including studies processing music data. However, the practice of leveraging NLP tools for symbolic music data is not novel in MIR. Music has been frequently compared to language, as they share several similarities, including sequential representations of text and music. These analogies are also reflected through similar tasks in MIR and NLP. This survey reviews NLP methods applied to symbolic music generation and information retrieval studies following two axes. We first propose an overview of representations of symbolic music adapted from natural language sequential representations. Such representations are designed by considering the specificities of symbolic music. These representations are then processed by models. Such models, possibly originally developed for text and adapted for symbolic music, are trained on various tasks. We describe these models, in particular deep learning models, through different prisms, highlighting music-specialized mechanisms. We finally present a discussion surrounding the effective use of NLP tools for symbolic music data. This includes technical issues regarding NLP methods and fundamental differences between text and music, which may open several doors for further research into more effectively adapting NLP tools to symbolic MIR.
Information Retrieval,Artificial Intelligence,Sound,Audio and Speech Processing
What problem does this paper attempt to address?
The problems that this paper attempts to solve are: How to effectively apply natural language processing (NLP) methods to symbolic music generation and information retrieval. Specifically, the paper explores the following points: 1. **Representation methods of symbolic music**: - The author elaborates in detail on how to convert symbolic music (such as music scores) into sequence representations suitable for computational processing. This includes time - slice - based tokenization strategies and event - based tokenization strategies. For example, time - slice - based tokenization can represent music as a sequence of notes with fixed time intervals, while event - based tokenization triggers tokenization according to specific events (such as the start or end of a note). 2. **Model selection and adaptation**: - The paper discusses the application of different types of deep - learning models in processing symbolic music, including recurrent neural networks (RNN), attention - mechanism models (such as Transformer). These models were originally designed for text, but through appropriate adjustments, they can also be used to process symbolic music data. For example, the Transformer model can capture the multi - dimensional characteristics of music through a multi - modal structure when processing symbolic music. 3. **Technical challenges and future directions**: - The paper points out the technical challenges faced when using NLP methods to process symbolic music, such as data availability, structural differences between symbolic music and natural language, etc. In addition, the author also looks forward to future research directions, including developing more lightweight models, improving the interpretability of models, and exploring more new models suitable for symbolic music information retrieval. ### Summary of main problems - **How to represent symbolic music as a sequence suitable for computational processing?** - **Which deep - learning models can be effectively applied to symbolic music generation and information retrieval?** - **What technical challenges are faced when using NLP methods to process symbolic music?** - **In which directions should future research develop to better meet the needs of symbolic music processing?** Through the exploration of these problems, the paper aims to promote the application of NLP methods in the field of symbolic music and provide valuable references for related research.