Literature survey of multi-track music generation model based on generative confrontation network in intelligent composition

Weiming Liu
DOI: https://doi.org/10.1007/s11227-022-04914-5
2022-11-10
Abstract:The production of traditional music is too complicated, consuming a lot of financial and human resources. Therefore, this paper aims to use artificial intelligence (AI) for songwriting and to explore the development and application of the Generative Adversarial Network (GAN) in smart music. An improved GAN-based Multi-Track Music (MTM)-GAN is established. The model is validated with the generation of 5 different music tracks for bass, drums, guitar, piano, and strings. The verification results are compared with the music generated by the existing Multi-Track Sequential GAN (MuseGAN) index evaluation method. The results show that many music clips generated by the MTM-GAN model are smooth and have a certain artistic aesthetic effect. Through the comparison of the two convergence curves of MuseGAN and MTM-GAN, when the penalty term is increased, the MTM-GAN of Consistency Term (CT) converges faster, and the training process is more stable. The numerical space of the parameter distribution obtained by the MTM-GAN-based music segment test is significantly smaller than that of MuseGAN. The probability of MTM-GAN overfitting is small. 62.8% of music listeners cannot distinguish the generated melody from the real melody. Therefore, the proposed model has the advantages of a more stable, more realistic, and faster fitting speed in music generation, indicating that the music generation method is effective.
computer science, theory & methods,engineering, electrical & electronic, hardware & architecture
What problem does this paper attempt to address?