Applications and Advances of Artificial Intelligence in Music Generation: A Review

Yanxu Chen, Linshu Huang, Tian Gou
2024-09-03
Abstract: In recent years, artificial intelligence (AI) has made significant progress in the field of music generation, driving innovation in music creation and applications. This paper provides a systematic review of the latest research advancements in AI music generation, covering key technologies, models, datasets, evaluation methods, and their practical applications across various fields. The main contributions of this review include: (1) presenting a comprehensive summary framework that systematically categorizes and compares different technological approaches, including symbolic generation, audio generation, and hybrid models, helping readers better understand the full spectrum of technologies in the field; (2) offering an extensive survey of current literature, covering emerging topics such as multimodal datasets and emotion expression evaluation, providing a broad reference for related research; (3) conducting a detailed analysis of the practical impact of AI music generation in various application domains, particularly in real-time interaction and interdisciplinary applications, offering new perspectives and insights; (4) summarizing the challenges and limitations of existing music quality evaluation methods and proposing potential future research directions, aiming to promote the standardization and broader adoption of evaluation techniques. Through these summaries and analyses, this paper serves as a comprehensive reference for researchers and practitioners in AI music generation, while also outlining future directions for the field.
Sound, Artificial Intelligence, Audio and Speech Processing
What problem does this paper attempt to address?
This paper reviews the latest research advances and applications in AI music generation. Its main objectives are:

1. **Summary of Technical Framework**: Provide a comprehensive framework that systematically classifies and compares different technical approaches (symbolic generation, audio generation, and hybrid models), helping readers understand the overall landscape of the field.
2. **Literature Review**: Survey the current literature extensively, covering emerging topics such as multimodal datasets and emotional expression evaluation, to provide a broad reference for related research.
3. **Practical Application Analysis**: Analyze in detail the practical impact of AI music generation across application areas, especially real-time interaction and interdisciplinary applications.
4. **Challenges and Future Directions**: Summarize the challenges and limitations of existing music quality evaluation methods and propose potential future research directions, aiming to promote the standardization and broader adoption of evaluation techniques.

Through these summaries and analyses, the paper aims to serve as a comprehensive reference for researchers and practitioners in AI music generation and to outline future directions for the field.