Application and Research of Music Generation System Based on CVAE and Transformer-XL in Video Background Music

Jun Min, Zhiwei Gao, Lei Wang
DOI: https://doi.org/10.1109/tii.2024.3477561
IF: 12.3
2024-01-01
IEEE Transactions on Industrial Informatics
Abstract: In algorithmic music generation, processing time-series data has consistently been a complex task. To improve the generation of long musical sequences, an insightful-unit-conditional variational autoencoder is proposed, which enhances the unit-conditional variational autoencoder with an improved attention mechanism. The model integrates Transformer-XL's recurrence mechanism and relative positional encoding at measure-level granularity. For practical applications, a scheme is presented that uses optical flow to extract motion features from video frames, quantifying motion rate and intensity. Furthermore, a dynamic correlation method is proposed to align video motion features with musical rhythm, guiding the model to generate melodies that match the video's rhythm.
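As a rough illustration of how motion intensity and rate might be quantified from optical flow, the following minimal sketch works on flow fields that are assumed to be already computed (for example by a dense optical-flow routine such as OpenCV's calcOpticalFlowFarneback). The abstract does not give the paper's definitions, so both are assumptions made here: intensity is taken as the mean flow-vector magnitude per frame pair, and rate as the absolute change in intensity between consecutive frame pairs. The function names `motion_intensity` and `motion_rate` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

# A flow field is an H x W grid of (dx, dy) displacement vectors, in pixels per frame.
Flow = Sequence[Sequence[Tuple[float, float]]]


def motion_intensity(flow: Flow) -> float:
    """Mean magnitude of the flow vectors in one field (assumed definition)."""
    magnitudes = [math.hypot(dx, dy) for row in flow for dx, dy in row]
    if not magnitudes:
        raise ValueError("flow field is empty")
    return sum(magnitudes) / len(magnitudes)


def motion_rate(intensities: Sequence[float]) -> List[float]:
    """Absolute change in intensity between consecutive fields (assumed definition)."""
    return [abs(b - a) for a, b in zip(intensities, intensities[1:])]


# Two tiny hand-made flow fields standing in for real optical-flow output.
flow_a: Flow = [[(3.0, 4.0), (0.0, 0.0)]]
flow_b: Flow = [[(0.0, 0.0), (0.0, 0.0)]]

intensities = [motion_intensity(f) for f in (flow_a, flow_b)]
rates = motion_rate(intensities)
```

Per-frame values like these could then be compared against beat or note-density features of the music; how the paper's dynamic correlation method actually does this is not stated in the abstract.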
What problem does this paper attempt to address?