Abstract: A music piece is comprehended both hierarchically, from sonic events to melodies, and sequentially, in the form of repetition and variation. Music from different cultures establishes different aesthetics through different stylistic conventions on these two aspects. We propose a framework that can be used to quantitatively compare music from different cultures along these two aspects. The framework is based on a Music Information Dynamics model, the Variable Markov Oracle (VMO), and is extended with variational representation learning of audio. A variational autoencoder (VAE) is trained to map audio fragments into a latent representation, which is then fed into a VMO. The VMO learns a clustering of the latent representation via a threshold that maximizes the information rate of the quantized latent representation sequence. This threshold effectively controls the sensibility of the predictive step to acoustic changes, which determines the framework's ability to track repetitions on longer time scales. This approach allows the overall information content of a musical signal to be characterized at each level of acoustic sensibility. Our findings under this framework show that sensibility to subtle acoustic changes is higher for East-Asian musical traditions, while Western works exhibit longer motivic structures at higher thresholds of difference in the latent space. This suggests that a profile of information content, analyzed as a function of the level of acoustic detail, can serve as a possible cultural characteristic.
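
The threshold search described in the abstract can be illustrated with a small sketch. This is not the VMO algorithm itself: the VAE latents are replaced by synthetic vectors, the oracle is replaced by a greedy distance-threshold clustering, and the information rate is approximated by a simple longest-previous-match code length. The function names `quantize`, `information_rate`, and `ir_profile`, and the `2 * log2(n)` pointer cost, are assumptions made for illustration only.

```python
import math
import numpy as np


def quantize(latents, threshold):
    """Greedy clustering (assumed stand-in for the VMO's symbolization):
    a frame joins the nearest existing cluster if it lies within
    `threshold`, otherwise it founds a new cluster."""
    centers, symbols = [], []
    for z in latents:
        if centers:
            dists = np.linalg.norm(np.asarray(centers) - z, axis=1)
            k = int(np.argmin(dists))
            if dists[k] <= threshold:
                symbols.append(k)
                continue
        centers.append(z)
        symbols.append(len(centers) - 1)
    return symbols


def longest_previous_match(s, i):
    """Length of the longest run starting at i that also starts at some j < i."""
    best = 0
    for j in range(i):
        run = 0
        while i + run < len(s) and s[j + run] == s[i + run]:
            run += 1
        best = max(best, run)
    return best


def information_rate(symbols):
    """Bits per frame saved by coding repeats as pointers into the past
    (a rough proxy, not the VMO's oracle-based estimate)."""
    s = list(symbols)
    n = len(s)
    h0 = math.log2(len(set(s)))       # cost of one symbol without context
    pointer_bits = 2 * math.log2(n)   # assumed cost of a (position, length) pair
    total, i = 0.0, 0
    while i < n:
        run = longest_previous_match(s, i)
        if run == 0:
            total += h0               # unseen symbol: coded literally
            i += 1
        else:
            total += min(run * h0, pointer_bits)
            i += run
    return h0 - total / n


def ir_profile(latents, thresholds):
    """Information-rate proxy at each level of acoustic detail."""
    return [information_rate(quantize(latents, t)) for t in thresholds]


# Synthetic "latents": an 8-frame motif repeated 10 times with slight noise.
rng = np.random.default_rng(0)
motif = np.stack([np.arange(8.0), np.zeros(8)], axis=1)
latents = np.tile(motif, (10, 1)) + rng.normal(scale=0.01, size=(80, 2))

thresholds = [1e-6, 0.3, 100.0]
profile = ir_profile(latents, thresholds)
best = thresholds[int(np.argmax(profile))]
print(dict(zip(thresholds, profile)), "best threshold:", best)
```

In this toy setting, a threshold that is too small treats every frame as a new symbol, and one that is too large merges everything into a single symbol; in both cases no repetition structure can be exploited. The resulting curve over thresholds plays the role of the information-content profile that the abstract proposes as a cultural characteristic.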

Towards Cross-Cultural Analysis using Music Information Dynamics
