Abstract: While the musical instrument classification task is well studied, there remains a gap in identifying non-pitched percussion instruments, which overlap more in frequency bands and vary more in sound quality and play style than pitched instruments. In this paper, we present a musical instrument classifier for detecting tambourines, maracas, and castanets, instruments that are often used in early childhood music education. We generated a dataset with diverse instruments (e.g., brand, materials, construction) played in different locations with varying background noise and play styles. We conducted sensitivity analyses to optimize feature selection, windowing time, and model selection. We deployed and evaluated our best model in a mixed reality music application with 12 families in a home setting. Our dataset comprised over 369,000 samples recorded in-lab and 35,361 samples recorded with families in a home setting. We observed the Light Gradient Boosting Machine (LGBM) model to perform best using a window of approximately 93 ms with only 12 mel-frequency cepstral coefficients (MFCCs) and signal entropy. Our best LGBM model achieved over 84% accuracy across all three instrument families in-lab and over 73% accuracy when deployed to the home. To our knowledge, the compiled dataset of over 369,000 samples of non-pitched instruments is the first of its kind. This work also suggests that a low-dimensional feature space is sufficient for the recognition of non-pitched instruments. Lastly, real-world deployment and testing of the algorithms with participants of diverse physical and cognitive abilities is an important contribution towards more inclusive design practices. This paper lays the technological groundwork for a mixed reality music application that can detect children’s use of non-pitched percussion instruments to support early childhood music education and play.
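To make the windowing and entropy feature concrete, the following is a minimal sketch and not the authors' implementation. It assumes a 44.1 kHz sample rate and a 4096-sample window (about 92.9 ms, consistent with the reported window of roughly 93 ms); neither value is stated in the abstract. "Signal entropy" is interpreted here as the Shannon entropy of a frame's amplitude histogram, which is only one possible definition. The 12 MFCCs are not computed in this sketch; they would typically come from an audio library and be concatenated with the entropy value to give a 13-dimensional feature vector per window.

```python
import math
import random

SAMPLE_RATE = 44_100    # assumed; the abstract does not state the sample rate
WINDOW_SAMPLES = 4096   # assumed; 4096 / 44100 s is roughly 92.9 ms


def window_ms(n_samples: int, sample_rate: int) -> float:
    """Duration of an n_samples window in milliseconds."""
    return 1000.0 * n_samples / sample_rate


def frames(signal, size):
    """Split a signal into non-overlapping frames, dropping any short tail."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, size)]


def amplitude_entropy(frame, n_bins=32):
    """Shannon entropy (bits) of the frame's amplitude histogram over [-1, 1].

    Hypothetical stand-in for the paper's "signal entropy" feature.
    """
    counts = [0] * n_bins
    for x in frame:
        x = max(-1.0, min(1.0, x))  # clip to the expected amplitude range
        idx = min(int((x + 1.0) / 2.0 * n_bins), n_bins - 1)
        counts[idx] += 1
    total = len(frame)
    return -sum((c / total) * math.log2(c / total) for c in counts if c)


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic stand-in for a recording: one second of uniform noise.
    noise = [rng.uniform(-1.0, 1.0) for _ in range(SAMPLE_RATE)]
    print(f"window length: {window_ms(WINDOW_SAMPLES, SAMPLE_RATE):.1f} ms")
    for i, frame in enumerate(frames(noise, WINDOW_SAMPLES)[:3]):
        print(f"frame {i}: entropy = {amplitude_entropy(frame):.3f} bits")
```

Under this definition a silent frame has zero entropy and a noisy, shaker-like frame approaches the maximum of log2(32) = 5 bits, which is the kind of contrast a low-dimensional feature can offer a gradient-boosted classifier.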

A dataset and classification model for Malay, Hindi, Tamil and Chinese music

Human-centric Music Medical Therapy Exploration System

A New Fuzzy Classifier For Music Emotion Based On Conditional Probability

A Dataset for Learning Stylistic and Cultural Correlations Between Music and Videos

KritiSamhita: A machine learning dataset of South Indian classical music audio clips with tonic classification

Musical Instrument Classification via Low-Dimensional Feature Vectors

ChMusic: A Traditional Chinese Music Dataset for Evaluation of Instrument Recognition

MusicTM-Dataset for Joint Representation Learning among Sheet Music, Lyrics, and Musical Audio

Understanding and Classifying Cultural Music Using Melodic Features Case Of Hindustani, Carnatic And Turkish Music

A MDCT Domain Feature-Based Approach in Classifying MP3 Song

Musical instrument classifier for early childhood percussion instruments

dMelodies: A Music Dataset for Disentanglement Learning

A dataset for multimodal music information retrieval of Sotho-Tswana musical videos

A Music Classification Model based on Metric Learning and Feature Extraction from MP3 Audio Files

Notation of Javanese Gamelan dataset for traditional music applications

Deep Neural Network for Musical Instrument Recognition using MFCCs

Acoustical feature analysis and optimization for aesthetic recognition of Chinese traditional music

Foundation Models for Music: A Survey

New Approach to Classification of Chinese Folk Music Based on Extension of HMM

Exploring modality-agnostic representations for music classification

Ensemble Model-Based Singer Classification with Proposed Vocal Segmentation