Analysis and Detection of Singing Techniques in Repertoires of J-POP Solo Singers

Yuya Yamamoto,Juhan Nam,Hiroko Terasawa
DOI: https://doi.org/10.48550/arXiv.2210.17367
2022-11-16
Abstract:In this paper, we focus on singing techniques within the scope of music information retrieval research. We investigate how singers use singing techniques using real-world recordings of famous solo singers in Japanese popular music songs (J-POP). First, we built a new dataset of singing techniques. The dataset consists of 168 commercial J-POP songs, and each song is annotated using various singing techniques with timestamps and vocal pitch contours. We also present descriptive statistics of singing techniques on the dataset to clarify what and how often singing techniques appear. We further explored the difficulty of the automatic detection of singing techniques using previously proposed machine learning techniques. In the detection, we also investigate the effectiveness of auxiliary information (i.e., pitch and distribution of label duration), not only providing the baseline. The best result achieves 40.4% at macro-average F-measure on nine-way multi-class detection. We provide the annotation of the dataset and its detail on the appendix website 0 .
Sound,Digital Libraries,Information Retrieval,Multimedia,Audio and Speech Processing
What problem does this paper attempt to address?
The problem this paper attempts to address is the analysis and detection of singing techniques used by Japanese pop music (J-POP) solo singers in their performances. Specifically, the researchers focus on the following aspects: 1. **Constructing a new dataset**: The researchers created a dataset containing 168 commercial J-POP songs, each annotated with timestamps of various singing techniques and vocal pitch contours. 2. **Descriptive statistical analysis**: Using descriptive statistical methods, the researchers analyzed the distribution of singing techniques in the dataset to identify which techniques appear most frequently and their usage frequency. 3. **Automatic detection of singing techniques**: The researchers explored the difficulty of automatically detecting singing techniques using machine learning techniques and evaluated the effectiveness of different methods. They not only provided baseline results but also studied the impact of auxiliary information (such as pitch and label duration distribution) on detection performance. 4. **Multi-class detection**: The researchers conducted a multi-class singing technique detection experiment, achieving the best result with a macro-average F-score of 40.4% in a nine-class detection task. Overall, this paper aims to systematically study and understand the singing techniques used by J-POP singers in their performances through dataset construction, descriptive statistical analysis, and automatic detection methods. This not only contributes to research in the field of music information retrieval but can also be applied to music discovery, vocal training, user-generated media, and other areas.