A Holistic Evaluation of Piano Sound Quality

Monan Zhou, Shangda Wu, Shaohua Ji, Zijin Li, Wei Li
2023-10-07
Abstract: This paper aims to develop a holistic evaluation method for piano sound quality to assist in purchasing decisions. Unlike previous studies that focused on the effect of piano performance techniques on sound quality, this study evaluates the inherent sound quality of different pianos. To derive quality evaluation systems, the study uses subjective questionnaires based on a piano sound quality dataset. The method selects the optimal piano classification model by comparing the fine-tuning results of different pre-trained Convolutional Neural Network (CNN) backbones. To improve the interpretability of the models, the study applies Equivalent Rectangular Bandwidth (ERB) analysis. The results reveal that musically trained individuals are better able to distinguish sound quality differences between pianos. The best fine-tuned pre-trained CNN backbone achieves an accuracy of 98.3% as the piano classifier. However, the dataset is limited, and the audio is sliced to increase its quantity, resulting in a lack of diversity and balance, so we use focal loss to reduce the impact of data imbalance. To optimize the method, future research will expand the dataset or employ few-shot learning techniques.
Sound, Artificial Intelligence, Audio and Speech Processing
What problem does this paper attempt to address?
The paper aims to address the issue of piano sound quality evaluation to help beginners make more informed decisions when purchasing a piano. Unlike previous studies that mainly focused on the impact of playing techniques on sound quality, this paper focuses on evaluating the inherent sound quality of different pianos. Specifically, the study achieves its goals through the following methods:

1. **Subjective Questionnaire Survey**: Conducting a questionnaire survey among individuals with a musical background to obtain sound quality ratings for different pianos.
2. **ERB Analysis**: Using Equivalent Rectangular Bandwidth (ERB) analysis to explain the piano timbre classification task from the perspective of auditory perception.
3. **Comparative Experiments**: Comparing different pre-trained Convolutional Neural Network (CNN) models to select the most suitable model for piano sound quality classification.

The research results show that the best fine-tuned pre-trained CNN model achieved an accuracy of up to 98.3% in the piano classification task. However, the dataset has certain limitations; audio slicing increased the data volume but reduced diversity and balance. Therefore, focal loss was used to mitigate the impact of data imbalance. Future research will expand the dataset or adopt few-shot learning techniques to further optimize the method.
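
To make the role of focal loss in handling class imbalance more concrete, here is a minimal sketch of the standard multi-class focal loss, FL(p_t) = -alpha * (1 - p_t)^gamma * log(p_t), for a single example. It is an illustration only, not the authors' implementation: the function name `focal_loss` and the defaults `gamma=2.0` and `alpha=1.0` are assumptions, since the summary does not state which settings the paper uses.

```python
import math


def focal_loss(logits, target, gamma=2.0, alpha=1.0):
    """Focal loss for one example (hypothetical helper, not from the paper).

    logits: list of raw class scores, one per piano class.
    target: index of the true class.
    gamma:  focusing parameter; gamma = 0 reduces to plain cross-entropy.
    alpha:  scalar weighting factor (assumed value, not from the paper).
    """
    # Numerically stable log-softmax for the true class.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    log_p_t = logits[target] - log_sum
    p_t = math.exp(log_p_t)

    # Down-weight well-classified examples: the factor shrinks as p_t -> 1.
    # The max() guards against a tiny negative value from float rounding.
    modulating = max(0.0, 1.0 - p_t) ** gamma
    return -alpha * modulating * log_p_t


if __name__ == "__main__":
    easy = [5.0, 0.0, 0.0]   # confident and correct for class 0
    hard = [0.0, 5.0, 0.0]   # confident but wrong for class 0
    print("easy example:", focal_loss(easy, 0))
    print("hard example:", focal_loss(hard, 0))
```

With `gamma=0.0` the function returns ordinary cross-entropy. Larger values of `gamma` shrink the loss on examples the classifier already gets right, so hard or under-represented examples contribute a larger share of the training signal.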