Piano Skills Assessment

Paritosh Parmar, Jaiden Reddy, Brendan Morris
DOI: https://doi.org/10.48550/arXiv.2101.04884
2021-06-21
Abstract: Can a computer determine a piano player's skill level? Is it preferable to base this assessment on visual analysis of the player's performance, or should we trust our ears over our eyes? Since current CNNs have difficulty processing long videos, how can shorter clips be sampled to best reflect the player's skill level? In this work, we collect and release a first-of-its-kind dataset for multimodal skill assessment focused on assessing a piano player's skill level, answer the questions above, initiate work on automated evaluation of piano playing skills, and provide baselines for future work. The dataset is available from: https://github.com/ParitoshParmar/Piano-Skills-Assessment
Computer Vision and Pattern Recognition, Machine Learning, Multimedia, Sound, Audio and Speech Processing