Multitask Learning for Polyphonic Piano Transcription, a Case Study

Rainer Kelz,Sebastian Böck,Gerhard Widmer
DOI: https://doi.org/10.48550/arXiv.1902.04390
2019-02-12
Abstract:Viewing polyphonic piano transcription as a multitask learning problem, where we need to simultaneously predict onsets, intermediate frames and offsets of notes, we investigate the performance impact of additional prediction targets, using a variety of suitable convolutional neural network architectures. We quantify performance differences of additional objectives on the large MAESTRO dataset.
Sound,Audio and Speech Processing
What problem does this paper attempt to address?