A Unified Representation Framework for the Evaluation of Optical Music Recognition Systems

Pau Torras,Sanket Biswas,Alicia Fornés
DOI: https://doi.org/10.1007/s10032-024-00485-8
2024-09-06
Abstract:Modern-day Optical Music Recognition (OMR) is a fairly fragmented field. Most OMR approaches use datasets that are independent and incompatible between each other, making it difficult to both combine them and compare recognition systems built upon them. In this paper we identify the need of a common music representation language and propose the Music Tree Notation (MTN) format, with the idea to construct a common endpoint for OMR research that allows coordination, reuse of technology and fair evaluation of community efforts. This format represents music as a set of primitives that group together into higher-abstraction nodes, a compromise between the expression of fully graph-based and sequential notation formats. We have also developed a specific set of OMR metrics and a typeset score dataset as a proof of concept of this idea.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?