Advancing Handwritten Musical Notation Recognition Using Deep Learning: A Convolutional Neural Network-Based Approach with Improved Accuracy

Ee Hern Kheng,Chia Pao Liew,Tianhao Lan,Kim Geok Tan
DOI: https://doi.org/10.1142/s0218001424520074
IF: 1.261
2024-04-05
International Journal of Pattern Recognition and Artificial Intelligence
Abstract:International Journal of Pattern Recognition and Artificial Intelligence, Ahead of Print. The use of computers to read musical scores is referred to as optical music recognition (OMR). The recent advancements in artificial intelligence and big data have led to the development of deep learning approaches for recognizing musical notes. Previous research has shown that there is a lot of room for improvement in handwritten musical notation recognition systems due to differences in writing styles and the complex structure of musical symbols. The research described here aims to develop a deep learning-based system for recognizing handwritten musical notation. The system uses a convolutional neural network (CNN) to extract and learn pixel features of musical symbols and achieve a recognition accuracy of over 90%. The CNN model was trained using image samples from the HOMUS dataset and fine-tuned to minimize the loss function and reduce classification errors. The CNN model achieved an accuracy of 96.95% on the test samples, which is a significant improvement over the 86.0% accuracy from previous studies. The performance of the CNN model was also compared to five state-of-the-art deep learning methods, namely, quantum gray Wolf optimization (QGWO) algorithm, nonfully connected network (NFC-Net) classifier, nearest neighbor classifier, data augmentation and ensemble learning, and the CNN model outperformed four of them. However, the CNN model occasionally misclassified musical symbols with similar shapes, indicating that there is still room for improvement in the system's performance. Future research could focus on improving the model's performance on similar-shaped symbols. Overall, the research demonstrates the effectiveness of using a CNN model for handwritten musical notation recognition and highlights the potential of deep learning approaches in this area.
computer science, artificial intelligence
What problem does this paper attempt to address?