Oktoechos classification in liturgical music using self attention based-stacked bi-directional networks

A, Noumida,Raj T.V., Hridya
DOI: https://doi.org/10.1007/s11042-024-19706-7
IF: 2.577
2024-06-25
Multimedia Tools and Applications
Abstract:A characteristic aspect of the Syrian tradition's musical repertoire is classifying melodies into eight tunes, called 'oktoec̄hos'. It had an impact on a lot of traditions, including the liturgical music of India and Greece. In oktoec̄hos tradition, liturgical hymns are sung in eight modes or eight colours (referred to as eight "niram", regionally). In this paper, the automatic oktoec̄hos genre classification is addressed using musical texture features (MTF), i-vectors and Mel-spectrograms through self-attention-based stacked bidirectional and unidirectional long-short term memory (SA-SBU-LSTM) and Gated recurrent units (SA-SBU-GRU) architectures. Musical features include timbral and rhythmic features. The complex music structural pattern is learned using musical texture features. The performance of the proposed approaches is evaluated using a newly created corpus of liturgical music in Malayalam. SA-SBU-LSTM and SA-SBU-GRU frameworks report average classification accuracy of 80% and 84%, with a significant margin over other frameworks. The experiments demonstrate the potential of stacked architectures with an attention mechanism.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?