Quranic Audio Dataset: Crowdsourced and Labeled Recitation from Non-Arabic Speakers

Raghad Salameh, Mohamad Al Mdfaa, Nursultan Askarbekuly, Manuel Mazzara
2024-05-04
Abstract: This paper addresses the challenge of learning to recite the Quran for non-Arabic speakers. We explore the possibility of crowdsourcing a carefully annotated Quranic dataset, on top of which AI models can be built to simplify the learning process. In particular, we use the volunteer-based crowdsourcing genre and implement a crowdsourcing API to gather audio assets. We integrated the API into an existing mobile application called NamazApp to collect audio recitations. We developed a crowdsourcing platform called Quran Voice for annotating the gathered audio assets. As a result, we have collected around 7000 Quranic recitations from a pool of 1287 participants across more than 11 non-Arabic countries, and we have annotated 1166 recitations from the dataset in six categories. We have achieved a crowd accuracy of 0.77, an inter-rater agreement of 0.63 between the annotators, and 0.89 between the labels assigned by the algorithm and the expert judgments.
Sound, Artificial Intelligence, Audio and Speech Processing
What problem does this paper attempt to address?
The paper aims to provide a crowdsourced, annotated Quranic audio dataset of recitations by non-Arabic speakers, to support tools that make learning to recite the Quran correctly easier. Specifically, the research tests two hypotheses:

1. **Whether a Quran recitation audio dataset can be crowdsourced from beginners through an application**: By integrating a crowdsourcing API into an existing mobile application, NamazApp, the researchers collected recitation audio from beginners in multiple countries.
2. **Whether the collected data can be annotated through a specialized crowdsourcing tool**: The researchers developed a platform named "Quran Voice" for pre-processing and annotating the collected audio.

The research also explores two related questions:

- Are beginners willing to share recordings of their Quran recitation?
- Are proficient reciters willing to participate in annotating audio recordings?

To achieve these goals, the researchers adopted a multi-step approach:

1. **Constructing a dataset**: Collect Quran recitation data through the NamazApp integration.
2. **Developing a crowdsourcing tool**: Create the "Quran Voice" platform for pre-processing and annotating the collected audio.
3. **Quality control**: Conduct manual quality control so that the data is reliable enough to train machine-learning models.

As a result, the researchers collected approximately 7,000 Quran recitations from 1,287 participants in more than 11 non-Arabic countries and annotated 1,166 of them in six categories. The annotation achieved a crowd accuracy of 0.77 and an inter-rater agreement of 0.63 between annotators; the agreement between the labels assigned by the algorithm and expert judgments reached 0.89.
Through these efforts, the researchers hope to fill a gap in publicly available Quranic audio datasets and to explore how AI technology can simplify learning to recite the Quran.
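To make the reported quality measures more concrete, the sketch below shows one common way such figures can be computed. It is only an illustration under stated assumptions: this summary does not say how the paper aggregates votes or which agreement statistic it uses, so majority voting and Cohen's kappa are assumed here. The function names, the two placeholder labels (`"correct"`, `"mistake"`), and the toy data are all hypothetical; the paper's six annotation categories are not listed in this summary.

```python
from collections import Counter


def majority_vote(votes):
    """Return the most common label among one recitation's votes.

    Ties resolve to the label that was seen first in the list.
    """
    return Counter(votes).most_common(1)[0][0]


def crowd_accuracy(crowd_votes, expert_labels):
    """Fraction of items whose majority-vote label matches the expert label."""
    hits = sum(majority_vote(v) == e for v, e in zip(crowd_votes, expert_labels))
    return hits / len(expert_labels)


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement from each rater's marginal label frequencies.
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical toy data: four recitations, three crowd votes each.
crowd_votes = [
    ["correct", "correct", "mistake"],
    ["mistake", "mistake", "correct"],
    ["correct", "mistake", "mistake"],
    ["correct", "correct", "correct"],
]
expert_labels = ["correct", "mistake", "correct", "correct"]

# Aggregated crowd labels, compared against the expert labels.
aggregated = [majority_vote(v) for v in crowd_votes]

accuracy = crowd_accuracy(crowd_votes, expert_labels)
kappa = cohen_kappa(aggregated, expert_labels)
print(f"crowd accuracy: {accuracy:.2f}, kappa vs. expert: {kappa:.2f}")
```

With more than two annotators per item, a multi-rater statistic such as Fleiss' kappa or Krippendorff's alpha may be a better fit than the pairwise Cohen's kappa used in this sketch.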