Quranic Audio Dataset: Crowdsourced and Labeled Recitation from Non-Arabic Speakers

Salameh, Raghad; Mdfaa, Mohamad Al; Askarbekuly, Nursultan; Mazzara, Manuel

(Submitted on 4 May 2024)

Abstract: This paper addresses the challenge of learning to recite the Quran for non-Arabic speakers. We explore the possibility of crowdsourcing a carefully annotated Quranic dataset, on top of which AI models can be built to simplify the learning process. In particular, we use the volunteer-based crowdsourcing genre and implement a crowdsourcing API to gather audio assets. We integrated the API into an existing mobile application called NamazApp to collect audio recitations. We developed a crowdsourcing platform called Quran Voice for annotating the gathered audio assets. As a result, we have collected around 7000 Quranic recitations from a pool of 1287 participants across more than 11 non-Arabic countries, and we have annotated 1166 recitations from the dataset in six categories. We have achieved a crowd accuracy of 0.77, an inter-rater agreement of 0.63 between the annotators, and 0.89 between the labels assigned by the algorithm and the expert judgments.
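The three figures reported at the end of the abstract (crowd accuracy, inter-rater agreement, and agreement with expert judgments) can be illustrated with a small sketch. The abstract does not say which aggregation rule or agreement statistic the authors used, so the majority vote and Cohen's kappa below are assumptions chosen for illustration, and the two-label toy data is invented (the paper uses six categories). All function and variable names are hypothetical.

```python
from collections import Counter


def majority_vote(votes):
    """Return the most frequent label among one recitation's annotations."""
    return Counter(votes).most_common(1)[0][0]


def accuracy(predicted, reference):
    """Fraction of items where the predicted label equals the reference label."""
    matches = sum(p == r for p, r in zip(predicted, reference))
    return matches / len(reference)


def cohen_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters over the same items."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    labels = set(freq_a) | set(freq_b)
    # Agreement expected if both raters labeled independently at their own rates.
    expected = sum(freq_a[label] * freq_b[label] for label in labels) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Invented toy data: three annotators per recitation, two labels only.
crowd_votes = [
    ["correct", "correct", "mistake"],
    ["mistake", "mistake", "correct"],
    ["correct", "correct", "correct"],
    ["mistake", "correct", "correct"],
]
expert_labels = ["correct", "mistake", "correct", "mistake"]

aggregated = [majority_vote(v) for v in crowd_votes]
crowd_accuracy = accuracy(aggregated, expert_labels)

# Agreement between the first two annotators across all recitations.
rater_1 = [v[0] for v in crowd_votes]
rater_2 = [v[1] for v in crowd_votes]
kappa = cohen_kappa(rater_1, rater_2)

print(aggregated, crowd_accuracy, kappa)
```

With more than two annotators per item, a multi-rater statistic such as Fleiss' kappa or Krippendorff's alpha would be the more usual choice; pairwise Cohen's kappa is used here only because it is the simplest to show.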

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
Cite as: arXiv:2405.02675 [cs.SD]
(or arXiv:2405.02675v1 [cs.SD] for this version)

Submission history

From: Raghad Salameh
[v1] Sat, 4 May 2024 14:29:05 GMT (1902kb,D)
