NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data

Tonneau, Manuel; de Castro, Pedro Vitor Quinta; Lasri, Karim; Farouq, Ibrahim; Subramanian, Lakshminarayanan; Orozco-Olvera, Victor; Fraiberger, Samuel

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2403

Change to browse by:

Computer Science > Computation and Language

Title: NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data

Authors: Manuel Tonneau, Pedro Vitor Quinta de Castro, Karim Lasri, Ibrahim Farouq, Lakshminarayanan Subramanian, Victor Orozco-Olvera, Samuel Fraiberger

(Submitted on 28 Mar 2024)

Abstract: To address the global issue of hateful content proliferating in online platforms, hate speech detection (HSD) models are typically developed on datasets collected in the United States, thereby failing to generalize to English dialects from the Majority World. Furthermore, HSD models are often evaluated on curated samples, raising concerns about overestimating model performance in real-world settings. In this work, we introduce NaijaHate, the first dataset annotated for HSD which contains a representative sample of Nigerian tweets. We demonstrate that HSD evaluated on biased datasets traditionally used in the literature largely overestimates real-world performance on representative data. We also propose NaijaXLM-T, a pretrained model tailored to the Nigerian Twitter context, and establish the key role played by domain-adaptive pretraining and finetuning in maximizing HSD performance. Finally, we show that in this context, a human-in-the-loop approach to content moderation where humans review 1% of Nigerian tweets flagged as hateful would enable to moderate 60% of all hateful content. Taken together, these results pave the way towards robust HSD systems and a better protection of social media users from hateful content in low-resource settings.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2403.19260 [cs.CL]
	(or arXiv:2403.19260v1 [cs.CL] for this version)

Submission history

From: Manuel Tonneau [view email]
[v1] Thu, 28 Mar 2024 09:34:31 GMT (8181kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2403.19260

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data

Submission history