DoDo Learning: DOmain-DemOgraphic Transfer in Language Models for Detecting Abuse Targeted at Public Figures

Kirk, Hannah Rose; Williams, Angus R.; Burke, Liam; Chung, Yi-Ling; Debono, Ivan; Johansson, Pica; Stevens, Francesca; Bright, Jonathan; Hale, Scott A.

(Submitted on 31 Jul 2023 (v1), revised 21 Aug 2023 (this version, v2), latest version 25 Apr 2024 (v3))

Abstract: Public figures receive a disproportionate amount of abuse on social media, impacting their active participation in public life. Automated systems can identify abuse at scale but labelling training data is expensive, complex and potentially harmful. So, it is desirable that systems are efficient and generalisable, handling both shared and specific aspects of online abuse. We explore the dynamics of cross-group text classification in order to understand how well classifiers trained on one domain or demographic can transfer to others, with a view to building more generalisable abuse classifiers. We fine-tune language models to classify tweets targeted at public figures across DOmains (sport and politics) and DemOgraphics (women and men) using our novel DODO dataset, containing 28,000 labelled entries, split equally across four domain-demographic pairs. We find that (i) small amounts of diverse data are hugely beneficial to generalisation and model adaptation; (ii) models transfer more easily across demographics but models trained on cross-domain data are more generalisable; (iii) some groups contribute more to generalisability than others; and (iv) dataset similarity is a signal of transferability.

Comments:	15 pages, 7 figures, 4 tables
Subjects:	Computation and Language (cs.CL); Computers and Society (cs.CY)
Cite as:	arXiv:2307.16811 [cs.CL]
	(or arXiv:2307.16811v2 [cs.CL] for this version)

Submission history

From: Angus Williams
[v1] Mon, 31 Jul 2023 16:29:08 GMT (191kb,D)
[v2] Mon, 21 Aug 2023 10:20:02 GMT (199kb,D)
[v3] Thu, 25 Apr 2024 10:22:39 GMT (425kb,D)
