The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement

Kamp, Jonathan; Beinborn, Lisa; Fokkens, Antske

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2403

Computer Science > Computation and Language

Title: The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement

Authors: Jonathan Kamp, Lisa Beinborn, Antske Fokkens

(Submitted on 28 Mar 2024)

Abstract: Post-hoc explanation methods are an important tool for increasing model transparency for users. Unfortunately, the currently used methods for attributing token importance often yield diverging patterns. In this work, we study potential sources of disagreement across methods from a linguistic perspective. We find that different methods systematically select different classes of words and that methods that agree most with other methods and with humans display similar linguistic preferences. Token-level differences between methods are smoothed out if we compare them on the syntactic span level. We also find higher agreement across methods by estimating the most important spans dynamically instead of relying on a fixed subset of size $k$. We systematically investigate the interaction between $k$ and spans and propose an improved configuration for selecting important tokens.

Comments:	Long paper accepted to LREC-Coling 2024 main conference. Please cite the conference proceedings version when available
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.19424 [cs.CL]
	(or arXiv:2403.19424v1 [cs.CL] for this version)

Submission history

From: Jonathan Kamp [view email]
[v1] Thu, 28 Mar 2024 13:56:23 GMT (470kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2403.19424

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement

Submission history