Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps

Liu, Fuxiao; Xu, Paiheng; Li, Zongxia; Feng, Yue; Song, Hyemi

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2307

Computer Science > Computation and Language

Title: Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps

Authors: Fuxiao Liu, Paiheng Xu, Zongxia Li, Yue Feng, Hyemi Song

(Submitted on 11 Jul 2023 (v1), last revised 26 Apr 2024 (this version, v4))

Abstract: We investigate the role of various demonstration components in the in-context learning (ICL) performance of large language models (LLMs). Specifically, we explore the impacts of ground-truth labels, input distribution, and complementary explanations, particularly when these are altered or perturbed. We build on previous work, which offers mixed findings on how these elements influence ICL. To probe these questions, we employ explainable NLP (XNLP) methods and utilize saliency maps of contrastive demonstrations for both qualitative and quantitative analysis. Our findings reveal that flipping ground-truth labels significantly affects the saliency, though it's more noticeable in larger LLMs. Our analysis of the input distribution at a granular level reveals that changing sentiment-indicative terms in a sentiment analysis task to neutral ones does not have as substantial an impact as altering ground-truth labels. Finally, we find that the effectiveness of complementary explanations in boosting ICL performance is task-dependent, with limited benefits seen in sentiment analysis tasks compared to symbolic reasoning tasks. These insights are critical for understanding the functionality of LLMs and guiding the development of effective demonstrations, which is increasingly relevant in light of the growing use of LLMs in applications such as ChatGPT. Our research code is publicly available at this https URL

Comments:	10 pages, 5 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2307.05052 [cs.CL]
	(or arXiv:2307.05052v4 [cs.CL] for this version)

Submission history

From: Fuxiao Liu [view email]
[v1] Tue, 11 Jul 2023 07:03:29 GMT (1596kb,D)
[v2] Tue, 28 Nov 2023 20:12:36 GMT (1596kb,D)
[v3] Mon, 15 Apr 2024 15:54:51 GMT (1596kb,D)
[v4] Fri, 26 Apr 2024 01:18:13 GMT (1596kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2307.05052

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps

Submission history