Leveraging Code to Improve In-context Learning for Semantic Parsing

Bogin, Ben; Gupta, Shivanshu; Clark, Peter; Sabharwal, Ashish

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2311

Change to browse by:

Computer Science > Computation and Language

Title: Leveraging Code to Improve In-context Learning for Semantic Parsing

Authors: Ben Bogin, Shivanshu Gupta, Peter Clark, Ashish Sabharwal

(Submitted on 16 Nov 2023 (v1), last revised 27 Mar 2024 (this version, v2))

Abstract: In-context learning (ICL) is an appealing approach for semantic parsing due to its few-shot nature and improved generalization. However, learning to parse to rare domain-specific languages (DSLs) from just a few demonstrations is challenging, limiting the performance of even the most capable LLMs. In this work, we improve the effectiveness of ICL for semantic parsing by (1) using general-purpose programming languages such as Python instead of DSLs, and (2) augmenting prompts with a structured domain description that includes, e.g., the available classes and functions. We show that both these changes significantly improve accuracy across three popular datasets. Combined, they lead to dramatic improvements (e.g. 7.9% to 66.5% on SMCalFlow compositional split), nearly closing the performance gap between easier i.i.d.\ and harder compositional splits when used with a strong model, and reducing the need for a large number of demonstrations. We find that the resemblance of the target parse language to general-purpose code is a more important factor than the language's popularity in pre-training corpora. Our findings provide an improved methodology for building semantic parsers in the modern context of ICL with LLMs.

Comments:	Accepted to NAACL 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2311.09519 [cs.CL]
	(or arXiv:2311.09519v2 [cs.CL] for this version)

Submission history

From: Ben Bogin [view email]
[v1] Thu, 16 Nov 2023 02:50:06 GMT (7807kb,D)
[v2] Wed, 27 Mar 2024 21:52:11 GMT (7886kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2311.09519

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Leveraging Code to Improve In-context Learning for Semantic Parsing

Submission history