STaR-GATE: Teaching Language Models to Ask Clarifying Questions

Andukuri, Chinmaya; Fränken, Jan-Philipp; Gerstenberg, Tobias; Goodman, Noah D.

(Submitted on 28 Mar 2024 (v1), last revised 29 Mar 2024 (this version, v2))

Abstract: When prompting language models to complete a task, users often leave important aspects unsaid. While asking questions could resolve this ambiguity (GATE; Li et al., 2023), models often struggle to ask good questions. We explore a language model's ability to self-improve (STaR; Zelikman et al., 2022) by rewarding the model for generating useful questions, a simple method we dub STaR-GATE. We generate a synthetic dataset of 25,500 unique persona-task prompts to simulate conversations between a pretrained language model (the Questioner) and a Roleplayer whose preferences are unknown to the Questioner. By asking questions, the Questioner elicits preferences from the Roleplayer. The Questioner is iteratively finetuned on questions that increase the probability of high-quality responses to the task, which are generated by an Oracle with access to the Roleplayer's latent preferences. After two iterations of self-improvement, the Questioner asks better questions, allowing it to generate responses that are preferred over responses from the initial model on 72% of tasks. Our results indicate that teaching a language model to ask better questions leads to better personalized responses.
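
The selection step described in the abstract (keeping questions that increase the probability of the Oracle's response) might be sketched as follows. This is a minimal illustration, not the authors' implementation: the `Conversation` type, the `select_for_finetuning` function, the `oracle_log_prob` callback, and the per-prompt `top_k` cutoff are all hypothetical names and assumptions introduced here. In practice the callback would be computed with a language model; here it is left abstract.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Conversation:
    """One simulated Questioner/Roleplayer dialogue (hypothetical structure)."""
    prompt_id: str        # identifies one persona-task prompt
    questions: List[str]  # Questioner turns
    answers: List[str]    # Roleplayer turns


def select_for_finetuning(
    conversations: List[Conversation],
    oracle_log_prob: Callable[[Conversation], float],
    top_k: int = 1,
) -> List[Conversation]:
    """Keep, for each prompt, the top_k conversations under which the
    Oracle's response is most probable.

    oracle_log_prob is an assumed scoring callback that returns the
    log-probability of the Oracle response given the conversation.
    """
    # Group sampled conversations by the persona-task prompt they belong to.
    by_prompt: Dict[str, List[Conversation]] = {}
    for conv in conversations:
        by_prompt.setdefault(conv.prompt_id, []).append(conv)

    selected: List[Conversation] = []
    for group in by_prompt.values():
        # Higher log-probability means the questions were more useful.
        ranked = sorted(group, key=oracle_log_prob, reverse=True)
        selected.extend(ranked[:top_k])
    return selected
```

Under these assumptions, the selected conversations would form the finetuning set for the next iteration, and the loop would repeat with the updated Questioner.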

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.19154 [cs.CL]
	(or arXiv:2403.19154v2 [cs.CL] for this version)

Submission history

From: Jan-Philipp Fränken
[v1] Thu, 28 Mar 2024 05:35:22 GMT (717kb,D)
[v2] Fri, 29 Mar 2024 05:15:12 GMT (717kb,D)
