Computer Science > Artificial Intelligence

Title: Large Language Models can Learn Rules

Abstract: When prompted with a few examples and intermediate steps, large language models (LLMs) have demonstrated impressive performance on various reasoning tasks. However, prompting methods that rely on an LLM's implicit knowledge often produce incorrect answers when that knowledge is wrong or inconsistent with the task. To address this problem, we present Hypotheses-to-Theories (HtT), a framework that learns a rule library for reasoning with LLMs. HtT consists of two stages: an induction stage and a deduction stage. In the induction stage, an LLM is first asked to generate and verify rules over a set of training examples. Rules that appear sufficiently often and lead to correct answers are collected into a rule library. In the deduction stage, the LLM is prompted to apply the learned rule library when answering test questions. Experiments on relational reasoning, numerical reasoning, and concept learning problems show that HtT improves on existing prompting methods, with absolute accuracy gains of 10-30%. The learned rules also transfer to different models and to different forms of the same problem.
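
The two-stage procedure described in the abstract lends itself to a short sketch. The Python below is a minimal illustration of the induction and deduction stages, not the authors' implementation: the `llm` callable, the "Rule:"-prefixed prompt format, the answer check, and the `min_count` and `min_accuracy` thresholds are all assumptions made for the sake of the example.

from collections import Counter

def induce_rules(llm, train_examples, min_count=3, min_accuracy=0.7):
    """Induction stage (illustrative): ask the LLM to state the rules it
    applies while solving training examples, then keep rules that occur
    often and tend to appear in correct solutions."""
    occurrences = Counter()   # how often each rule is generated
    correct_uses = Counter()  # how often it appears in a correct solution
    for question, answer in train_examples:
        # Hypothetical prompt: the model reasons step by step and states
        # each rule it applies on a line beginning with "Rule:".
        response = llm("Solve step by step, stating each rule you use "
                       "on a line starting with 'Rule:'.\nQ: " + question)
        rules = [line[len("Rule:"):].strip()
                 for line in response.splitlines()
                 if line.startswith("Rule:")]
        is_correct = answer in response  # crude answer check, for illustration
        for rule in rules:
            occurrences[rule] += 1
            if is_correct:
                correct_uses[rule] += 1
    # Keep rules that appear often enough and usually lead to correct answers.
    return [r for r, n in occurrences.items()
            if n >= min_count and correct_uses[r] / n >= min_accuracy]

def deduce(llm, rule_library, question):
    """Deduction stage (illustrative): prompt the LLM with the learned
    rule library and ask it to answer using only those rules."""
    rules_text = "\n".join("- " + r for r in rule_library)
    return llm("Use only the rules below to answer.\n"
               "Rules:\n" + rules_text + "\nQ: " + question)

In this sketch, filtering by both frequency and accuracy mirrors the abstract's criterion that rules be collected only when they "appear sufficiently often and lead to correct answers"; the paper itself may verify rules differently.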
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as: arXiv:2310.07064 [cs.AI]
  (or arXiv:2310.07064v2 [cs.AI] for this version)

Submission history

From: Zhaocheng Zhu
[v1] Tue, 10 Oct 2023 23:07:01 GMT (253kb,D)
[v2] Wed, 24 Apr 2024 19:01:59 GMT (268kb,D)
