MacGyver: Are Large Language Models Creative Problem Solvers?

Tian, Yufei; Ravichander, Abhilasha; Qin, Lianhui; Bras, Ronan Le; Marjieh, Raja; Peng, Nanyun; Choi, Yejin; Griffiths, Thomas L.; Brahman, Faeze

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2311

Computer Science > Computation and Language

Title: MacGyver: Are Large Language Models Creative Problem Solvers?

Authors: Yufei Tian, Abhilasha Ravichander, Lianhui Qin, Ronan Le Bras, Raja Marjieh, Nanyun Peng, Yejin Choi, Thomas L. Griffiths, Faeze Brahman

(Submitted on 16 Nov 2023 (this version), latest version 27 Mar 2024 (v3))

Abstract: We explore the creative problem-solving capabilities of modern large language models (LLMs) in a constrained setting. The setting requires circumventing a cognitive bias known in psychology as ''functional fixedness'' to use familiar objects in innovative or unconventional ways. To this end, we create MacGyver, an automatically generated dataset consisting of 1,600 real-world problems that deliberately trigger functional fixedness and require thinking 'out-of-the-box'. We then present our collection of problems to both LLMs and humans to compare and contrast their problem-solving abilities. We show that MacGyver is challenging for both groups, but in unique and complementary ways. For example, humans typically excel in solving problems that they are familiar with but may struggle with tasks requiring domain-specific knowledge, leading to a higher variance. On the other hand, LLMs, being exposed to a variety of highly specialized knowledge, attempt broader problems but are prone to overconfidence and propose actions that are physically infeasible or inefficient. We also provide a detailed error analysis of LLMs, and demonstrate the potential of enhancing their problem-solving ability with novel prompting techniques such as iterative step-wise reflection and divergent-convergent thinking. This work provides insight into the creative problem-solving capabilities of humans and AI and illustrates how psychological paradigms can be extended into large-scale tasks for comparing humans and machines.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.09682 [cs.CL]
	(or arXiv:2311.09682v1 [cs.CL] for this version)

Submission history

From: Yufei Tian [view email]
[v1] Thu, 16 Nov 2023 08:52:27 GMT (6580kb,D)
[v2] Thu, 21 Mar 2024 22:44:41 GMT (8340kb,D)
[v3] Wed, 27 Mar 2024 23:43:54 GMT (8340kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2311.09682v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: MacGyver: Are Large Language Models Creative Problem Solvers?

Submission history