Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges

Li, Qingyao; Fu, Lingyue; Zhang, Weiming; Chen, Xianyu; Yu, Jingwei; Xia, Wei; Zhang, Weinan; Tang, Ruiming; Yu, Yong

(Submitted on 27 Dec 2023 (v1), last revised 26 Apr 2024 (this version, v3))

Abstract: Online education platforms use the internet to distribute educational resources and aim to make learning convenient, but they often fall short in real-time communication with students and struggle to address the diverse obstacles students encounter throughout their learning journey. Solving these problems is a significant challenge for traditional deep learning models, because it requires not only a broad spectrum of subject knowledge but also the ability to understand each student's individual difficulties and personalized needs. The recent emergence of large language models (LLMs) offers a possible way to resolve this issue by comprehending individual requests. Although LLMs have been successful in various fields, building an LLM-based education system remains challenging because of the wide range of educational skills required. This paper reviews recent LLM research related to educational capabilities, including mathematics, writing, programming, reasoning, and knowledge-based question answering, with the aim of exploring their potential for constructing the next-generation intelligent education system. For each capability, we investigate two aspects. First, we examine the current state of LLMs regarding this capability: how advanced they have become, whether they surpass human abilities, and what deficiencies might exist. Second, we evaluate whether the development methods for LLMs in this area are generalizable, that is, whether they can be applied to construct a comprehensive educational supermodel with strengths across various capabilities, rather than being effective in only a single aspect.

Comments:	31 pages, 5 figures, 1 table
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2401.08664 [cs.AI]
	(or arXiv:2401.08664v3 [cs.AI] for this version)

Submission history

From: Qingyao Li
[v1] Wed, 27 Dec 2023 14:37:32 GMT (268kb,D)
[v2] Sun, 25 Feb 2024 05:41:24 GMT (268kb,D)
[v3] Fri, 26 Apr 2024 07:59:22 GMT (1474kb,D)
