
Title: JaFIn: Japanese Financial Instruction Dataset

Abstract: We construct an instruction dataset for large language models (LLMs) in the Japanese finance domain. Domain adaptation of language models, including LLMs, has been receiving increasing attention as these models become more widely used. This study demonstrates the effectiveness of domain adaptation through instruction tuning. To this end, we propose JaFIn, the Japanese Financial Instruction Dataset, an instruction-tuning dataset in Japanese. JaFIn is manually constructed from multiple data sources, including Japanese government websites, that provide extensive financial knowledge. We then use JaFIn to instruction-tune several LLMs and show that the resulting finance-specialized models have better domain adaptability than the original models. The finance-specialized LLMs were evaluated on a quantitative Japanese financial benchmark and through qualitative response comparisons, showing improved performance over the originals.
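For readers who want to reproduce the basic recipe, the sketch below shows how a JaFIn-style instruction dataset could be used to fine-tune a causal LLM with Hugging Face Transformers. This is a minimal illustration, not the paper's implementation: the file name jafin.jsonl, the instruction/input/output field names, the Japanese prompt template, and the placeholder model name are all assumptions, and the paper's actual training configuration (e.g., parameter-efficient tuning) may differ.

```python
# Minimal sketch of instruction tuning on a JaFIn-style dataset using
# Hugging Face Transformers. The file name "jafin.jsonl", the record
# fields, the prompt template, and the placeholder model name are
# illustrative assumptions, not the paper's exact setup.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "your-japanese-base-llm"  # placeholder: substitute a real Japanese base LLM
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token  # causal LMs often lack a pad token

# Assume JSONL records of the form:
# {"instruction": "...", "input": "...", "output": "..."}
dataset = load_dataset("json", data_files="jafin.jsonl", split="train")

def format_example(example):
    # Concatenate instruction, optional context, and the reference answer
    # into one prompt-completion string for causal-LM training.
    text = f"指示: {example['instruction']}\n"
    if example.get("input"):
        text += f"入力: {example['input']}\n"
    text += f"応答: {example['output']}"
    return tokenizer(text, truncation=True, max_length=1024)

tokenized = dataset.map(format_example, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="jafin-sft",
        num_train_epochs=3,
        per_device_train_batch_size=2,
    ),
    train_dataset=tokenized,
    # mlm=False yields next-token (causal) labels rather than masked-LM labels.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

In a more careful setup one would also mask the prompt tokens from the loss and evaluate the tuned model on a held-out Japanese financial benchmark, as the paper reports doing.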
Comments: 10 pages, 1 figure
Subjects: Computation and Language (cs.CL); Computational Engineering, Finance, and Science (cs.CE)
Cite as: arXiv:2404.09260 [cs.CL]
  (or arXiv:2404.09260v1 [cs.CL] for this version)

Submission history

From: Masahiro Suzuki
[v1] Sun, 14 Apr 2024 14:01:53 GMT (126kb,D)
