Title: Harnessing Large Language Model to collect and analyze Metal-organic framework property dataset

Abstract: This research was focused on the efficient collection of experimental Metal-Organic Framework (MOF) data from scientific literature to address the challenges of accessing hard-to-find data and improving the quality of information available for machine learning studies in materials science. Utilizing a chain of advanced Large Language Models (LLMs), we developed a systematic approach to extract and organize MOF data into a structured format. Our methodology successfully compiled information from more than 40,000 research articles, creating a comprehensive and ready-to-use dataset. The findings highlight the significant advantage of incorporating experimental data over relying solely on simulated data for enhancing the accuracy of machine learning predictions in the field of MOF research.
Subjects: Materials Science (cond-mat.mtrl-sci)
Cite as: arXiv:2404.13053 [cond-mat.mtrl-sci]
  (or arXiv:2404.13053v1 [cond-mat.mtrl-sci] for this version)

Submission history

From: Wonseok Lee
[v1] Sun, 31 Mar 2024 12:47:24 GMT (2742kb)
