RD2Bench: Toward Data-Centric Automatic R&D

Chen, Haotian; Shen, Xinjie; Ye, Zeqi; Yang, Xiao; Yang, Xu; Liu, Weiqing; Bian, Jiang

Full-text links:

Download:

Current browse context:

cs.AI

< prev | next >

new | recent | 2404

Computer Science > Artificial Intelligence

Title: RD2Bench: Toward Data-Centric Automatic R&D

Authors: Haotian Chen, Xinjie Shen, Zeqi Ye, Xiao Yang, Xu Yang, Weiqing Liu, Jiang Bian

(Submitted on 17 Apr 2024)

Abstract: The progress of humanity is driven by those successful discoveries accompanied by countless failed experiments. Researchers often seek the potential research directions by reading and then verifying them through experiments. The process imposes a significant burden on researchers. In the past decade, the data-driven black-box deep learning method demonstrates its effectiveness in a wide range of real-world scenarios, which exacerbates the experimental burden of researchers and thus renders the potential successful discoveries veiled. Therefore, automating such a research and development (R&D) process is an urgent need. In this paper, we serve as the first effort to formalize the goal by proposing a Real-world Data-centric automatic R&D Benchmark, namely RD2Bench. RD2Bench benchmarks all the operations in data-centric automatic R&D (D-CARD) as a whole to navigate future work toward our goal directly. We focuses on evaluating the interaction and synergistic effects of various model capabilities and aiding to select the well-performed trustworthy models. Although RD2Bench is very challenging to the state-of-the-art (SOTA) large language model (LLM) named GPT-4, indicating ample research opportunities and more research efforts, LLMs possess promising potential to bring more significant development to D-CARD: They are able to implement some simple methods without adopting any additional techniques. We appeal to future work to take developing techniques for tackling automatic R&D into consideration, thus bringing the opportunities of the potential revolutionary upgrade to human productivity.

Comments:	17 pages, 5 figures,
Subjects:	Artificial Intelligence (cs.AI); General Finance (q-fin.GN)
Cite as:	arXiv:2404.11276 [cs.AI]
	(or arXiv:2404.11276v1 [cs.AI] for this version)

Submission history

From: Xu Yang [view email]
[v1] Wed, 17 Apr 2024 11:33:21 GMT (717kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2404.11276

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Artificial Intelligence

Title: RD2Bench: Toward Data-Centric Automatic R&D

Submission history