We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Structure in Deep Reinforcement Learning: A Survey and Open Problems

Abstract: Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural Networks (DNNs) for function approximation, has demonstrated considerable success in numerous applications. However, its practicality in addressing various real-world scenarios, characterized by diverse and unpredictable dynamics, noisy signals, and large state and action spaces, remains limited. This limitation stems from poor data efficiency, limited generalization capabilities, a lack of safety guarantees, and the absence of interpretability, among other factors. To overcome these challenges and improve performance across these crucial metrics, one promising avenue is to incorporate additional structural information about the problem into the RL learning process. Various sub-fields of RL have proposed methods for incorporating such inductive biases. We amalgamate these diverse methodologies under a unified framework, shedding light on the role of structure in the learning problem, and classify these methods into distinct patterns of incorporating structure. By leveraging this comprehensive framework, we provide valuable insights into the challenges of structured RL and lay the groundwork for a design pattern perspective on RL research. This novel perspective paves the way for future advancements and aids in developing more effective and efficient RL algorithms that can potentially handle real-world scenarios better.
Comments: Published at the Journal of Artificial Intelligence Research, Volume 79, Pages 1167-1236
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
DOI: 10.1613/jair.1.15703
Cite as: arXiv:2306.16021 [cs.LG]
  (or arXiv:2306.16021v3 [cs.LG] for this version)

Submission history

From: Aditya Mohan [view email]
[v1] Wed, 28 Jun 2023 08:48:40 GMT (523kb,D)
[v2] Wed, 9 Aug 2023 22:55:00 GMT (526kb,D)
[v3] Thu, 25 Apr 2024 14:40:51 GMT (1880kb,D)

Link back to: arXiv, form interface, contact.