Evolutionary Reinforcement Learning via Cooperative Coevolution

Hu, Chengpeng; Liu, Jialin; Yao, Xin

Full-text links:

Download:

Current browse context:

cs.NE

< prev | next >

new | recent | 2404

Computer Science > Neural and Evolutionary Computing

Title: Evolutionary Reinforcement Learning via Cooperative Coevolution

Authors: Chengpeng Hu, Jialin Liu, Xin Yao

(Submitted on 23 Apr 2024 (v1), last revised 29 Apr 2024 (this version, v2))

Abstract: Recently, evolutionary reinforcement learning has obtained much attention in various domains. Maintaining a population of actors, evolutionary reinforcement learning utilises the collected experiences to improve the behaviour policy through efficient exploration. However, the poor scalability of genetic operators limits the efficiency of optimising high-dimensional neural networks. To address this issue, this paper proposes a novel cooperative coevolutionary reinforcement learning (CoERL) algorithm. Inspired by cooperative coevolution, CoERL periodically and adaptively decomposes the policy optimisation problem into multiple subproblems and evolves a population of neural networks for each of the subproblems. Instead of using genetic operators, CoERL directly searches for partial gradients to update the policy. Updating policy with partial gradients maintains consistency between the behaviour spaces of parents and offspring across generations. The experiences collected by the population are then used to improve the entire policy, which enhances the sampling efficiency. Experiments on six benchmark locomotion tasks demonstrate that CoERL outperforms seven state-of-the-art algorithms and baselines. Ablation study verifies the unique contribution of CoERL's core ingredients.

Subjects:	Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2404.14763 [cs.NE]
	(or arXiv:2404.14763v2 [cs.NE] for this version)

Submission history

From: Chengpeng Hu [view email]
[v1] Tue, 23 Apr 2024 05:56:35 GMT (7390kb,D)
[v2] Mon, 29 Apr 2024 13:52:35 GMT (7390kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2404.14763

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Neural and Evolutionary Computing

Title: Evolutionary Reinforcement Learning via Cooperative Coevolution

Submission history