Evolutionary Reinforcement Learning via Cooperative Coevolution

Chengpeng HU, Jialin LIU, Xin YAO

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

Abstract

Recently, evolutionary reinforcement learning has obtained much attention in various domains.Maintaining a population of actors, evolutionary reinforcement learning utilises the collected experiences to improve the behaviour policy through efficient exploration.However, the poor scalability of genetic operators limits the efficiency of optimising high-dimensional neural networks.To address this issue, this paper proposes a novel cooperative coevolutionary reinforcement learning (CoERL) algorithm.Inspired by cooperative coevolution, CoERL periodically and adaptively decomposes the policy optimisation problem into multiple subproblems and evolves a population of neural networks for each of the subproblems.Instead of using genetic operators, CoERL directly searches for partial gradients to update the policy.Updating policy with partial gradients maintains consistency between the behaviour spaces of parents and offspring across generations.The experiences collected by the population are then used to improve the entire policy, which enhances the sampling efficiency.Experiments on six benchmark locomotion tasks demonstrate that CoERL outperforms seven state-of-the-art algorithms and baselines.Ablation study verifies the unique contribution of CoERL’s core ingredients.

Original languageEnglish
Title of host publicationECAI 2024 - 27th European Conference on Artificial Intelligence, Including 13th Conference on Prestigious Applications of Intelligent Systems, PAIS 2024, Proceedings
EditorsUlle ENDRISS, Francisco S. MELO, Kerstin BACH, Alberto BUGARIN-DIZ, Jose M. ALONSO-MORAL, Senen BARRO, Fredrik HEINTZ
PublisherIOS Press BV
Pages3300-3307
Number of pages8
ISBN (Electronic)9781643685489
DOIs
Publication statusPublished - 16 Oct 2024
Event27th European Conference on Artificial Intelligence, ECAI 2024 - Santiago de Compostela, Spain
Duration: 19 Oct 202424 Oct 2024

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume392
ISSN (Print)0922-6389
ISSN (Electronic)1879-8314

Conference

Conference27th European Conference on Artificial Intelligence, ECAI 2024
Country/TerritorySpain
CitySantiago de Compostela
Period19/10/2424/10/24

Bibliographical note

Publisher Copyright:
© 2024 The Authors.

Funding

This work was supported by the National Key R&D Program of China (Grant No. 2023YFE0106300), the National Natural Science Foundation of China (Grant No. 62250710682), the Shenzhen Science and Technology Program (Grant No. 20220815181327001), the Guangdong Major Project of Basic and Applied Basic Research (Grant No. 2023B0303000010), the Guangdong Provincial Key Laboratory (Grant No. 2020B121201001), the Research Institute of Trustworthy Autonomous Systems and the SUSTech Undergraduate Teaching Quality and Reform Project (Grant No. XJZLGC202215).

Fingerprint

Dive into the research topics of 'Evolutionary Reinforcement Learning via Cooperative Coevolution'. Together they form a unique fingerprint.

Cite this