Abstract
Recently, evolutionary reinforcement learning has obtained much attention in various domains.Maintaining a population of actors, evolutionary reinforcement learning utilises the collected experiences to improve the behaviour policy through efficient exploration.However, the poor scalability of genetic operators limits the efficiency of optimising high-dimensional neural networks.To address this issue, this paper proposes a novel cooperative coevolutionary reinforcement learning (CoERL) algorithm.Inspired by cooperative coevolution, CoERL periodically and adaptively decomposes the policy optimisation problem into multiple subproblems and evolves a population of neural networks for each of the subproblems.Instead of using genetic operators, CoERL directly searches for partial gradients to update the policy.Updating policy with partial gradients maintains consistency between the behaviour spaces of parents and offspring across generations.The experiences collected by the population are then used to improve the entire policy, which enhances the sampling efficiency.Experiments on six benchmark locomotion tasks demonstrate that CoERL outperforms seven state-of-the-art algorithms and baselines.Ablation study verifies the unique contribution of CoERL’s core ingredients.
Original language | English |
---|---|
Title of host publication | ECAI 2024 - 27th European Conference on Artificial Intelligence, Including 13th Conference on Prestigious Applications of Intelligent Systems, PAIS 2024, Proceedings |
Editors | Ulle ENDRISS, Francisco S. MELO, Kerstin BACH, Alberto BUGARIN-DIZ, Jose M. ALONSO-MORAL, Senen BARRO, Fredrik HEINTZ |
Publisher | IOS Press BV |
Pages | 3300-3307 |
Number of pages | 8 |
ISBN (Electronic) | 9781643685489 |
DOIs | |
Publication status | Published - 16 Oct 2024 |
Event | 27th European Conference on Artificial Intelligence, ECAI 2024 - Santiago de Compostela, Spain Duration: 19 Oct 2024 → 24 Oct 2024 |
Publication series
Name | Frontiers in Artificial Intelligence and Applications |
---|---|
Volume | 392 |
ISSN (Print) | 0922-6389 |
ISSN (Electronic) | 1879-8314 |
Conference
Conference | 27th European Conference on Artificial Intelligence, ECAI 2024 |
---|---|
Country/Territory | Spain |
City | Santiago de Compostela |
Period | 19/10/24 → 24/10/24 |
Bibliographical note
Publisher Copyright:© 2024 The Authors.
Funding
This work was supported by the National Key R&D Program of China (Grant No. 2023YFE0106300), the National Natural Science Foundation of China (Grant No. 62250710682), the Shenzhen Science and Technology Program (Grant No. 20220815181327001), the Guangdong Major Project of Basic and Applied Basic Research (Grant No. 2023B0303000010), the Guangdong Provincial Key Laboratory (Grant No. 2020B121201001), the Research Institute of Trustworthy Autonomous Systems and the SUSTech Undergraduate Teaching Quality and Reform Project (Grant No. XJZLGC202215).