Differential Evolution algorithm applied to non-stationary bandit problem

David L. ST-PIERRE*, Jialin LIU

*Corresponding author for this work

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Referred Conference Paperpeer-review

7 Citations (Scopus)

Abstract

In this paper we compare Differential Evolution (DE), an evolutionary algorithm, to classical bandit algorithms over the non-stationary bandit problem. First we define a testcase where the variation of the distributions depends on the number of times an option is evaluated rather than over time. This definition allows the possibility to apply these algorithms over a wide range of problems such as black-box portfolio selection. Second we present our own variant of discounted Upper Confidence Bound (UCB) algorithm that outperforms the current state-of-the-art algorithms for the non-stationary bandit problem. Third, we introduce a variant of DE and show that, on a selection over a portfolio of solvers for the Cart-Pole problem, our version of DE outperforms the current best UCB algorithms.

Original languageEnglish
Title of host publicationProceedings of the 2014 IEEE Congress on Evolutionary Computation, CEC 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2397-2403
Number of pages7
ISBN (Electronic)9781479914883
DOIs
Publication statusPublished - 2014
Externally publishedYes
Event2014 IEEE Congress on Evolutionary Computation, CEC 2014 - Beijing, China
Duration: 6 Jul 201411 Jul 2014

Conference

Conference2014 IEEE Congress on Evolutionary Computation, CEC 2014
Country/TerritoryChina
CityBeijing
Period6/07/1411/07/14

Bibliographical note

Publisher Copyright:
© 2014 IEEE.

Fingerprint

Dive into the research topics of 'Differential Evolution algorithm applied to non-stationary bandit problem'. Together they form a unique fingerprint.

Cite this