
A Proposal for Population-Based Reinforcement Learning

Tim Kovacs, Stuart I. Reynolds

    Research output: Working paper

    Abstract

    We propose novel ways of solving Reinforcement Learning tasks (that is, stochastic optimal control tasks) by hybridising Evolutionary Algorithms with methods based on value functions. We call our approach Population-Based Reinforcement Learning. The key idea, from Evolutionary Computation, is that parallel interacting search processes (in this case Reinforcement Learning or Dynamic Programming algorithms) can aid each other, and produce improved results in less time than the same number of search processes running independently. This is a new and general direction in RL research, and is complementary to other directions as it can be combined with them. We briefly compare our approach to related ones.
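The key idea in the abstract, value-function learners that run in parallel and aid each other, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the five-state chain task, the tabular Q-learning learners, the per-learner `alpha` and `epsilon` ranges, and the exchange rule in which the weakest learner copies the Q-table of the strongest after each generation. The names `step`, `run_episode`, `greedy_steps`, and `train_population` are hypothetical.

```python
import copy
import random

N_STATES = 5           # hypothetical chain task: states 0..4, state 4 is the goal
GOAL = N_STATES - 1
GAMMA = 0.9


def step(state, action):
    """Action 0 moves left (bounded at state 0), action 1 moves right."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == GOAL
    return nxt, (1.0 if done else 0.0), done


def run_episode(q, alpha, epsilon, rng, max_steps=50):
    """One episode of tabular Q-learning with epsilon-greedy exploration."""
    s = 0
    for _ in range(max_steps):
        # Explore with probability epsilon, and also break ties at random.
        if rng.random() < epsilon or q[s][0] == q[s][1]:
            a = rng.randrange(2)
        else:
            a = 0 if q[s][0] > q[s][1] else 1
        s2, r, done = step(s, a)
        target = r if done else r + GAMMA * max(q[s2])
        q[s][a] += alpha * (target - q[s][a])
        if done:
            break
        s = s2


def greedy_steps(q, max_steps=50):
    """Steps the greedy policy needs to reach the goal (max_steps on failure)."""
    s = 0
    for t in range(1, max_steps + 1):
        a = 1 if q[s][1] > q[s][0] else 0
        s, _, done = step(s, a)
        if done:
            return t
    return max_steps


def train_population(n_learners=4, generations=10, episodes_per_gen=20, seed=0):
    rng = random.Random(seed)
    # Each learner has its own Q-table and its own (assumed) hyperparameters.
    population = [
        {
            "q": [[0.0, 0.0] for _ in range(N_STATES)],
            "alpha": rng.uniform(0.1, 0.9),
            "epsilon": rng.uniform(0.05, 0.3),
        }
        for _ in range(n_learners)
    ]
    for _generation in range(generations):
        # Learning phase: every learner improves its own value function.
        for learner in population:
            for _episode in range(episodes_per_gen):
                run_episode(learner["q"], learner["alpha"], learner["epsilon"], rng)
        # Interaction phase (assumed rule): rank learners by greedy performance,
        # then let the weakest restart from the strongest learner's Q-table.
        population.sort(key=lambda learner: greedy_steps(learner["q"]))
        population[-1]["q"] = copy.deepcopy(population[0]["q"])
    return population


if __name__ == "__main__":
    for learner in train_population():
        print(round(learner["alpha"], 2), greedy_steps(learner["q"]))
```

Copying a whole Q-table is only one of many possible interaction rules; it was chosen here because it is the simplest selection-style operator to show. Setting `n_learners=1` reduces the sketch to a single independent Q-learner, which gives a baseline for comparison.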
    Translated title of the contribution: A Proposal for Population-Based Reinforcement Learning
    Original language: English
    Publisher: University of Bristol
    Publication status: Published - 2003

    Bibliographical note

    Other identifier: 1000693
