Staged Reinforcement Learning for Complex Tasks Through Decomposed Environments

Rafael Pina*, Corentin Artaud, Xiaolan Liu, Varuna De Silva

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference Contribution (Conference Proceeding)

1 Citation (Scopus)

Abstract

Reinforcement Learning (RL) is an area of growing interest in the field of artificial intelligence due to its many notable applications in diverse fields. Particularly within the context of intelligent vehicle control, RL has made impressive progress. However, currently it is still in simulated controlled environments where RL can achieve its full super-human potential. Although how to apply simulation experience in real scenarios has been studied, how to approximate simulated problems to the real dynamic problems is still a challenge. In this paper, we discuss two methods that approximate RL problems to real problems. In the context of traffic junction simulations, we demonstrate that, if we can decompose a complex task into multiple sub-tasks, solving these tasks first can be advantageous to help minimising possible occurrences of catastrophic events in the complex task. From a multi-agent perspective, we introduce a training structuring mechanism that exploits the use of experience learned under the popular paradigm called Centralised Training Decentralised Execution (CTDE). This experience can then be leveraged in fully decentralised settings that are conceptually closer to real settings, where agents often do not have access to a central oracle and must be treated as isolated independent units. The results show that the proposed approaches improve agents performance in complex tasks related to traffic junctions, minimizing potential safety-critical problems that might happen in these scenarios. Although still in simulation, the investigated situations are conceptually closer to real scenarios and thus, with these results, we intend to motivate further research in the subject.
Original languageEnglish
Title of host publicationIntelligent Systems and Pattern Recognition
Subtitle of host publicationThird International Conference, ISPR 2023, Hammamet, Tunisia, May 11–13, 2023, Revised Selected Papers, Part II
EditorsAkram Bennour, Ahmed Bouridane, Lotfi Chaari
PublisherSpringer, Cham
Chapter11
Pages141-154
Number of pages14
ISBN (Electronic)9783031463389
ISBN (Print)9783031463372
DOIs
Publication statusPublished - 5 Nov 2024

Publication series

NameCommunications in Computer and Information Science
Volume1941 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Bibliographical note

Publisher Copyright:
© 2024, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Fingerprint

Dive into the research topics of 'Staged Reinforcement Learning for Complex Tasks Through Decomposed Environments'. Together they form a unique fingerprint.

Cite this