Sim-to-real transfer for fixed-wing uncrewed aerial vehicle: Pitch Control by High-Fidelity Modelling and Domain Randomization

Daichi Wada, Sergio Araujo-Estrada, Shane P Windsor

Research output: Contribution to journalArticle (Academic Journal)peer-review

10 Citations (Scopus)
108 Downloads (Pure)

Abstract

Deep reinforcement learning has great potential to automatically generate flight controllers for uncrewed aerial vehicles (UAVs), however these controllers often fail to perform as expected in real world environments due to differences between the simulation environment and reality. This letter experimentally investigated how this reality gap effect could be mitigated, focusing on fixed-wing UAV pitch control in wind tunnel tests. Three different training approaches were conducted: a baseline approach that used simple linear dynamics, a high-fidelity modeling approach, and a domain randomization approach. It was found that the base line controller was susceptible to the reality gap, while the other two approaches successfully transferred to real tests. To further examine the controllers' capabilities to generalize, a variety of configuration changes were experimentally implemented on the UAV, such as increased inertia, extended elevator area, and aileron offset. While the high-fidelity controller failed to cope with these changes, the controller with domain randomization maintained its performance. These results highlight the importance of selecting appropriate sim-to-real transfer approaches and how domain randomization is applicable to fixed-wing UAV control with uncertainty in real environments.
Original languageEnglish
Pages (from-to)11735-11742
Number of pages8
JournalIEEE Robotics and Automation Letters
Volume7
Issue number4
DOIs
Publication statusPublished - 9 Sept 2022

Bibliographical note

Publisher Copyright:
© 2016 IEEE.

Fingerprint

Dive into the research topics of 'Sim-to-real transfer for fixed-wing uncrewed aerial vehicle: Pitch Control by High-Fidelity Modelling and Domain Randomization'. Together they form a unique fingerprint.

Cite this