Genetic Algorithms Optimization of a Reinforcement Learning-based Controller for Vertical Landing Rocket Case

Diva Kartika Larasati, Larasmoyo Nugroho, Sastra Kusuma Wijaya, Rika Andiarti, Rini Akmeliawati, Prawito Prajitno, Ery Fitrianingsih

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

A reward function in reinforcement learning is the formalization of the objective. Finding the ideal reward function is a challenge, that needs a search strategy to be constructed. Genetic Algorithm is a suitable approach for reward function search due to its thoroughness. The Deep Deterministic Policy Gradient (DDPG) algorithm, which is the focus of this research, is a reinforcement learning-based controller which performances are improved after the Genetic Algorithms optimizes the agent's reward functions. The optimized controller results in narrower missed distance and lower landing velocity compared to referenced DDPG controller, and significantly less fuel consumption compared to PID.

Original languageEnglish
Title of host publication2022 IEEE International Conference on Aerospace Electronics and Remote Sensing Technology, ICARES 2022 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665461917
DOIs
Publication statusPublished - 2022
Event2022 IEEE International Conference on Aerospace Electronics and Remote Sensing Technology, ICARES 2022 - Virtual, Online, Indonesia
Duration: 24 Nov 202225 Nov 2022

Publication series

Name2022 IEEE International Conference on Aerospace Electronics and Remote Sensing Technology, ICARES 2022 - Proceedings

Conference

Conference2022 IEEE International Conference on Aerospace Electronics and Remote Sensing Technology, ICARES 2022
Country/TerritoryIndonesia
CityVirtual, Online
Period24/11/2225/11/22

Keywords

  • DDPG
  • reinforcement learning
  • reward shaping function
  • rocket landing

Fingerprint

Dive into the research topics of 'Genetic Algorithms Optimization of a Reinforcement Learning-based Controller for Vertical Landing Rocket Case'. Together they form a unique fingerprint.

Cite this