Improvement of the DDPG Algorithm via Twin Delayed DDPG (TD3) on Vertical Rocket Landing Control System

Faisal Amir Maz, Prawito Prajitno, Rika Andiarti, Rini Akmeliawati, Djati Handoko, Sastra Kusuma Wijaya, Larasmoyo Nugroho

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The Double Deep Policy Gradient or DDPG Reinforcement Learning algorithm has a tendency to overestimate computations, which results in learning policies that are not optimum. When DDPG agent is applied in the rocket landing control system, high frequency chattering occurs at the main engine's control command and this gives substantial drawbacks for the liquid propulsion life longevity. The Twin-Delayed Deep Deterministic Policy Gradient (TD3) Reinforcement Learning agent is proposed to overcame this chattering by delaying the landing rocket control command, as simulated in two dimensions. This TD3 controller is shown to be able to dampen the engine command as its achievements are contrasted with the outcomes of the DDPG one.

Original languageEnglish
Title of host publicationAIP Conference Proceedings
EditorsHarry Septanto, Muhammad Ilham Adhynugraha, Yenni Vetrita, Cahya Edi Santosa, Peberlin Parulian Sitompul, Erma Yulihastin, Johan Muhamad, Mardianis, Ery Fitrianingsih, Mario Batubara, Prayitno Abadi, Afni Restasari
PublisherAmerican Institute of Physics Inc.
Edition1
ISBN (Electronic)9780735447554
DOIs
Publication statusPublished - 11 Dec 2023
Event9th International Seminar on Aerospace Science and Technology, ISAST 2022 - Virtual, Online, Indonesia
Duration: 22 Nov 202223 Nov 2022

Publication series

NameAIP Conference Proceedings
Number1
Volume2941
ISSN (Print)0094-243X
ISSN (Electronic)1551-7616

Conference

Conference9th International Seminar on Aerospace Science and Technology, ISAST 2022
Country/TerritoryIndonesia
CityVirtual, Online
Period22/11/2223/11/22

Fingerprint

Dive into the research topics of 'Improvement of the DDPG Algorithm via Twin Delayed DDPG (TD3) on Vertical Rocket Landing Control System'. Together they form a unique fingerprint.

Cite this