Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs

Deep reinforcement learning (DRL) is currently the most popular AI-based approach to autonomous vehicle control. An agent, trained for this purpose in simulation, can interact with the real environment with a human-level performance. Despite very good results in terms of selected metrics, this appro...

Full description

Saved in:
Bibliographic Details
Published inSignal Processing Algorithms, Architectures, Arrangements, and Applications Conference proceedings pp. 36 - 41
Main Authors Szolc, Hubert, Desnos, Karol, Kryjak, Tomasz
Format Conference Proceeding
LanguageEnglish
Published Division of Signal Processing and Electronic Systems, Poznan University of Technology (DSPES PUT) 25.09.2024
Subjects
Online AccessGet full text
ISSN2326-0319
DOI10.23919/SPA61993.2024.10715635

Cover

Abstract Deep reinforcement learning (DRL) is currently the most popular AI-based approach to autonomous vehicle control. An agent, trained for this purpose in simulation, can interact with the real environment with a human-level performance. Despite very good results in terms of selected metrics, this approach has some significant drawbacks: high computational requirements and low explainability. Because of that, a DRL-based agent cannot be used in some control tasks, especially when safety is the key issue. Therefore we propose to use Tangled Program Graphs (TPGs) as an alternative for deep reinforcement learning in control-related tasks. In this approach, input signals are processed by simple programs that are combined in a graph structure. As a result, TPGs are less computationally demanding and their actions can be explained based on the graph structure. In this paper, we present our studies on the use of TPGs as an alternative for DRL in control-related tasks. In particular, we consider the problem of navigating an unmanned aerial vehicle (UAV) through the unknown environment based solely on the onboard LiDAR sensor. The results of our work show promising prospects for the use of TPGs in control related-tasks.
AbstractList Deep reinforcement learning (DRL) is currently the most popular AI-based approach to autonomous vehicle control. An agent, trained for this purpose in simulation, can interact with the real environment with a human-level performance. Despite very good results in terms of selected metrics, this approach has some significant drawbacks: high computational requirements and low explainability. Because of that, a DRL-based agent cannot be used in some control tasks, especially when safety is the key issue. Therefore we propose to use Tangled Program Graphs (TPGs) as an alternative for deep reinforcement learning in control-related tasks. In this approach, input signals are processed by simple programs that are combined in a graph structure. As a result, TPGs are less computationally demanding and their actions can be explained based on the graph structure. In this paper, we present our studies on the use of TPGs as an alternative for DRL in control-related tasks. In particular, we consider the problem of navigating an unmanned aerial vehicle (UAV) through the unknown environment based solely on the onboard LiDAR sensor. The results of our work show promising prospects for the use of TPGs in control related-tasks.
Author Szolc, Hubert
Kryjak, Tomasz
Desnos, Karol
Author_xml – sequence: 1
  givenname: Hubert
  surname: Szolc
  fullname: Szolc, Hubert
  email: hubert.szolc@agh.edu.pl
  organization: AGH University of Krakow,Embedded Vision Systems Group,Department of Automatic Control and Robotics,Kraków,Poland
– sequence: 2
  givenname: Karol
  surname: Desnos
  fullname: Desnos, Karol
  email: karol.desnos@insa-rennes.fr
  organization: Univ Rennes, INSA Rennes CNRS, IETR - UMR 6164,Rennes,France,F-35000
– sequence: 3
  givenname: Tomasz
  surname: Kryjak
  fullname: Kryjak, Tomasz
  email: tomasz.kryjak@agh.edu.pl
  organization: AGH University of Krakow,Embedded Vision Systems Group,Department of Automatic Control and Robotics,Kraków,Poland
BookMark eNo1kM1Kw0AURkdRsNa-geC8QOqdv5vMslStQsCqrdtyk0yaSJopM0Hw7Y2o8MHZHM7iu2Rnve8dYzcC5lJZYW_f1gsU1qq5BKnnAlJhUJkTNrNplimUgEZneMomUklMQAl7wWYxfgCAwGxs4IS9bKjfd67i6-D3gQ58FejYRE7jek7d4EJPQ_vp-OD53WueFBRHu_T9EHw3Cnsf2qE5RF77wLeL93jFzmvqopv9ccq2D_eb5WOSP6-elos8aYQ0Q1LrDAjTCrCUJgWHWqemrjUpQKdNrR0JZwtCJ6A0UECZCQQtK1OJ8kebsuvfbuuc2x1De6Dwtft_QX0Dq8tSnw
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.23919/SPA61993.2024.10715635
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9788362065486
8362065486
EISSN 2326-0319
EndPage 41
ExternalDocumentID 10715635
Genre orig-research
GrantInformation_xml – fundername: Agence Nationale de la Recherche
  funderid: 10.13039/501100001665
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
M43
OCL
RIE
RIL
ID FETCH-LOGICAL-h125t-f480a67d06c2570e64475ff4a306e45f4ea1e9ba6e10c50b0c816042d5d1cf4a3
IEDL.DBID RIE
IngestDate Wed Aug 27 02:16:56 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-h125t-f480a67d06c2570e64475ff4a306e45f4ea1e9ba6e10c50b0c816042d5d1cf4a3
OpenAccessLink https://hal.science/hal-04767061
PageCount 6
ParticipantIDs ieee_primary_10715635
PublicationCentury 2000
PublicationDate 2024-Sept.-25
PublicationDateYYYYMMDD 2024-09-25
PublicationDate_xml – month: 09
  year: 2024
  text: 2024-Sept.-25
  day: 25
PublicationDecade 2020
PublicationTitle Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference proceedings
PublicationTitleAbbrev SPA
PublicationYear 2024
Publisher Division of Signal Processing and Electronic Systems, Poznan University of Technology (DSPES PUT)
Publisher_xml – name: Division of Signal Processing and Electronic Systems, Poznan University of Technology (DSPES PUT)
SSID ssj0001682396
Score 1.8948536
Snippet Deep reinforcement learning (DRL) is currently the most popular AI-based approach to autonomous vehicle control. An agent, trained for this purpose in...
SourceID ieee
SourceType Publisher
StartPage 36
SubjectTerms Autonomous aerial vehicles
Computer architecture
control algorithms
Deep reinforcement learning
DRL
Laser radar
Measurement
Navigation
Process control
Safety
Signal processing algorithms
Tangled Program Graphs
TPG
UAV
unmanned aerial vehicle
Title Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs
URI https://ieeexplore.ieee.org/document/10715635
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEF5sT3rxVfHNHrwmJu3uJnssai2ipWorvZV9TFpRE2mTi7_e2aS1KgiGHJYwgWWHYfbb_eYbQs5MCx8LiacSKzzMENJTRnKEKgifFQ-0iFyB811PdIfsZsRHi2L1shYGAEryGfhuWN7l28wU7qgMIzxCuNHiNVKLYlEVa60OVETcbElRcbhwFMrzx35bOIIawsAm85d__-ijUqaRzibpLSdQsUde_CLXvvn4pc347xlukcaqYo_2v3LRNlmDdIdsfBMb3CX3A5VOXsE6O0fJotdOq3pOFb4pLW_N01IFnOYZvXy49VyCs3TBZUeDSTZ7zqdvc4r7XDpsP80bZNi5Glx0vUVDBW-K-5jcS1gcKBHZQBjXvA6EU_tLEqYQNwDjCQMVgtRKQBgY9FNg4lBgVFtuQ-PM9kg9zVLYJ1QxqxNlpQ4ZZ0pIGelAAwY_hBAhBDwgDbc64_dKM2O8XJjDP74fkXXnJMfEaPJjUs9nBZxgus_1aenmT0QbqQo
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LTwIxEG4UD-rFF8a3PXhd3IW2S49ERVQgqGC4kT5mwai7BpaLv97pLoiamLjZQ7OZTZpOJtOv_eYbQs5MBR8LkaciKzzMENJTRnKEKgifFfe1CF2Bc6stGj122-f9WbF6VgsDABn5DEpumN3l28RM3VEZRniIcKPCl8kKZ4zxvFxrcaQiquWKFDmLC0eBPH_s1ISjqCEQLLPS_P8fnVSyRFLfIO35FHL-yEtpmuqS-filzvjvOW6S4qJmj3a-stEWWYJ4m6x_kxvcIfddFQ9fwTo7R8qi106tekIVvjHN7s3jTAecpgm9fGh6LsVZOmOzo8EwGT-no7cJxZ0u7dWeJkXSq191LxrerKWCN8KdTOpFrOorEVpfGNe-DoTT-4siphA5AOMRAxWA1EpA4Bv0lG-qgcC4ttwGxpntkkKcxLBHqGJWR8pKHTDOlJAy1L4GDH8IIEQQuE-KbnUG77lqxmC-MAd_fD8lq41uqzlo3rTvDsmac5jjZZT5ESmk4ykcY_JP9Unm8k9O8KxX
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Signal+Processing+Algorithms%2C+Architectures%2C+Arrangements%2C+and+Applications+Conference+proceedings&rft.atitle=Tangled+Program+Graphs+as+an+alternative+to+DRL-based+control+algorithms+for+UAVs&rft.au=Szolc%2C+Hubert&rft.au=Desnos%2C+Karol&rft.au=Kryjak%2C+Tomasz&rft.date=2024-09-25&rft.pub=Division+of+Signal+Processing+and+Electronic+Systems%2C+Poznan+University+of+Technology+%28DSPES+PUT%29&rft.eissn=2326-0319&rft.spage=36&rft.epage=41&rft_id=info:doi/10.23919%2FSPA61993.2024.10715635&rft.externalDocID=10715635