Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs

Deep reinforcement learning (DRL) is currently the most popular AI-based approach to autonomous vehicle control. An agent, trained for this purpose in simulation, can interact with the real environment with a human-level performance. Despite very good results in terms of selected metrics, this appro...

Full description

Saved in:

Bibliographic Details
Published in	Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference proceedings pp. 36 - 41
Main Authors	Szolc, Hubert, Desnos, Karol, Kryjak, Tomasz
Format	Conference Proceeding
Language	English
Published	Division of Signal Processing and Electronic Systems, Poznan University of Technology (DSPES PUT) 25.09.2024
Subjects	Autonomous aerial vehicles Computer architecture control algorithms Deep reinforcement learning DRL Laser radar Measurement Navigation Process control Safety Signal processing algorithms Tangled Program Graphs TPG UAV unmanned aerial vehicle
Online Access	Get full text
ISSN	2326-0319
DOI	10.23919/SPA61993.2024.10715635

Cover

Abstract	Deep reinforcement learning (DRL) is currently the most popular AI-based approach to autonomous vehicle control. An agent, trained for this purpose in simulation, can interact with the real environment with a human-level performance. Despite very good results in terms of selected metrics, this approach has some significant drawbacks: high computational requirements and low explainability. Because of that, a DRL-based agent cannot be used in some control tasks, especially when safety is the key issue. Therefore we propose to use Tangled Program Graphs (TPGs) as an alternative for deep reinforcement learning in control-related tasks. In this approach, input signals are processed by simple programs that are combined in a graph structure. As a result, TPGs are less computationally demanding and their actions can be explained based on the graph structure. In this paper, we present our studies on the use of TPGs as an alternative for DRL in control-related tasks. In particular, we consider the problem of navigating an unmanned aerial vehicle (UAV) through the unknown environment based solely on the onboard LiDAR sensor. The results of our work show promising prospects for the use of TPGs in control related-tasks.
AbstractList	Deep reinforcement learning (DRL) is currently the most popular AI-based approach to autonomous vehicle control. An agent, trained for this purpose in simulation, can interact with the real environment with a human-level performance. Despite very good results in terms of selected metrics, this approach has some significant drawbacks: high computational requirements and low explainability. Because of that, a DRL-based agent cannot be used in some control tasks, especially when safety is the key issue. Therefore we propose to use Tangled Program Graphs (TPGs) as an alternative for deep reinforcement learning in control-related tasks. In this approach, input signals are processed by simple programs that are combined in a graph structure. As a result, TPGs are less computationally demanding and their actions can be explained based on the graph structure. In this paper, we present our studies on the use of TPGs as an alternative for DRL in control-related tasks. In particular, we consider the problem of navigating an unmanned aerial vehicle (UAV) through the unknown environment based solely on the onboard LiDAR sensor. The results of our work show promising prospects for the use of TPGs in control related-tasks.
Author	Szolc, Hubert Kryjak, Tomasz Desnos, Karol
Author_xml	– sequence: 1 givenname: Hubert surname: Szolc fullname: Szolc, Hubert email: hubert.szolc@agh.edu.pl organization: AGH University of Krakow,Embedded Vision Systems Group,Department of Automatic Control and Robotics,Kraków,Poland – sequence: 2 givenname: Karol surname: Desnos fullname: Desnos, Karol email: karol.desnos@insa-rennes.fr organization: Univ Rennes, INSA Rennes CNRS, IETR - UMR 6164,Rennes,France,F-35000 – sequence: 3 givenname: Tomasz surname: Kryjak fullname: Kryjak, Tomasz email: tomasz.kryjak@agh.edu.pl organization: AGH University of Krakow,Embedded Vision Systems Group,Department of Automatic Control and Robotics,Kraków,Poland
BookMark	eNo1kM1Kw0AURkdRsNa-geC8QOqdv5vMslStQsCqrdtyk0yaSJopM0Hw7Y2o8MHZHM7iu2Rnve8dYzcC5lJZYW_f1gsU1qq5BKnnAlJhUJkTNrNplimUgEZneMomUklMQAl7wWYxfgCAwGxs4IS9bKjfd67i6-D3gQ58FejYRE7jek7d4EJPQ_vp-OD53WueFBRHu_T9EHw3Cnsf2qE5RF77wLeL93jFzmvqopv9ccq2D_eb5WOSP6-elos8aYQ0Q1LrDAjTCrCUJgWHWqemrjUpQKdNrR0JZwtCJ6A0UECZCQQtK1OJ8kebsuvfbuuc2x1De6Dwtft_QX0Dq8tSnw
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.23919/SPA61993.2024.10715635
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISBN	9788362065486 8362065486
EISSN	2326-0319
EndPage	41
ExternalDocumentID	10715635
Genre	orig-research
GrantInformation_xml	– fundername: Agence Nationale de la Recherche funderid: 10.13039/501100001665
GroupedDBID	6IE 6IF 6IK 6IL 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI M43 OCL RIE RIL
ID	FETCH-LOGICAL-h125t-f480a67d06c2570e64475ff4a306e45f4ea1e9ba6e10c50b0c816042d5d1cf4a3
IEDL.DBID	RIE
IngestDate	Wed Aug 27 02:16:56 EDT 2025
IsDoiOpenAccess	false
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-h125t-f480a67d06c2570e64475ff4a306e45f4ea1e9ba6e10c50b0c816042d5d1cf4a3
OpenAccessLink	https://hal.science/hal-04767061
PageCount	6
ParticipantIDs	ieee_primary_10715635
PublicationCentury	2000
PublicationDate	2024-Sept.-25
PublicationDateYYYYMMDD	2024-09-25
PublicationDate_xml	– month: 09 year: 2024 text: 2024-Sept.-25 day: 25
PublicationDecade	2020
PublicationTitle	Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference proceedings
PublicationTitleAbbrev	SPA
PublicationYear	2024
Publisher	Division of Signal Processing and Electronic Systems, Poznan University of Technology (DSPES PUT)
Publisher_xml	– name: Division of Signal Processing and Electronic Systems, Poznan University of Technology (DSPES PUT)
SSID	ssj0001682396
Score	1.8948536
Snippet	Deep reinforcement learning (DRL) is currently the most popular AI-based approach to autonomous vehicle control. An agent, trained for this purpose in...
SourceID	ieee
SourceType	Publisher
StartPage	36
SubjectTerms	Autonomous aerial vehicles Computer architecture control algorithms Deep reinforcement learning DRL Laser radar Measurement Navigation Process control Safety Signal processing algorithms Tangled Program Graphs TPG UAV unmanned aerial vehicle
Title	Tangled Program Graphs as an alternative to DRL-based control algorithms for UAVs
URI	https://ieeexplore.ieee.org/document/10715635
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEF5sT3rxVfHNHrwmJu3uJnssai2ipWorvZV9TFpRE2mTi7_e2aS1KgiGHJYwgWWHYfbb_eYbQs5MCx8LiacSKzzMENJTRnKEKgifFQ-0iFyB811PdIfsZsRHi2L1shYGAEryGfhuWN7l28wU7qgMIzxCuNHiNVKLYlEVa60OVETcbElRcbhwFMrzx35bOIIawsAm85d__-ijUqaRzibpLSdQsUde_CLXvvn4pc347xlukcaqYo_2v3LRNlmDdIdsfBMb3CX3A5VOXsE6O0fJotdOq3pOFb4pLW_N01IFnOYZvXy49VyCs3TBZUeDSTZ7zqdvc4r7XDpsP80bZNi5Glx0vUVDBW-K-5jcS1gcKBHZQBjXvA6EU_tLEqYQNwDjCQMVgtRKQBgY9FNg4lBgVFtuQ-PM9kg9zVLYJ1QxqxNlpQ4ZZ0pIGelAAwY_hBAhBDwgDbc64_dKM2O8XJjDP74fkXXnJMfEaPJjUs9nBZxgus_1aenmT0QbqQo
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LTwIxEG4UD-rFF8a3PXhd3IW2S49ERVQgqGC4kT5mwai7BpaLv97pLoiamLjZQ7OZTZpOJtOv_eYbQs5MBR8LkaciKzzMENJTRnKEKgifFfe1CF2Bc6stGj122-f9WbF6VgsDABn5DEpumN3l28RM3VEZRniIcKPCl8kKZ4zxvFxrcaQiquWKFDmLC0eBPH_s1ISjqCEQLLPS_P8fnVSyRFLfIO35FHL-yEtpmuqS-filzvjvOW6S4qJmj3a-stEWWYJ4m6x_kxvcIfddFQ9fwTo7R8qi106tekIVvjHN7s3jTAecpgm9fGh6LsVZOmOzo8EwGT-no7cJxZ0u7dWeJkXSq191LxrerKWCN8KdTOpFrOorEVpfGNe-DoTT-4siphA5AOMRAxWA1EpA4Bv0lG-qgcC4ttwGxpntkkKcxLBHqGJWR8pKHTDOlJAy1L4GDH8IIEQQuE-KbnUG77lqxmC-MAd_fD8lq41uqzlo3rTvDsmac5jjZZT5ESmk4ykcY_JP9Unm8k9O8KxX
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Signal+Processing+Algorithms%2C+Architectures%2C+Arrangements%2C+and+Applications+Conference+proceedings&rft.atitle=Tangled+Program+Graphs+as+an+alternative+to+DRL-based+control+algorithms+for+UAVs&rft.au=Szolc%2C+Hubert&rft.au=Desnos%2C+Karol&rft.au=Kryjak%2C+Tomasz&rft.date=2024-09-25&rft.pub=Division+of+Signal+Processing+and+Electronic+Systems%2C+Poznan+University+of+Technology+%28DSPES+PUT%29&rft.eissn=2326-0319&rft.spage=36&rft.epage=41&rft_id=info:doi/10.23919%2FSPA61993.2024.10715635&rft.externalDocID=10715635