TD3 Agent-Based Nonlinear Dynamic Inverse Control for Fixed-Wing UAV Attitudes

To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as th...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on intelligent transportation systems pp. 1 - 12
Main Authors	Hu, Wenjun, Wang, Yujie, Chen, Qingyang, Wang, Peng, Wu, Erdong, Guo, Zheng, Hou, Zhongxi
Format	Journal Article
Language	English
Published	IEEE 01.05.2025
Subjects	Angular velocity Attitude control Autonomous aerial vehicles Control systems Disturbance observers Fixed-wing UAV Mathematical models nonlinear disturbance observer nonlinear dynamic inverse Nonlinear dynamical systems Reinforcement learning Robustness TD3 Torque
Online Access	Get full text
ISSN	1524-9050 1558-0016
DOI	10.1109/TITS.2025.3561517

Cover

Abstract	To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as the primary subject of investigation, and its attitude dynamics model is established. A control system employing a nonlinear disturbance observer within a dynamic inverse framework has been developed based on this model. Subsequently, the stability of the resulting control system is verified through Lyapunov analysis. Following this, a twin delayed deep deterministic policy gradient (TD3) agent is introduced, with the closed-loop system serving as the training environment. Through continuous interaction with its surroundings, the agent learns to dynamically adjust control parameters in response to control errors. Ultimately, the trained RL agent is utilized to optimize the control parameters for the dynamic system, and a flight simulation of the fixed-wing UAV's attitude control is conducted. The simulation results demonstrate that the control parameters can be adaptively adjusted using the TD3-NDI method, which mitigates overshoot and suppresses oscillations during the control process. These findings confirm the effectiveness and robustness of the proposed control strategy.
AbstractList	To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as the primary subject of investigation, and its attitude dynamics model is established. A control system employing a nonlinear disturbance observer within a dynamic inverse framework has been developed based on this model. Subsequently, the stability of the resulting control system is verified through Lyapunov analysis. Following this, a twin delayed deep deterministic policy gradient (TD3) agent is introduced, with the closed-loop system serving as the training environment. Through continuous interaction with its surroundings, the agent learns to dynamically adjust control parameters in response to control errors. Ultimately, the trained RL agent is utilized to optimize the control parameters for the dynamic system, and a flight simulation of the fixed-wing UAV's attitude control is conducted. The simulation results demonstrate that the control parameters can be adaptively adjusted using the TD3-NDI method, which mitigates overshoot and suppresses oscillations during the control process. These findings confirm the effectiveness and robustness of the proposed control strategy.
Author	Chen, Qingyang Wang, Peng Guo, Zheng Wang, Yujie Hu, Wenjun Wu, Erdong Hou, Zhongxi
Author_xml	– sequence: 1 givenname: Wenjun orcidid: 0000-0003-2952-808X surname: Hu fullname: Hu, Wenjun email: 15271891485@163.com organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 2 givenname: Yujie orcidid: 0000-0002-8304-9277 surname: Wang fullname: Wang, Yujie email: yjwang@nudt.edu.cn organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 3 givenname: Qingyang orcidid: 0000-0002-5134-8184 surname: Chen fullname: Chen, Qingyang email: chy1982_008@nudt.edu.cn organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 4 givenname: Peng orcidid: 0009-0005-1007-0779 surname: Wang fullname: Wang, Peng email: wangp_xt@163.com organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 5 givenname: Erdong surname: Wu fullname: Wu, Erdong email: 1957468263@qq.com organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 6 givenname: Zheng surname: Guo fullname: Guo, Zheng email: guozheng@nudt.edu.cn organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 7 givenname: Zhongxi surname: Hou fullname: Hou, Zhongxi email: hzx@nudt.edu.cn organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China
BookMark	eNpNkMFOAjEQhhujiYA-gImHvkBx2tLt9riCKAnBg4seN6WdJWuga9rVyNvLBg6e_smf-SaTb0guQxuQkDsOY87BPJSL8m0sQKixVBlXXF-QAVcqZwA8u-xnMWEGFFyTYUqfx3aiOB-QVTmTtNhi6NijTejpqg27JqCNdHYIdt84ugg_GBPSaRu62O5o3UY6b37Rs48mbOm6eKdF1zXdt8d0Q65qu0t4e84RWc-fyukLW74-L6bFkjkhoGPae5g45wCF8yCFRZc5pzf6-KWGmvtMy1ybDL1E43DjbCZqC7kFqJFnuRwRfrrrYptSxLr6is3exkPFoeqFVL2QqhdSnYUcmfsT0yDiv32Tc6ON_AP7VF5u
CODEN	ITISFG
ContentType	Journal Article
DBID	97E RIA RIE AAYXX CITATION
DOI	10.1109/TITS.2025.3561517
DatabaseName	Accès INSA - IEEE Xplore ASPP 2005 IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef
DatabaseTitle	CrossRef
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISSN	1558-0016
EndPage	12
ExternalDocumentID	10_1109_TITS_2025_3561517 10981979
Genre	orig-research
GrantInformation_xml	– fundername: National Natural Science Foundation of China grantid: 52172410 funderid: 10.13039/501100001809 – fundername: Natural Science Foundation of Hunan Province grantid: 2023JJ30631 funderid: 10.13039/501100004735 – fundername: Major Science and Technology Innovation 2030 projects grantid: 2021ZD0140300
GroupedDBID	-~X 0R~ 29I 4.4 5GY 5VS 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACGFO ACGFS ACIWK ACNCT AENEX AGQYO AHBIQ AKJIK AKQYR ALMA_UNASSIGNED_HOLDINGS ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS HZ~ IFIPE IPLJI JAVBF LAI M43 O9- OCL P2P PQQKQ RIA RIE RNS AAYXX AETIX AGSQL AIBXA CITATION EJD H~9 ZY4
ID	FETCH-LOGICAL-c220t-7dd04ccc0e2cd032aec6cc7b752470f1d6738796ed3e9cebca62fa08a00fe1683
IEDL.DBID	RIE
ISSN	1524-9050
IngestDate	Wed Oct 01 08:27:36 EDT 2025 Wed Aug 27 01:53:16 EDT 2025
IsPeerReviewed	true
IsScholarly	true
Language	English
License	https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c220t-7dd04ccc0e2cd032aec6cc7b752470f1d6738796ed3e9cebca62fa08a00fe1683
ORCID	0000-0002-5134-8184 0000-0003-2952-808X 0009-0005-1007-0779 0000-0002-8304-9277
PageCount	12
ParticipantIDs	crossref_primary_10_1109_TITS_2025_3561517 ieee_primary_10981979
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2025-05-01
PublicationDateYYYYMMDD	2025-05-01
PublicationDate_xml	– month: 05 year: 2025 text: 2025-05-01 day: 01
PublicationDecade	2020
PublicationTitle	IEEE transactions on intelligent transportation systems
PublicationTitleAbbrev	TITS
PublicationYear	2025
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0014511
Score	2.4491658
Snippet	To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that...
SourceID	crossref ieee
SourceType	Index Database Publisher
StartPage	1
SubjectTerms	Angular velocity Attitude control Autonomous aerial vehicles Control systems Disturbance observers Fixed-wing UAV Mathematical models nonlinear disturbance observer nonlinear dynamic inverse Nonlinear dynamical systems Reinforcement learning Robustness TD3 Torque
Title	TD3 Agent-Based Nonlinear Dynamic Inverse Control for Fixed-Wing UAV Attitudes
URI	https://ieeexplore.ieee.org/document/10981979
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVIEE databaseName: IEEE Electronic Library (IEL) customDbUrl: eissn: 1558-0016 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0014511 issn: 1524-9050 databaseCode: RIE dateStart: 20000101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxEA22Jz34WbF-kYMnITWb7W6SY20tVbAXW-ltyU5mRYRW2i2Iv95k00oVBG_LsoEwM-FNdt68IeQqVsb68g_jJhLMRUjBFCaC2bhQNoklyGrc2-MwHYzbD5NksmpWr3phELEin2HLP1a1fDuDpf9V5k64dgAmdY3UpEpDs9Z3ycALbVXiqKLNNE_WJUy35mZ0P3pyV0GRtOLEI7j8AUIbU1UqUOnvkeF6O4FL8tZalnkLPn8pNf57v_tkd5Ve0k6IhwOyhdNDsrMhOnhEhqNeTDu-o4rdOgizdBjUMsyc9sJ4eurFN-YLpN3AY6cusaX91w_09NrpCx13nmmn9BwDi4sGGffvRt0BW01VYCAEL5m0lrcBgKMAy2NhEFIAmUtnOMmLyPo5oFKnaGPU4LlSqSgMV4bzAqNUxcekPp1N8YRQbfM816AiQNP2qYc2UZKniYFUAUSqSa7XZs7eg3hGVl06uM68TzLvk2zlkyZpeAtufBiMd_rH-zOy7ZcH8uE5qZfzJV64BKHML6vA-AIWZLbQ
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT-MwELZYOLAceCNYHusDJyR3HSeO42N5VC1bciFF3CJnPEEIqaCSSohfv3bcooKEtLcoiiNrZqJvnPnmG0JO48xYX_5h3ESCuQipWYZSMBvXmZWxAtWOe7vJ0_4oub6X97Nm9bYXBhFb8hl2_GVby7fPMPW_ytwXrh2AKf2DrMgkSWRo1_ooGniprVYeVSRMczkvYrpVf4pBcesOg0J2YukxXH2CoYW5Ki2s9DZIPt9QYJM8daZN1YH3L1qN_73jTbI-SzBpN0TEFlnC8TZZW5Ad3CF5cRnTru-pYucOxCzNg16GmdDLMKCeevmNySvSi8Bkpy61pb3HN_QE2_EDHXXvaLfxLAOLr7tk1LsqLvpsNleBgRC8YcpangAARwGWx8IgpACqUs5witeR9ZNAlU7RxqjBs6VSURueGc5rjNIs3iPL4-cx7hOqbVVVGrII0CQ--dAmklUqDaQZQJQdkLO5mcuXIJ9RtscOrkvvk9L7pJz55IDsegsuPBiM9-ub-7_Jar-4GZbDQf73kPz0rwpUxCOy3EymeOzShaY6aYPkH9lnuh0
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=TD3+Agent-Based+Nonlinear+Dynamic+Inverse+Control+for+Fixed-Wing+UAV+Attitudes&rft.jtitle=IEEE+transactions+on+intelligent+transportation+systems&rft.au=Hu%2C+Wenjun&rft.au=Wang%2C+Yujie&rft.au=Chen%2C+Qingyang&rft.au=Wang%2C+Peng&rft.date=2025-05-01&rft.issn=1524-9050&rft.eissn=1558-0016&rft.spage=1&rft.epage=12&rft_id=info:doi/10.1109%2FTITS.2025.3561517&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TITS_2025_3561517
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1524-9050&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1524-9050&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1524-9050&client=summon