TD3 Agent-Based Nonlinear Dynamic Inverse Control for Fixed-Wing UAV Attitudes

To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as th...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on intelligent transportation systems pp. 1 - 12
Main Authors Hu, Wenjun, Wang, Yujie, Chen, Qingyang, Wang, Peng, Wu, Erdong, Guo, Zheng, Hou, Zhongxi
Format Journal Article
LanguageEnglish
Published IEEE 01.05.2025
Subjects
Online AccessGet full text
ISSN1524-9050
1558-0016
DOI10.1109/TITS.2025.3561517

Cover

Abstract To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as the primary subject of investigation, and its attitude dynamics model is established. A control system employing a nonlinear disturbance observer within a dynamic inverse framework has been developed based on this model. Subsequently, the stability of the resulting control system is verified through Lyapunov analysis. Following this, a twin delayed deep deterministic policy gradient (TD3) agent is introduced, with the closed-loop system serving as the training environment. Through continuous interaction with its surroundings, the agent learns to dynamically adjust control parameters in response to control errors. Ultimately, the trained RL agent is utilized to optimize the control parameters for the dynamic system, and a flight simulation of the fixed-wing UAV's attitude control is conducted. The simulation results demonstrate that the control parameters can be adaptively adjusted using the TD3-NDI method, which mitigates overshoot and suppresses oscillations during the control process. These findings confirm the effectiveness and robustness of the proposed control strategy.
AbstractList To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as the primary subject of investigation, and its attitude dynamics model is established. A control system employing a nonlinear disturbance observer within a dynamic inverse framework has been developed based on this model. Subsequently, the stability of the resulting control system is verified through Lyapunov analysis. Following this, a twin delayed deep deterministic policy gradient (TD3) agent is introduced, with the closed-loop system serving as the training environment. Through continuous interaction with its surroundings, the agent learns to dynamically adjust control parameters in response to control errors. Ultimately, the trained RL agent is utilized to optimize the control parameters for the dynamic system, and a flight simulation of the fixed-wing UAV's attitude control is conducted. The simulation results demonstrate that the control parameters can be adaptively adjusted using the TD3-NDI method, which mitigates overshoot and suppresses oscillations during the control process. These findings confirm the effectiveness and robustness of the proposed control strategy.
Author Chen, Qingyang
Wang, Peng
Guo, Zheng
Wang, Yujie
Hu, Wenjun
Wu, Erdong
Hou, Zhongxi
Author_xml – sequence: 1
  givenname: Wenjun
  orcidid: 0000-0003-2952-808X
  surname: Hu
  fullname: Hu, Wenjun
  email: 15271891485@163.com
  organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China
– sequence: 2
  givenname: Yujie
  orcidid: 0000-0002-8304-9277
  surname: Wang
  fullname: Wang, Yujie
  email: yjwang@nudt.edu.cn
  organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China
– sequence: 3
  givenname: Qingyang
  orcidid: 0000-0002-5134-8184
  surname: Chen
  fullname: Chen, Qingyang
  email: chy1982_008@nudt.edu.cn
  organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China
– sequence: 4
  givenname: Peng
  orcidid: 0009-0005-1007-0779
  surname: Wang
  fullname: Wang, Peng
  email: wangp_xt@163.com
  organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China
– sequence: 5
  givenname: Erdong
  surname: Wu
  fullname: Wu, Erdong
  email: 1957468263@qq.com
  organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China
– sequence: 6
  givenname: Zheng
  surname: Guo
  fullname: Guo, Zheng
  email: guozheng@nudt.edu.cn
  organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China
– sequence: 7
  givenname: Zhongxi
  surname: Hou
  fullname: Hou, Zhongxi
  email: hzx@nudt.edu.cn
  organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China
BookMark eNpNkMFOAjEQhhujiYA-gImHvkBx2tLt9riCKAnBg4seN6WdJWuga9rVyNvLBg6e_smf-SaTb0guQxuQkDsOY87BPJSL8m0sQKixVBlXXF-QAVcqZwA8u-xnMWEGFFyTYUqfx3aiOB-QVTmTtNhi6NijTejpqg27JqCNdHYIdt84ugg_GBPSaRu62O5o3UY6b37Rs48mbOm6eKdF1zXdt8d0Q65qu0t4e84RWc-fyukLW74-L6bFkjkhoGPae5g45wCF8yCFRZc5pzf6-KWGmvtMy1ybDL1E43DjbCZqC7kFqJFnuRwRfrrrYptSxLr6is3exkPFoeqFVL2QqhdSnYUcmfsT0yDiv32Tc6ON_AP7VF5u
CODEN ITISFG
ContentType Journal Article
DBID 97E
RIA
RIE
AAYXX
CITATION
DOI 10.1109/TITS.2025.3561517
DatabaseName Accès INSA - IEEE Xplore ASPP 2005
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Electronic Library (IEL)
CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 1558-0016
EndPage 12
ExternalDocumentID 10_1109_TITS_2025_3561517
10981979
Genre orig-research
GrantInformation_xml – fundername: National Natural Science Foundation of China
  grantid: 52172410
  funderid: 10.13039/501100001809
– fundername: Natural Science Foundation of Hunan Province
  grantid: 2023JJ30631
  funderid: 10.13039/501100004735
– fundername: Major Science and Technology Innovation 2030 projects
  grantid: 2021ZD0140300
GroupedDBID -~X
0R~
29I
4.4
5GY
5VS
6IK
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABQJQ
ABVLG
ACGFO
ACGFS
ACIWK
ACNCT
AENEX
AGQYO
AHBIQ
AKJIK
AKQYR
ALMA_UNASSIGNED_HOLDINGS
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
EBS
HZ~
IFIPE
IPLJI
JAVBF
LAI
M43
O9-
OCL
P2P
PQQKQ
RIA
RIE
RNS
AAYXX
AETIX
AGSQL
AIBXA
CITATION
EJD
H~9
ZY4
ID FETCH-LOGICAL-c220t-7dd04ccc0e2cd032aec6cc7b752470f1d6738796ed3e9cebca62fa08a00fe1683
IEDL.DBID RIE
ISSN 1524-9050
IngestDate Wed Oct 01 08:27:36 EDT 2025
Wed Aug 27 01:53:16 EDT 2025
IsPeerReviewed true
IsScholarly true
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
https://doi.org/10.15223/policy-029
https://doi.org/10.15223/policy-037
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c220t-7dd04ccc0e2cd032aec6cc7b752470f1d6738796ed3e9cebca62fa08a00fe1683
ORCID 0000-0002-5134-8184
0000-0003-2952-808X
0009-0005-1007-0779
0000-0002-8304-9277
PageCount 12
ParticipantIDs crossref_primary_10_1109_TITS_2025_3561517
ieee_primary_10981979
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2025-05-01
PublicationDateYYYYMMDD 2025-05-01
PublicationDate_xml – month: 05
  year: 2025
  text: 2025-05-01
  day: 01
PublicationDecade 2020
PublicationTitle IEEE transactions on intelligent transportation systems
PublicationTitleAbbrev TITS
PublicationYear 2025
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0014511
Score 2.4491658
Snippet To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that...
SourceID crossref
ieee
SourceType Index Database
Publisher
StartPage 1
SubjectTerms Angular velocity
Attitude control
Autonomous aerial vehicles
Control systems
Disturbance observers
Fixed-wing UAV
Mathematical models
nonlinear disturbance observer
nonlinear dynamic inverse
Nonlinear dynamical systems
Reinforcement learning
Robustness
TD3
Torque
Title TD3 Agent-Based Nonlinear Dynamic Inverse Control for Fixed-Wing UAV Attitudes
URI https://ieeexplore.ieee.org/document/10981979
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE Electronic Library (IEL)
  customDbUrl:
  eissn: 1558-0016
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0014511
  issn: 1524-9050
  databaseCode: RIE
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://ieeexplore.ieee.org/
  providerName: IEEE
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxEA22Jz34WbF-kYMnITWb7W6SY20tVbAXW-ltyU5mRYRW2i2Iv95k00oVBG_LsoEwM-FNdt68IeQqVsb68g_jJhLMRUjBFCaC2bhQNoklyGrc2-MwHYzbD5NksmpWr3phELEin2HLP1a1fDuDpf9V5k64dgAmdY3UpEpDs9Z3ycALbVXiqKLNNE_WJUy35mZ0P3pyV0GRtOLEI7j8AUIbU1UqUOnvkeF6O4FL8tZalnkLPn8pNf57v_tkd5Ve0k6IhwOyhdNDsrMhOnhEhqNeTDu-o4rdOgizdBjUMsyc9sJ4eurFN-YLpN3AY6cusaX91w_09NrpCx13nmmn9BwDi4sGGffvRt0BW01VYCAEL5m0lrcBgKMAy2NhEFIAmUtnOMmLyPo5oFKnaGPU4LlSqSgMV4bzAqNUxcekPp1N8YRQbfM816AiQNP2qYc2UZKniYFUAUSqSa7XZs7eg3hGVl06uM68TzLvk2zlkyZpeAtufBiMd_rH-zOy7ZcH8uE5qZfzJV64BKHML6vA-AIWZLbQ
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT-MwELZYOLAceCNYHusDJyR3HSeO42N5VC1bciFF3CJnPEEIqaCSSohfv3bcooKEtLcoiiNrZqJvnPnmG0JO48xYX_5h3ESCuQipWYZSMBvXmZWxAtWOe7vJ0_4oub6X97Nm9bYXBhFb8hl2_GVby7fPMPW_ytwXrh2AKf2DrMgkSWRo1_ooGniprVYeVSRMczkvYrpVf4pBcesOg0J2YukxXH2CoYW5Ki2s9DZIPt9QYJM8daZN1YH3L1qN_73jTbI-SzBpN0TEFlnC8TZZW5Ad3CF5cRnTru-pYucOxCzNg16GmdDLMKCeevmNySvSi8Bkpy61pb3HN_QE2_EDHXXvaLfxLAOLr7tk1LsqLvpsNleBgRC8YcpangAARwGWx8IgpACqUs5witeR9ZNAlU7RxqjBs6VSURueGc5rjNIs3iPL4-cx7hOqbVVVGrII0CQ--dAmklUqDaQZQJQdkLO5mcuXIJ9RtscOrkvvk9L7pJz55IDsegsuPBiM9-ub-7_Jar-4GZbDQf73kPz0rwpUxCOy3EymeOzShaY6aYPkH9lnuh0
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=TD3+Agent-Based+Nonlinear+Dynamic+Inverse+Control+for+Fixed-Wing+UAV+Attitudes&rft.jtitle=IEEE+transactions+on+intelligent+transportation+systems&rft.au=Hu%2C+Wenjun&rft.au=Wang%2C+Yujie&rft.au=Chen%2C+Qingyang&rft.au=Wang%2C+Peng&rft.date=2025-05-01&rft.issn=1524-9050&rft.eissn=1558-0016&rft.spage=1&rft.epage=12&rft_id=info:doi/10.1109%2FTITS.2025.3561517&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TITS_2025_3561517
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1524-9050&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1524-9050&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1524-9050&client=summon