TD3 Agent-Based Nonlinear Dynamic Inverse Control for Fixed-Wing UAV Attitudes
To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as th...
Saved in:
| Published in | IEEE transactions on intelligent transportation systems pp. 1 - 12 |
|---|---|
| Main Authors | , , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
IEEE
01.05.2025
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 1524-9050 1558-0016 |
| DOI | 10.1109/TITS.2025.3561517 |
Cover
| Abstract | To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as the primary subject of investigation, and its attitude dynamics model is established. A control system employing a nonlinear disturbance observer within a dynamic inverse framework has been developed based on this model. Subsequently, the stability of the resulting control system is verified through Lyapunov analysis. Following this, a twin delayed deep deterministic policy gradient (TD3) agent is introduced, with the closed-loop system serving as the training environment. Through continuous interaction with its surroundings, the agent learns to dynamically adjust control parameters in response to control errors. Ultimately, the trained RL agent is utilized to optimize the control parameters for the dynamic system, and a flight simulation of the fixed-wing UAV's attitude control is conducted. The simulation results demonstrate that the control parameters can be adaptively adjusted using the TD3-NDI method, which mitigates overshoot and suppresses oscillations during the control process. These findings confirm the effectiveness and robustness of the proposed control strategy. |
|---|---|
| AbstractList | To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that integrates a reinforcement learning (RL) agent with the NDI approach. Initially, a fixed-wing unmanned aerial vehicle (UAV) is selected as the primary subject of investigation, and its attitude dynamics model is established. A control system employing a nonlinear disturbance observer within a dynamic inverse framework has been developed based on this model. Subsequently, the stability of the resulting control system is verified through Lyapunov analysis. Following this, a twin delayed deep deterministic policy gradient (TD3) agent is introduced, with the closed-loop system serving as the training environment. Through continuous interaction with its surroundings, the agent learns to dynamically adjust control parameters in response to control errors. Ultimately, the trained RL agent is utilized to optimize the control parameters for the dynamic system, and a flight simulation of the fixed-wing UAV's attitude control is conducted. The simulation results demonstrate that the control parameters can be adaptively adjusted using the TD3-NDI method, which mitigates overshoot and suppresses oscillations during the control process. These findings confirm the effectiveness and robustness of the proposed control strategy. |
| Author | Chen, Qingyang Wang, Peng Guo, Zheng Wang, Yujie Hu, Wenjun Wu, Erdong Hou, Zhongxi |
| Author_xml | – sequence: 1 givenname: Wenjun orcidid: 0000-0003-2952-808X surname: Hu fullname: Hu, Wenjun email: 15271891485@163.com organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 2 givenname: Yujie orcidid: 0000-0002-8304-9277 surname: Wang fullname: Wang, Yujie email: yjwang@nudt.edu.cn organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 3 givenname: Qingyang orcidid: 0000-0002-5134-8184 surname: Chen fullname: Chen, Qingyang email: chy1982_008@nudt.edu.cn organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 4 givenname: Peng orcidid: 0009-0005-1007-0779 surname: Wang fullname: Wang, Peng email: wangp_xt@163.com organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 5 givenname: Erdong surname: Wu fullname: Wu, Erdong email: 1957468263@qq.com organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 6 givenname: Zheng surname: Guo fullname: Guo, Zheng email: guozheng@nudt.edu.cn organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China – sequence: 7 givenname: Zhongxi surname: Hou fullname: Hou, Zhongxi email: hzx@nudt.edu.cn organization: College of Aerospace Science and Engineering, National University of Defense Technology, Changsha, China |
| BookMark | eNpNkMFOAjEQhhujiYA-gImHvkBx2tLt9riCKAnBg4seN6WdJWuga9rVyNvLBg6e_smf-SaTb0guQxuQkDsOY87BPJSL8m0sQKixVBlXXF-QAVcqZwA8u-xnMWEGFFyTYUqfx3aiOB-QVTmTtNhi6NijTejpqg27JqCNdHYIdt84ugg_GBPSaRu62O5o3UY6b37Rs48mbOm6eKdF1zXdt8d0Q65qu0t4e84RWc-fyukLW74-L6bFkjkhoGPae5g45wCF8yCFRZc5pzf6-KWGmvtMy1ybDL1E43DjbCZqC7kFqJFnuRwRfrrrYptSxLr6is3exkPFoeqFVL2QqhdSnYUcmfsT0yDiv32Tc6ON_AP7VF5u |
| CODEN | ITISFG |
| ContentType | Journal Article |
| DBID | 97E RIA RIE AAYXX CITATION |
| DOI | 10.1109/TITS.2025.3561517 |
| DatabaseName | Accès INSA - IEEE Xplore ASPP 2005 IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering |
| EISSN | 1558-0016 |
| EndPage | 12 |
| ExternalDocumentID | 10_1109_TITS_2025_3561517 10981979 |
| Genre | orig-research |
| GrantInformation_xml | – fundername: National Natural Science Foundation of China grantid: 52172410 funderid: 10.13039/501100001809 – fundername: Natural Science Foundation of Hunan Province grantid: 2023JJ30631 funderid: 10.13039/501100004735 – fundername: Major Science and Technology Innovation 2030 projects grantid: 2021ZD0140300 |
| GroupedDBID | -~X 0R~ 29I 4.4 5GY 5VS 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACGFO ACGFS ACIWK ACNCT AENEX AGQYO AHBIQ AKJIK AKQYR ALMA_UNASSIGNED_HOLDINGS ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS HZ~ IFIPE IPLJI JAVBF LAI M43 O9- OCL P2P PQQKQ RIA RIE RNS AAYXX AETIX AGSQL AIBXA CITATION EJD H~9 ZY4 |
| ID | FETCH-LOGICAL-c220t-7dd04ccc0e2cd032aec6cc7b752470f1d6738796ed3e9cebca62fa08a00fe1683 |
| IEDL.DBID | RIE |
| ISSN | 1524-9050 |
| IngestDate | Wed Oct 01 08:27:36 EDT 2025 Wed Aug 27 01:53:16 EDT 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Language | English |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c220t-7dd04ccc0e2cd032aec6cc7b752470f1d6738796ed3e9cebca62fa08a00fe1683 |
| ORCID | 0000-0002-5134-8184 0000-0003-2952-808X 0009-0005-1007-0779 0000-0002-8304-9277 |
| PageCount | 12 |
| ParticipantIDs | crossref_primary_10_1109_TITS_2025_3561517 ieee_primary_10981979 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2025-05-01 |
| PublicationDateYYYYMMDD | 2025-05-01 |
| PublicationDate_xml | – month: 05 year: 2025 text: 2025-05-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationTitle | IEEE transactions on intelligent transportation systems |
| PublicationTitleAbbrev | TITS |
| PublicationYear | 2025 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0014511 |
| Score | 2.4491658 |
| Snippet | To enhance the robustness of the nonlinear dynamic inverse (NDI) technique in the presence of model uncertainties, this study introduces a control scheme that... |
| SourceID | crossref ieee |
| SourceType | Index Database Publisher |
| StartPage | 1 |
| SubjectTerms | Angular velocity Attitude control Autonomous aerial vehicles Control systems Disturbance observers Fixed-wing UAV Mathematical models nonlinear disturbance observer nonlinear dynamic inverse Nonlinear dynamical systems Reinforcement learning Robustness TD3 Torque |
| Title | TD3 Agent-Based Nonlinear Dynamic Inverse Control for Fixed-Wing UAV Attitudes |
| URI | https://ieeexplore.ieee.org/document/10981979 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE Electronic Library (IEL) customDbUrl: eissn: 1558-0016 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0014511 issn: 1524-9050 databaseCode: RIE dateStart: 20000101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxEA22Jz34WbF-kYMnITWb7W6SY20tVbAXW-ltyU5mRYRW2i2Iv95k00oVBG_LsoEwM-FNdt68IeQqVsb68g_jJhLMRUjBFCaC2bhQNoklyGrc2-MwHYzbD5NksmpWr3phELEin2HLP1a1fDuDpf9V5k64dgAmdY3UpEpDs9Z3ycALbVXiqKLNNE_WJUy35mZ0P3pyV0GRtOLEI7j8AUIbU1UqUOnvkeF6O4FL8tZalnkLPn8pNf57v_tkd5Ve0k6IhwOyhdNDsrMhOnhEhqNeTDu-o4rdOgizdBjUMsyc9sJ4eurFN-YLpN3AY6cusaX91w_09NrpCx13nmmn9BwDi4sGGffvRt0BW01VYCAEL5m0lrcBgKMAy2NhEFIAmUtnOMmLyPo5oFKnaGPU4LlSqSgMV4bzAqNUxcekPp1N8YRQbfM816AiQNP2qYc2UZKniYFUAUSqSa7XZs7eg3hGVl06uM68TzLvk2zlkyZpeAtufBiMd_rH-zOy7ZcH8uE5qZfzJV64BKHML6vA-AIWZLbQ |
| linkProvider | IEEE |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT-MwELZYOLAceCNYHusDJyR3HSeO42N5VC1bciFF3CJnPEEIqaCSSohfv3bcooKEtLcoiiNrZqJvnPnmG0JO48xYX_5h3ESCuQipWYZSMBvXmZWxAtWOe7vJ0_4oub6X97Nm9bYXBhFb8hl2_GVby7fPMPW_ytwXrh2AKf2DrMgkSWRo1_ooGniprVYeVSRMczkvYrpVf4pBcesOg0J2YukxXH2CoYW5Ki2s9DZIPt9QYJM8daZN1YH3L1qN_73jTbI-SzBpN0TEFlnC8TZZW5Ad3CF5cRnTru-pYucOxCzNg16GmdDLMKCeevmNySvSi8Bkpy61pb3HN_QE2_EDHXXvaLfxLAOLr7tk1LsqLvpsNleBgRC8YcpangAARwGWx8IgpACqUs5witeR9ZNAlU7RxqjBs6VSURueGc5rjNIs3iPL4-cx7hOqbVVVGrII0CQ--dAmklUqDaQZQJQdkLO5mcuXIJ9RtscOrkvvk9L7pJz55IDsegsuPBiM9-ub-7_Jar-4GZbDQf73kPz0rwpUxCOy3EymeOzShaY6aYPkH9lnuh0 |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=TD3+Agent-Based+Nonlinear+Dynamic+Inverse+Control+for+Fixed-Wing+UAV+Attitudes&rft.jtitle=IEEE+transactions+on+intelligent+transportation+systems&rft.au=Hu%2C+Wenjun&rft.au=Wang%2C+Yujie&rft.au=Chen%2C+Qingyang&rft.au=Wang%2C+Peng&rft.date=2025-05-01&rft.issn=1524-9050&rft.eissn=1558-0016&rft.spage=1&rft.epage=12&rft_id=info:doi/10.1109%2FTITS.2025.3561517&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TITS_2025_3561517 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1524-9050&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1524-9050&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1524-9050&client=summon |