Adaptive Dynamic Programming for Stochastic Systems With State and Control Dependent Noise
In this technical note, the adaptive optimal control problem is investigated for a class of continuous-time stochastic systems subject to multiplicative noise. A novel non-model-based optimal control design methodology is employed to iteratively update the control policy on-line by using directly th...
Published in | IEEE Transactions on Automatic Control, Vol. 61, no. 12, pp. 4170-4175 |
---|---|
Main Authors | Tao Bian, Yu Jiang, Zhong-Ping Jiang |
Format | Journal Article |
Language | English |
Published | New York: IEEE, The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 01.12.2016 |
ISSN | 0018-9286 (print), 1558-2523 (electronic) |
DOI | 10.1109/TAC.2016.2550518 |
Abstract | In this technical note, the adaptive optimal control problem is investigated for a class of continuous-time stochastic systems subject to multiplicative noise. A novel non-model-based optimal control design methodology is employed to iteratively update the control policy on-line by using directly the data of the system state and input. Both adaptive dynamic programming (ADP) and robust ADP algorithms are developed, along with rigorous stability and convergence analysis. The effectiveness of the obtained methods is illustrated by an example arising from biological sensorimotor control. |
---|---|
Author | Tao Bian, Yu Jiang, Zhong-Ping Jiang |
CODEN | IETAA9 |
ContentType | Journal Article |
Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016 |
Discipline | Engineering |
EISSN | 1558-2523 |
EndPage | 4175 |
Genre | orig-research |
GrantInformation | National Science Foundation (funder ID 10.13039/100000001); grants ECCS-1101401, ECCS-1230040, ECCS-1501044 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 12 |
Language | English |
License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
PageCount | 6 |
PublicationDate | 2016-12-01 |
PublicationPlace | New York |
PublicationTitle | IEEE Transactions on Automatic Control |
PublicationTitleAbbrev | TAC |
PublicationYear | 2016 |
Publisher | IEEE; The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
StartPage | 4170 |
SubjectTerms | Adaptive control; Adaptive dynamic programming; Adaptive optimal control; Adaptive systems; Algorithm design and analysis; Algorithms; Convergence; Dynamic programming; Noise control; Optimal control; Robustness; Stability analysis; Stochastic processes; Stochastic systems |
Title | Adaptive Dynamic Programming for Stochastic Systems With State and Control Dependent Noise |
URI | https://ieeexplore.ieee.org/document/7447723 https://www.proquest.com/docview/1846534842 |
Volume | 61 |