Adaptive Dynamic Programming for Stochastic Systems With State and Control Dependent Noise

In this technical note, the adaptive optimal control problem is investigated for a class of continuous-time stochastic systems subject to multiplicative noise. A novel non-model-based optimal control design methodology is employed to iteratively update the control policy on-line by using directly th...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on automatic control Vol. 61; no. 12; pp. 4170 - 4175
Main Authors	Bian, Tao, Jiang, Yu, Jiang, Zhong-Ping
Format	Journal Article
Language	English
Published	New York IEEE 01.12.2016 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Adaptive control Adaptive dynamic programming adaptive optimal control Adaptive systems Algorithm design and analysis Algorithms Convergence Dynamic programming Noise control Optimal control Robustness Stability analysis Stochastic processes Stochastic systems
Online Access	Get full text
ISSN	0018-9286 1558-2523
DOI	10.1109/TAC.2016.2550518

Cover

Abstract	In this technical note, the adaptive optimal control problem is investigated for a class of continuous-time stochastic systems subject to multiplicative noise. A novel non-model-based optimal control design methodology is employed to iteratively update the control policy on-line by using directly the data of the system state and input. Both adaptive dynamic programming (ADP) and robust ADP algorithms are developed, along with rigorous stability and convergence analysis. The effectiveness of the obtained methods is illustrated by an example arising from biological sensorimotor control.
AbstractList	In this technical note, the adaptive optimal control problem is investigated for a class of continuous-time stochastic systems subject to multiplicative noise. A novel non-model-based optimal control design methodology is employed to iteratively update the control policy on-line by using directly the data of the system state and input. Both adaptive dynamic programming (ADP) and robust ADP algorithms are developed, along with rigorous stability and convergence analysis. The effectiveness of the obtained methods is illustrated by an example arising from biological sensorimotor control.
Author	Tao Bian Zhong-Ping Jiang Yu Jiang
Author_xml	– sequence: 1 givenname: Tao surname: Bian fullname: Bian, Tao – sequence: 2 givenname: Yu surname: Jiang fullname: Jiang, Yu – sequence: 3 givenname: Zhong-Ping surname: Jiang fullname: Jiang, Zhong-Ping
BookMark	eNp9kM1LAzEQxYNUsK3eBS8Bz1uTbJLdPZbWLygqtCJ4WbLZ2Talm9QkFfrfu6XFgwdPw8y8N8P7DVDPOgsIXVMyopQUd4vxZMQIlSMmBBE0P0N9KkSeMMHSHuoTQvOkYLm8QIMQ1l0rOad99Dmu1Taab8DTvVWt0fjNu6VXbWvsEjfO43l0eqVC7FbzfYjQBvxh4qqbqwhY2RpPnI3ebfAUtmBrsBG_OBPgEp03ahPg6lSH6P3hfjF5Smavj8-T8SzRrKAxURkFJYBXjJCUFIoCyyshlahA5qrhhdKSpbrICqpkTSUTDZFMSw0kFTWt0iG6Pd7deve1gxDLtdt5270sac6lSHnOWaciR5X2LgQPTbn1plV-X1JSHgiWHcHyQLA8Eews8o9Fmy60OcRVZvOf8eZoNADw-yfjPMtYmv4ARaJ_Ug
CODEN	IETAA9
CitedBy_id	crossref_primary_10_1109_TASE_2019_2948431 crossref_primary_10_1049_iet_cta_2019_0934 crossref_primary_10_1016_j_amc_2019_124568 crossref_primary_10_1109_JAS_2023_123186 crossref_primary_10_1007_s11424_022_1146_0 crossref_primary_10_1109_TSMC_2023_3284612 crossref_primary_10_3390_math12101533 crossref_primary_10_1016_j_arcontrol_2022_03_005 crossref_primary_10_1016_j_neucom_2025_129758 crossref_primary_10_1002_rnc_6191 crossref_primary_10_1080_00207721_2021_1929554 crossref_primary_10_1016_j_asoc_2024_112417 crossref_primary_10_1109_TCYB_2025_3530951 crossref_primary_10_1109_TCYB_2024_3468875 crossref_primary_10_1002_rnc_6432 crossref_primary_10_1109_TNNLS_2023_3347663 crossref_primary_10_1162_neco_a_01260 crossref_primary_10_3390_photonics11100927 crossref_primary_10_1007_s12190_023_01857_9 crossref_primary_10_1002_oca_2794 crossref_primary_10_1109_TCYB_2021_3050619 crossref_primary_10_1016_j_neucom_2024_127269 crossref_primary_10_1109_TCYB_2022_3203795 crossref_primary_10_1109_ACCESS_2023_3254879 crossref_primary_10_1080_00207721_2024_2392834 crossref_primary_10_1109_TNNLS_2021_3053269 crossref_primary_10_1007_s00422_022_00922_z crossref_primary_10_1109_TCYB_2023_3320441 crossref_primary_10_1016_j_automatica_2024_111848 crossref_primary_10_1109_TNNLS_2021_3136939 crossref_primary_10_1109_TAC_2022_3172250 crossref_primary_10_1016_j_ast_2024_109446 crossref_primary_10_1109_TCNS_2021_3074256 crossref_primary_10_1109_TAC_2019_2905215 crossref_primary_10_1016_j_amc_2022_127763 crossref_primary_10_1109_TNNLS_2023_3244934 crossref_primary_10_1109_TAC_2022_3145632 crossref_primary_10_1109_TCYB_2024_3403680 crossref_primary_10_1016_j_neucom_2019_12_001 crossref_primary_10_1109_TETCI_2023_3301789 crossref_primary_10_1016_j_automatica_2018_09_028 crossref_primary_10_1016_j_ifacol_2023_10_880 crossref_primary_10_1002_acs_2862 crossref_primary_10_1007_s11424_025_4572_y crossref_primary_10_1016_j_automatica_2016_05_003 crossref_primary_10_1007_s11424_024_2421_z crossref_primary_10_1016_j_amc_2024_128803 crossref_primary_10_1007_s11768_021_00046_y crossref_primary_10_1016_j_automatica_2025_112144 crossref_primary_10_1109_LCSYS_2020_2995547 crossref_primary_10_1109_TCYB_2021_3108034 crossref_primary_10_1016_j_isatra_2020_02_019 crossref_primary_10_1016_j_automatica_2023_111490
Cites_doi	10.1109/TSMCB.2010.2043839 10.1109/TNNLS.2013.2294968 10.1109/MCAS.2009.933854 10.1016/j.ejcon.2013.05.017 10.1109/TAC.1971.1099828 10.1109/TNN.2011.2165729 10.1016/j.automatica.2014.08.023 10.1109/TNNLS.2015.2453320 10.1137/S0363012996301336 10.1523/JNEUROSCI.05-07-01688.1985 10.1523/JNEUROSCI.1110-06.2007 10.1016/j.automatica.2012.06.096 10.1016/j.sysconle.2005.07.005 10.1007/978-1-4471-4757-2 10.1109/TAC.2003.821400 10.1016/j.automatica.2016.05.003 10.1007/BF01211469 10.1016/S0167-6911(97)00008-X 10.1016/j.automatica.2007.02.024 10.1109/TAC.1969.1099303 10.1007/s00422-014-0613-7 10.1007/s00221-003-1443-3 10.1109/TAC.2015.2414811 10.1016/j.automatica.2014.10.015 10.1109/TIE.2014.2345343 10.1109/TAC.2014.2317301 10.1109/MCS.2012.2214134 10.1007/s11768-016-5117-7 10.1109/TNNLS.2015.2424971 10.1016/j.automatica.2006.08.028 10.1016/0005-1098(76)90029-7 10.1016/j.automatica.2012.05.049 10.1007/978-1-4757-6577-9
ContentType	Journal Article
Copyright	Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016
Copyright_xml	– notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2016
DBID	97E RIA RIE AAYXX CITATION 7SC 7SP 7TB 8FD FR3 JQ2 L7M L~C L~D
DOI	10.1109/TAC.2016.2550518
DatabaseName	IEEE Xplore (IEEE) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef Computer and Information Systems Abstracts Electronics & Communications Abstracts Mechanical & Transportation Engineering Abstracts Technology Research Database Engineering Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional
DatabaseTitle	CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Mechanical & Transportation Engineering Abstracts Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Engineering Research Database Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional
DatabaseTitleList	Technology Research Database
Database_xml	– sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISSN	1558-2523
EndPage	4175
ExternalDocumentID	10_1109_TAC_2016_2550518 7447723
Genre	orig-research
GrantInformation_xml	– fundername: National Science Foundation grantid: ECCS-1101401; ECCS-1230040; ECCS-1501044 funderid: 10.13039/100000001
GroupedDBID	-~X .DC 0R~ 29I 3EH 4.4 5GY 5VS 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACGFO ACGFS ACIWK ACNCT AENEX AETIX AGQYO AGSQL AHBIQ AI. AIBXA AKJIK AKQYR ALLEH ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS EJD F5P HZ~ H~9 IAAWW IBMZZ ICLAB IDIHD IFIPE IFJZH IPLJI JAVBF LAI M43 MS~ O9- OCL P2P RIA RIE RNS TAE TN5 VH1 VJK ~02 AAYXX CITATION 7SC 7SP 7TB 8FD FR3 JQ2 L7M L~C L~D RIG
ID	FETCH-LOGICAL-c291t-a71ea5e4b200309a1e28b56a5be68af49ac623c9791a6d1625f062c6ce035d1b3
IEDL.DBID	RIE
ISSN	0018-9286
IngestDate	Mon Jun 30 10:21:13 EDT 2025 Thu Apr 24 23:01:39 EDT 2025 Wed Oct 01 04:15:23 EDT 2025 Wed Aug 27 02:52:18 EDT 2025
IsDoiOpenAccess	false
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	12
Language	English
License	https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c291t-a71ea5e4b200309a1e28b56a5be68af49ac623c9791a6d1625f062c6ce035d1b3
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
PQID	1846534842
PQPubID	85475
PageCount	6
ParticipantIDs	crossref_primary_10_1109_TAC_2016_2550518 crossref_citationtrail_10_1109_TAC_2016_2550518 proquest_journals_1846534842 ieee_primary_7447723
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2016-Dec. 2016-12-00 20161201
PublicationDateYYYYMMDD	2016-12-01
PublicationDate_xml	– month: 12 year: 2016 text: 2016-Dec.
PublicationDecade	2010
PublicationPlace	New York
PublicationPlace_xml	– name: New York
PublicationTitle	IEEE transactions on automatic control
PublicationTitleAbbrev	TAC
PublicationYear	2016
Publisher	IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml	– name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References	ref35 ref13 ref34 ref12 ref37 ref15 krsti? (ref24) 1998 ref31 ref30 ref33 ref11 ref32 sutton (ref36) 1998 ref17 ref38 ref16 ref19 ref18 lewis (ref25) 2013 krsti? (ref23) 1995 ref26 flash (ref10) 1985; 5 ref20 ref41 ref22 ref21 horn (ref14) 2013 ref28 ref27 åström (ref2) 1997 ref29 ref8 ref7 werbos (ref39) 1968; 3 ref9 ref4 ref3 ref6 arnold (ref1) 1974 ref5 ref40
References_xml	– year: 2013 ident: ref14 publication-title: Matrix Analysis – volume: 3 start-page: 131 year: 1968 ident: ref39 article-title: The elements of intelligence publication-title: Cybernetica (Namur) – year: 1995 ident: ref23 publication-title: Nonlinear and Adaptive Control Design – ident: ref26 doi: 10.1109/TSMCB.2010.2043839 – ident: ref18 doi: 10.1109/TNNLS.2013.2294968 – ident: ref27 doi: 10.1109/MCAS.2009.933854 – ident: ref20 doi: 10.1016/j.ejcon.2013.05.017 – ident: ref31 doi: 10.1109/TAC.1971.1099828 – ident: ref15 doi: 10.1109/TNN.2011.2165729 – ident: ref4 doi: 10.1016/j.automatica.2014.08.023 – ident: ref34 doi: 10.1109/TNNLS.2015.2453320 – ident: ref13 doi: 10.1137/S0363012996301336 – volume: 5 start-page: 1688 year: 1985 ident: ref10 article-title: The coordination of arm movements: An experimentally confirmed mathematical model publication-title: J Neurosci doi: 10.1523/JNEUROSCI.05-07-01688.1985 – ident: ref29 doi: 10.1523/JNEUROSCI.1110-06.2007 – ident: ref16 doi: 10.1016/j.automatica.2012.06.096 – ident: ref3 doi: 10.1016/j.sysconle.2005.07.005 – ident: ref41 doi: 10.1007/978-1-4471-4757-2 – ident: ref8 doi: 10.1109/TAC.2003.821400 – ident: ref7 doi: 10.1016/j.automatica.2016.05.003 – ident: ref21 doi: 10.1007/BF01211469 – year: 1998 ident: ref36 publication-title: Reinforcement Learning An Introduction – ident: ref9 doi: 10.1016/S0167-6911(97)00008-X – ident: ref35 doi: 10.1016/j.automatica.2007.02.024 – ident: ref22 doi: 10.1109/TAC.1969.1099303 – ident: ref17 doi: 10.1007/s00422-014-0613-7 – ident: ref11 doi: 10.1007/s00221-003-1443-3 – ident: ref19 doi: 10.1109/TAC.2015.2414811 – ident: ref37 doi: 10.1016/j.automatica.2014.10.015 – ident: ref5 doi: 10.1109/TIE.2014.2345343 – ident: ref32 doi: 10.1109/TAC.2014.2317301 – ident: ref28 doi: 10.1109/MCS.2012.2214134 – ident: ref6 doi: 10.1007/s11768-016-5117-7 – ident: ref33 doi: 10.1109/TNNLS.2015.2424971 – ident: ref30 doi: 10.1016/j.automatica.2006.08.028 – year: 1974 ident: ref1 publication-title: Stochastic Differential Equations Theory and Applications – year: 2013 ident: ref25 publication-title: Reinforcement Learning and Approximate Dynamic Programming for Feedback Control – ident: ref40 doi: 10.1016/0005-1098(76)90029-7 – ident: ref38 doi: 10.1016/j.automatica.2012.05.049 – year: 1998 ident: ref24 publication-title: Stabilization of Nonlinear Uncertain Systems – ident: ref12 doi: 10.1007/978-1-4757-6577-9 – year: 1997 ident: ref2 publication-title: Adaptive Control
SSID	ssj0016441
Score	2.4934123
Snippet	In this technical note, the adaptive optimal control problem is investigated for a class of continuous-time stochastic systems subject to multiplicative noise....
SourceID	proquest crossref ieee
SourceType	Aggregation Database Enrichment Source Index Database Publisher
StartPage	4170
SubjectTerms	Adaptive control Adaptive dynamic programming adaptive optimal control Adaptive systems Algorithm design and analysis Algorithms Convergence Dynamic programming Noise control Optimal control Robustness Stability analysis Stochastic processes Stochastic systems
Title	Adaptive Dynamic Programming for Stochastic Systems With State and Control Dependent Noise
URI	https://ieeexplore.ieee.org/document/7447723 https://www.proquest.com/docview/1846534842
Volume	61
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVIEE databaseName: IEEE Xplore customDbUrl: eissn: 1558-2523 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0016441 issn: 0018-9286 databaseCode: RIE dateStart: 19630101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH7oTnrwtzidkoMXwXZNljbNcWyKCBNBRfFSkjRFUTvR7uJfb36tiIp4CyQpIS8v-dL38n0AhyrJudaSRZxWKqJcmBJmOqJSpQbAUla5ZMzJRXZ2Q8_v0rsFOG7fwmitXfKZjm3RxfLLqZrZX2V9RqkBg4NFWGSM-7dabcTAnut-1zUOTPI2JJnw_vVwZHO4sphYOG7lPb4cQU5T5cdG7E6X01WYzMflk0qe4lkjY_XxjbLxvwNfg5UAM9HQr4t1WND1Bix_IR_chPthKV7tZofGXpUeXfpcrRdTjQyWRVfNVD0IS-SMArE5un1sHpADqEjUJRr5RHc0DlK6DbqYPr7rLbg5PbkenUVBaSFShOMmEgxrkWoqiQuNCqxJLtNMpFJnuaiMCZWBSYozjkVWYnNnqpKMqMyKjaUlloNt6NTTWu8AMj6NsbT1lhvN4EPJDCyQJBeSaMZUF_rzyS9UoCG3ahjPhbuOJLww5iqsuYpgri4ctT1ePQXHH2037ey37cLEd6E3t28RfPS9MHfbLB3QnJLd33vtwZL9tk9e6UGneZvpfQNBGnng1t4n9z_WRw
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT8IwFH5BPKgHfxtR1B68mDigpVu3IwENKhATIRIvS9uVaFQgMi7-9fbHWIga461J26zp62u_7n19H8C5rIWRUoJ5ER1Jj0ZclzBTHhXS1wCWspElY3Z7QXtAb4f-sACX-VsYpZQln6mKKdpYfjKRc_OrrMoo1WCwvgKrvi4R91orjxmYk93tu9qFSZgHJWtRtd9oGhZXUCEGkBuBj6VDyKqq_NiK7flyvQXdxcgcreS1Mk9FRX5-S9r436Fvw2YGNFHDrYwdKKjxLmwspR_cg6dGwqdmu0Mtp0uP7h1b611XI41m0UM6kc_cpHJGWWpz9PiSPiMLUREfJ6jpqO6olYnppqg3eZmpfRhcX_WbbS_TWvAkiXDqcYYV9xUVxAZHOVYkFH7AfaGCkI-0EaUGSjJiEeZBgvWtaVQLiAyM3JifYFE_gOJ4MlaHgLRXYyxMvcmOphGiYBoYCBJyQRRjsgTVxeTHMktEbvQw3mJ7IalFsTZXbMwVZ-YqwUXeY-qScPzRds_Mft4um_gSlBf2jTMvncX6dhv4dRpScvR7rzNYa_e7nbhz07s7hnXzHUdlKUMx_ZirEw1IUnFq1-EXVbrZkg
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Adaptive+Dynamic+Programming+for+Stochastic+Systems+With+State+and+Control+Dependent+Noise&rft.jtitle=IEEE+transactions+on+automatic+control&rft.au=Bian%2C+Tao&rft.au=Jiang%2C+Yu&rft.au=Zhong-Ping%2C+Jiang&rft.date=2016-12-01&rft.pub=The+Institute+of+Electrical+and+Electronics+Engineers%2C+Inc.+%28IEEE%29&rft.issn=0018-9286&rft.eissn=1558-2523&rft.volume=61&rft.issue=12&rft.spage=4170&rft_id=info:doi/10.1109%2FTAC.2016.2550518&rft.externalDBID=NO_FULL_TEXT
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0018-9286&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0018-9286&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0018-9286&client=summon