Double Q-PID algorithm for mobile robot control

•A model-free reinforcement learning algorithm for adaptive low-level PID control.•Double Q-learning provides excelent performance with low computational cost.•Incremental state and action spaces discretization for fast reinforcement learning.•The Double Q-PID algorithm is effectively applied to mul...

Full description

Saved in:

Bibliographic Details
Published in	Expert systems with applications Vol. 137; pp. 292 - 307
Main Authors	Carlucho, Ignacio, De Paula, Mariano, Acosta, Gerardo G.
Format	Journal Article
Language	English
Published	New York Elsevier Ltd 15.12.2019 Elsevier BV
Subjects	Adaptive control Algorithms Applications programs Computer simulation Controllers Double Q-learning Double Q-PID Expert systems Incremental learning Machine learning Mobile robots Multi-platforms Proportional integral derivative Real time Reinforcement learning Robot control Robots Self adaptive control systems Space exploration Tuning Double Q-learning Incremental learning Multi-platforms Mobile robots Reinforcement learning Double Q-PID
Online Access	Get full text
ISSN	0957-4174 1873-6793
DOI	10.1016/j.eswa.2019.06.066

Cover

Abstract	•A model-free reinforcement learning algorithm for adaptive low-level PID control.•Double Q-learning provides excelent performance with low computational cost.•Incremental state and action spaces discretization for fast reinforcement learning.•The Double Q-PID algorithm is effectively applied to multiple robotic platforms.•The proposed algorithm effectively improves the adaptability of the PID controllers. Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems layers, developed for the tuning of the PID controllers, still require previous expert knowledge and high efficiency in algorithmic and software execution for real-time applications. To address these problems, in this paper we propose an expert agent-based system, based on a reinforcement learning agent, for self-adapting multiple low-level PID controllers in mobile robots. For the formulation of the artificial expert agent, we develop an incremental model-free algorithm version of the double Q-Learning algorithm for fast on-line adaptation of multiple low-level PID controllers. Fast learning and high on-line adaptability of the artificial expert agent is achieved by means of a proposed incremental active-learning exploration-exploitation procedure, for a non-uniform state space exploration, along with an experience replay mechanism for multiple value functions updates in the double Q-learning algorithm. A comprehensive comparative simulation study and experiments in a real mobile robot demonstrate the high performance of the proposed algorithm for a real-time simultaneous tuning of multiple adaptive low-level PID controllers of mobile robots in real world conditions.
AbstractList	•A model-free reinforcement learning algorithm for adaptive low-level PID control.•Double Q-learning provides excelent performance with low computational cost.•Incremental state and action spaces discretization for fast reinforcement learning.•The Double Q-PID algorithm is effectively applied to multiple robotic platforms.•The proposed algorithm effectively improves the adaptability of the PID controllers. Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems layers, developed for the tuning of the PID controllers, still require previous expert knowledge and high efficiency in algorithmic and software execution for real-time applications. To address these problems, in this paper we propose an expert agent-based system, based on a reinforcement learning agent, for self-adapting multiple low-level PID controllers in mobile robots. For the formulation of the artificial expert agent, we develop an incremental model-free algorithm version of the double Q-Learning algorithm for fast on-line adaptation of multiple low-level PID controllers. Fast learning and high on-line adaptability of the artificial expert agent is achieved by means of a proposed incremental active-learning exploration-exploitation procedure, for a non-uniform state space exploration, along with an experience replay mechanism for multiple value functions updates in the double Q-learning algorithm. A comprehensive comparative simulation study and experiments in a real mobile robot demonstrate the high performance of the proposed algorithm for a real-time simultaneous tuning of multiple adaptive low-level PID controllers of mobile robots in real world conditions. Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems layers, developed for the tuning of the PID controllers, still require previous expert knowledge and high efficiency in algorithmic and software execution for real-time applications. To address these problems, in this paper we propose an expert agent-based system, based on a reinforcement learning agent, for self-adapting multiple low-level PID controllers in mobile robots. For the formulation of the artificial expert agent, we develop an incremental model-free algorithm version of the double Q-Learning algorithm for fast on-line adaptation of multiple low-level PID controllers. Fast learning and high on-line adaptability of the artificial expert agent is achieved by means of a proposed incremental active-learning exploration-exploitation procedure, for a non-uniform state space exploration, along with an experience replay mechanism for multiple value functions updates in the double Q-learning algorithm. A comprehensive comparative simulation study and experiments in a real mobile robot demonstrate the high performance of the proposed algorithm for a real-time simultaneous tuning of multiple adaptive low-level PID controllers of mobile robots in real world conditions.
Author	Acosta, Gerardo G. De Paula, Mariano Carlucho, Ignacio
Author_xml	– sequence: 1 givenname: Ignacio orcidid: 0000-0002-6262-480X surname: Carlucho fullname: Carlucho, Ignacio email: ignacio.carlucho@fio.unicen.edu.ar – sequence: 2 givenname: Mariano orcidid: 0000-0001-7582-9188 surname: De Paula fullname: De Paula, Mariano email: mariano.depaula@fio.unicen.edu.ar – sequence: 3 givenname: Gerardo G. surname: Acosta fullname: Acosta, Gerardo G. email: ggacosta@fio.unicen.edu.ar
BookMark	eNp9kEtLAzEUhYMo2Fb_gKsB1zO9eUymATfS-igUVNB1yCQZzTCd1CRV_PfOUFcuCgfu4p7vPs4Unfa-twhdYSgwYD5vCxu_VUEAiwL4IH6CJnhR0ZxXgp6iCYiyyhmu2DmaxtgC4AqgmqD5yu_rzmYv-fN6lanu3QeXPrZZ40O29bUbWsHXPmXa9yn47gKdNaqL9vKvztDb_d3r8jHfPD2sl7ebXFOySLngAFoQKwg3ShuiGIVaUI1LbHCpmqZh2nLGhCkpAaJA2ZrXxiihcM20pjN0fZi7C_5zb2OSrd-HflgpCQXOKg6MDK7FwaWDjzHYRmqXVHLjrcp1EoMc45GtHOORYzwS-CA-oOQfugtuq8LPcejmANnh9S9ng4za2V5b44LVSRrvjuG_hdx_3Q
CitedBy_id	crossref_primary_10_1016_j_oceaneng_2024_117727 crossref_primary_10_1016_j_eswa_2022_117191 crossref_primary_10_21595_jmai_2023_23410 crossref_primary_10_1002_ird_2852 crossref_primary_10_1007_s12204_023_2631_x crossref_primary_10_1016_j_ifacol_2023_10_1768 crossref_primary_10_1016_j_ifacol_2022_07_517 crossref_primary_10_31466_kfbd_1564085 crossref_primary_10_1016_j_autcon_2022_104576 crossref_primary_10_3390_electronics12183925 crossref_primary_10_1007_s10015_020_00629_z crossref_primary_10_1080_10255842_2023_2241945 crossref_primary_10_3934_mbe_2023463 crossref_primary_10_3390_act11110330 crossref_primary_10_3390_w16121699 crossref_primary_10_1016_j_eswa_2023_121037 crossref_primary_10_1109_ACCESS_2020_3018026 crossref_primary_10_1002_rnc_5662 crossref_primary_10_1109_TIM_2025_3545523 crossref_primary_10_1109_TITS_2024_3382816 crossref_primary_10_3390_jmse7120443 crossref_primary_10_1016_j_oceaneng_2020_108053 crossref_primary_10_1016_j_isatra_2019_12_011 crossref_primary_10_1016_j_eswa_2020_114504 crossref_primary_10_1002_int_22345 crossref_primary_10_28979_jarnas_1119957 crossref_primary_10_1016_j_eswa_2021_114589 crossref_primary_10_1109_TIE_2024_3433532 crossref_primary_10_1007_s12555_023_0616_z crossref_primary_10_1088_1742_6596_2213_1_012005 crossref_primary_10_1016_j_ast_2025_110053 crossref_primary_10_1016_j_eswa_2022_116714 crossref_primary_10_1049_cth2_12125 crossref_primary_10_1016_j_knosys_2023_110355 crossref_primary_10_1016_j_measurement_2024_116045 crossref_primary_10_33793_acperpro_04_01_22 crossref_primary_10_3390_act12080326
Cites_doi	10.1016/j.isatra.2014.09.012 10.1002/cae.21769 10.1016/S1474-6670(17)31769-X 10.1109/87.944470 10.3390/s16091429 10.1109/TNNLS.2018.2806087 10.1016/j.eswa.2012.09.008 10.1016/j.eswa.2013.03.013 10.1038/nature14236 10.1115/1.2899252 10.1016/j.jprocont.2004.01.002 10.1007/s00773-015-0312-7 10.1007/s10846-016-0359-7 10.2478/s13540-014-0216-y 10.1016/j.eswa.2013.12.030 10.1016/0005-1098(91)90052-4 10.1016/j.mechatronics.2016.06.005 10.1016/S0019-0578(99)00020-8 10.1007/BF00872055 10.15837/ijccc.2017.4.2962 10.1016/j.neucom.2007.04.014 10.1109/TCST.2003.809244 10.1007/s00521-014-1805-8 10.1109/TSMC.1973.5408575 10.1016/j.eswa.2017.03.002 10.1115/1.2899060 10.1287/mnsc.28.1.1 10.1080/00207721.2016.1186238 10.1049/ip-d.1993.0030 10.14257/ijhit.2015.8.7.32 10.1504/IJFCM.2014.064230 10.1109/JOE.2007.893685 10.1016/j.arcontrol.2018.09.005 10.1016/j.automatica.2009.07.008 10.1016/j.eswa.2018.01.039 10.4028/www.scientific.net/AMR.590.268 10.1038/nature14539 10.1109/9.587340 10.1080/00207178908559683
ContentType	Journal Article
Copyright	2019 Elsevier Ltd Copyright Elsevier BV Dec 15, 2019
Copyright_xml	– notice: 2019 Elsevier Ltd – notice: Copyright Elsevier BV Dec 15, 2019
DBID	AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D
DOI	10.1016/j.eswa.2019.06.066
DatabaseName	CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional
DatabaseTitle	CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional
DatabaseTitleList	Computer and Information Systems Abstracts
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISSN	1873-6793
EndPage	307
ExternalDocumentID	10_1016_j_eswa_2019_06_066 S0957417419304749
GroupedDBID	--K --M .DC .~1 0R~ 13V 1B1 1RT 1~. 1~5 4.4 457 4G. 5GY 5VS 7-5 71M 8P~ 9JN 9JO AAAKF AABNK AACTN AAEDT AAEDW AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AARIN AAXUO AAYFN ABBOA ABFNM ABMAC ABMVD ABUCO ABYKQ ACDAQ ACGFS ACHRH ACNTT ACRLP ACZNC ADBBV ADEZE ADTZH AEBSH AECPX AEKER AENEX AFKWA AFTJW AGHFR AGJBL AGUBO AGUMN AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJOXV ALEQD ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD APLSM AXJTR BJAXD BKOJK BLXMC BNSAS CS3 DU5 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 F5P FDB FIRID FNPLU FYGXN G-Q GBLVA GBOLZ HAMUX IHE J1W JJJVA KOM LG9 LY1 LY7 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. PQQKQ Q38 RIG ROL RPZ SDF SDG SDP SDS SES SPC SPCBC SSB SSD SSL SST SSV SSZ T5K TN5 ~G- 29G AAAKG AAQXK AATTM AAXKI AAYWO AAYXX ABJNI ABKBG ABUFD ABWVN ABXDB ACLOT ACNNM ACRPL ACVFH ADCNI ADJOM ADMUD ADNMO AEIPS AEUPX AFJKZ AFPUW AGQPQ AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP ASPBG AVWKF AZFZN CITATION EFKBS FEDTE FGOYB G-2 HLZ HVGLF HZ~ R2- SBC SET SEW WUQ XPP ZMT ~HD 7SC 8FD AFXIZ AGCQF AGRNS JQ2 L7M L~C L~D SSH
ID	FETCH-LOGICAL-c328t-9600c92e926dacd2a430b93c151d15afff4ce6449d53202a0aeb6bdda9a1b4cc3
IEDL.DBID	.~1
ISSN	0957-4174
IngestDate	Mon Jul 14 09:26:22 EDT 2025 Thu Apr 24 22:54:16 EDT 2025 Sat Oct 25 06:20:46 EDT 2025 Fri Feb 23 02:24:25 EST 2024
IsPeerReviewed	true
IsScholarly	true
Keywords	Double Q-learning Incremental learning Multi-platforms Mobile robots Reinforcement learning Double Q-PID
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c328t-9600c92e926dacd2a430b93c151d15afff4ce6449d53202a0aeb6bdda9a1b4cc3
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ORCID	0000-0002-6262-480X 0000-0001-7582-9188
PQID	2306476042
PQPubID	2045477
PageCount	16
ParticipantIDs	proquest_journals_2306476042 crossref_citationtrail_10_1016_j_eswa_2019_06_066 crossref_primary_10_1016_j_eswa_2019_06_066 elsevier_sciencedirect_doi_10_1016_j_eswa_2019_06_066
PublicationCentury	2000
PublicationDate	2019-12-15
PublicationDateYYYYMMDD	2019-12-15
PublicationDate_xml	– month: 12 year: 2019 text: 2019-12-15 day: 15
PublicationDecade	2010
PublicationPlace	New York
PublicationPlace_xml	– name: New York
PublicationTitle	Expert systems with applications
PublicationYear	2019
Publisher	Elsevier Ltd Elsevier BV
Publisher_xml	– name: Elsevier Ltd – name: Elsevier BV
References	Pessen (bib0047) 1994; 116 Monahan (bib0039) 1982; 28 Chang, Jin (bib0016) 2013 Fatan, Sefidgari, Barenji (bib0022) 2013 Gloye, Göktekin, Egorova, Tenchio, Rojas (bib0025) 2005 Ogata (bib0042) 2010 Wei, Cohen (bib0069) 2015 Hägglund, Åström (bib0027) 1991; 27 Åström, Hägglund (bib0050) 2006 Li, He, Yin (bib0036) 2012; 590 Antonelli (bib0005) 2007; 32 Calvo-Rolle, Casteleiro-Roca, Quintián, Del Carmen Meizoso-Lopez (bib0014) 2013; 40 Arokiasami, Vadakkepa, Tan, Srinivasan (bib0008) 2016 Haykin (bib0030) 1998 Mnih, Kavukcuoglu, Silver, Rusu, Veness, Bellemare (bib0038) 2015; 518 Sood, Kumar (bib0061) 2014; 1 Zhuang, Atherton (bib0074) 1993; 140 Petit, Paulo, Carlucho, Menna, Paula (bib0048) 2016 Cohen, Bossert, Col (bib0019) 2002 Bhatnagar, Sutton, Ghavamzadeh, Lee (bib0011) 2009; 45 Kuantama, Vesselenyi, Dzitac, Tarca (bib0033) 2017; 12 Safonov, Tung-Ching Tsao (bib0053) 1997; 42 el Hakim, Hindersah, Rijanto (bib0028) 2013 Zhang, Zhiyuan, Kochenderfer (bib0077) 2017 Pan, Lai, Yang, Wu (bib0044) 2013; 40 Pan, Zaheer, White, Patterson, White (bib0046) 2018 MyungsooJun, Safonov (bib0040) 1999 Sood (bib0060) 2011 Cohen, Coon (bib0018) 1953; 75 Singh, Bisht, Padhy (bib0059) 2013 Åström, Hägglund (bib0049) 2004; 14 Sutton, Barto (bib0062) 1998 Hasselt (bib0029) 2010 Rout, Subudhi (bib0052) 2017; 48 Watkins, Dayan (bib0068) 1992; 8 Kuo (bib0034) 1995 Ziegler, Nichols (bib0076) 1993; 115 Wang, Fung, Zhang (bib0067) 1999; 38 Hernández-Alvarado, García-Valdovinos, Salgado-Jiménez, Gómez-Espinosa, Fonseca-Navarro (bib0031) 2016; 16 Liu, Pan, Xue (bib0037) 2015; 55 Chen, Billings (bib0017) 1989; 49 Ahmed, Kushsairy, Bakar, Hazry, Joyo (bib0003) 2015 Dong, Guo, Lin, Li, Wang (bib0020) 2012 Neumann, Peters (bib0041) 2008 Abu Rmilah, Hassan, Bin Mardi (bib0001) 2016; 24 Rossomando, Soria (bib0051) 2015; 26 Pan, Wang, Cheng, Yu (bib0045) 2018; 29 Ziegler, Nichols (bib0075) 1942; 64 Buçoniu, Babuska, Schutter, Ernst (bib0012) 2010 Busoniu, de Bruin, Toli, Kober, Palunko (bib0013) 2018; 46 Barbalata, De Carolis, Dunnigan, Petillot, Lane (bib0010) 2015 Tandan, Swarnkar (bib0064) 2015; 8 Acosta, Mayosky, Catalfo (bib0002) 1994; 4 LeCun, Bengio, Hinton, Y., Y., G. (bib0035) 2015; 521 Salhi, Alimi (bib0054) 2013 Bansal, Sharma, Shreeraman (bib0009) 2012; 2 Van Hasselt, Guez, Silver (bib0066) 2016 Antonelli, Chiaverini, Sarkar, West (bib0007) 2001; 9 Sharma, Rana, Kumar (bib0057) 2014; 41 Takači, Takači, Takači (bib0063) 2014; 17 Alejo, Cobano, Heredia, Ollero (bib0004) 2016; 84 Ye (bib0070) 2008; 71 Khodayari, Balochian (bib0032) 2015; 20 van Seijen, Sutton (bib0055) 2015 Timmer, Riedmiller (bib0065) 2007 Dyda, Os’kin (bib0021) 2004; 37 Zadeh (bib0071) 1973; SMC-3 Antonelli, Caccavale, Chiaverini, Fusco (bib0006) 2003; 11 Önder Efe (bib0043) 2011; 7 Fossen (bib0023) 2002 Carlucho, De Paula, Villar, Acosta (bib0015) 2017; 80 Zhang, Sutton (bib0072) 2017 Sheikhlar, Fakharian, Adhami-Mirhosseini (bib0058) 2013 Geder, Palmisano, Ramamurti, Sandberg, Ratna (bib0024) 2008 Goumagias, Hristu-Varsakelis, Assael (bib0026) 2018; 101 Zhao, Yi, Peng, Peng (bib0073) 2017 Shah, Agashe (bib0056) 2016; 38 Ye (10.1016/j.eswa.2019.06.066_bib0070) 2008; 71 Goumagias (10.1016/j.eswa.2019.06.066_bib0026) 2018; 101 Sharma (10.1016/j.eswa.2019.06.066_bib0057) 2014; 41 Önder Efe (10.1016/j.eswa.2019.06.066_bib0043) 2011; 7 Ahmed (10.1016/j.eswa.2019.06.066_bib0003) 2015 Watkins (10.1016/j.eswa.2019.06.066_bib0068) 1992; 8 Acosta (10.1016/j.eswa.2019.06.066_bib0002) 1994; 4 Geder (10.1016/j.eswa.2019.06.066_bib0024) 2008 Khodayari (10.1016/j.eswa.2019.06.066_bib0032) 2015; 20 Takači (10.1016/j.eswa.2019.06.066_bib0063) 2014; 17 Sood (10.1016/j.eswa.2019.06.066_bib0061) 2014; 1 LeCun (10.1016/j.eswa.2019.06.066_sbref0035) 2015; 521 Barbalata (10.1016/j.eswa.2019.06.066_bib0010) 2015 Kuo (10.1016/j.eswa.2019.06.066_bib0034) 1995 Monahan (10.1016/j.eswa.2019.06.066_bib0039) 1982; 28 Mnih (10.1016/j.eswa.2019.06.066_sbref0038) 2015; 518 Singh (10.1016/j.eswa.2019.06.066_bib0059) 2013 van Seijen (10.1016/j.eswa.2019.06.066_bib0055) 2015 Timmer (10.1016/j.eswa.2019.06.066_bib0065) 2007 Pessen (10.1016/j.eswa.2019.06.066_bib0047) 1994; 116 Tandan (10.1016/j.eswa.2019.06.066_bib0064) 2015; 8 Åström (10.1016/j.eswa.2019.06.066_bib0049) 2004; 14 Shah (10.1016/j.eswa.2019.06.066_bib0056) 2016; 38 Wang (10.1016/j.eswa.2019.06.066_bib0067) 1999; 38 Bansal (10.1016/j.eswa.2019.06.066_bib0009) 2012; 2 Dyda (10.1016/j.eswa.2019.06.066_bib0021) 2004; 37 el Hakim (10.1016/j.eswa.2019.06.066_bib0028) 2013 Zhang (10.1016/j.eswa.2019.06.066_bib0072) 2017 Zadeh (10.1016/j.eswa.2019.06.066_bib0071) 1973; SMC-3 Ziegler (10.1016/j.eswa.2019.06.066_bib0075) 1942; 64 Antonelli (10.1016/j.eswa.2019.06.066_bib0005) 2007; 32 Gloye (10.1016/j.eswa.2019.06.066_sbref0025) 2005 MyungsooJun (10.1016/j.eswa.2019.06.066_bib0040) 1999 Safonov (10.1016/j.eswa.2019.06.066_bib0053) 1997; 42 Cohen (10.1016/j.eswa.2019.06.066_sbref0019) 2002 Pan (10.1016/j.eswa.2019.06.066_bib0044) 2013; 40 Kuantama (10.1016/j.eswa.2019.06.066_bib0033) 2017; 12 Rout (10.1016/j.eswa.2019.06.066_bib0052) 2017; 48 Chen (10.1016/j.eswa.2019.06.066_bib0017) 1989; 49 Bhatnagar (10.1016/j.eswa.2019.06.066_bib0011) 2009; 45 Alejo (10.1016/j.eswa.2019.06.066_bib0004) 2016; 84 Carlucho (10.1016/j.eswa.2019.06.066_bib0015) 2017; 80 Fatan (10.1016/j.eswa.2019.06.066_bib0022) 2013 Salhi (10.1016/j.eswa.2019.06.066_bib0054) 2013 Sheikhlar (10.1016/j.eswa.2019.06.066_bib0058) 2013 Neumann (10.1016/j.eswa.2019.06.066_bib0041) 2008 Arokiasami (10.1016/j.eswa.2019.06.066_bib0008) 2016 Liu (10.1016/j.eswa.2019.06.066_bib0037) 2015; 55 Wei (10.1016/j.eswa.2019.06.066_bib0069) 2015 Van Hasselt (10.1016/j.eswa.2019.06.066_sbref0066) 2016 Busoniu (10.1016/j.eswa.2019.06.066_bib0013) 2018; 46 Antonelli (10.1016/j.eswa.2019.06.066_bib0006) 2003; 11 Zhuang (10.1016/j.eswa.2019.06.066_bib0074) 1993; 140 Petit (10.1016/j.eswa.2019.06.066_bib0048) 2016 Antonelli (10.1016/j.eswa.2019.06.066_bib0007) 2001; 9 Chang (10.1016/j.eswa.2019.06.066_bib0016) 2013 Zhang (10.1016/j.eswa.2019.06.066_bib0077) 2017 Hägglund (10.1016/j.eswa.2019.06.066_bib0027) 1991; 27 Rossomando (10.1016/j.eswa.2019.06.066_bib0051) 2015; 26 Sood (10.1016/j.eswa.2019.06.066_bib0060) 2011 Abu Rmilah (10.1016/j.eswa.2019.06.066_bib0001) 2016; 24 Fossen (10.1016/j.eswa.2019.06.066_bib0023) 2002 Hernández-Alvarado (10.1016/j.eswa.2019.06.066_bib0031) 2016; 16 Ziegler (10.1016/j.eswa.2019.06.066_bib0076) 1993; 115 Hasselt (10.1016/j.eswa.2019.06.066_bib0029) 2010 Li (10.1016/j.eswa.2019.06.066_sbref0036) 2012; 590 Dong (10.1016/j.eswa.2019.06.066_bib0020) 2012 Ogata (10.1016/j.eswa.2019.06.066_bib0042) 2010 Pan (10.1016/j.eswa.2019.06.066_bib0045) 2018; 29 Åström (10.1016/j.eswa.2019.06.066_bib0050) 2006 Cohen (10.1016/j.eswa.2019.06.066_bib0018) 1953; 75 Pan (10.1016/j.eswa.2019.06.066_sbref0046) 2018 Sutton (10.1016/j.eswa.2019.06.066_bib0062) 1998 Zhao (10.1016/j.eswa.2019.06.066_bib0073) 2017 Buçoniu (10.1016/j.eswa.2019.06.066_bib0012) 2010 Haykin (10.1016/j.eswa.2019.06.066_bib0030) 1998 Calvo-Rolle (10.1016/j.eswa.2019.06.066_bib0014) 2013; 40
References_xml	– volume: 75 start-page: 827 year: 1953 end-page: 834 ident: bib0018 article-title: Theoretical consideration of retarded control publication-title: Transactions of ASME – start-page: 662 year: 2013 end-page: 665 ident: bib0022 article-title: An adaptive neuro PID for controlling the altitude of quadcopter robot publication-title: Proceedings of the 18th international conference on methods & models in automation & robotics (MMAR) – year: 2006 ident: bib0050 article-title: Advanced PID control – volume: 1 start-page: 37 year: 2014 ident: bib0061 article-title: Adaptive fuzzy controller for controlling the dynamics of robot motion publication-title: International Journal of Fuzzy Computation and Modelling – start-page: 1375 year: 2016 end-page: 1381 ident: bib0008 article-title: Vector directed path generation and tracking for autonomous unmanned aerial/ ground vehicles publication-title: Proceedings of the ieee congress on evolutionary computation (CEC) – start-page: 1 year: 2013 end-page: 6 ident: bib0028 article-title: Application of reinforcement learning on self-tuning PID controller for soccer robot multi-agent system publication-title: Proceedings of the joint international conference on rural information & communication technology and electric-vehicle technology (RICT & ICEV-T) – volume: 26 start-page: 1179 year: 2015 end-page: 1191 ident: bib0051 article-title: Identification and control of nonlinear dynamics of a mobile robot in discrete time using an adaptive technique based on neural PID publication-title: Neural Computing and Applications – volume: 29 start-page: 2227 year: 2018 end-page: 2238 ident: bib0045 article-title: Multisource transfer double DQN based on actor learning publication-title: IEEE Transactions on Neural Networks and Learning Systems – volume: 28 start-page: 1 year: 1982 end-page: 16 ident: bib0039 article-title: State of the art—A survey of partially observable Markov decision processes: theory, models, and algorithms publication-title: Management Science – year: 2015 ident: bib0069 article-title: Development of a model based fuzzy-PID controller for the AeroQuad cyclone quad-copter publication-title: Proceedings of the AIAA infotech @ aerospace – year: 2002 ident: bib0019 article-title: PID and fuzzy logic pitch attitude hold systems for a fighter jet AIAA-2002-4646 PID and fuzzy logic pitch attitude hold systems for a fighter jet publication-title: Proceedings of the AIAA guidance, navigation, and control conference and exhibit – volume: 7 year: 2011 ident: bib0043 article-title: Neural network assisted computationally simple PI d control of a quadrotor UAV publication-title: IEEE Transactions on Industrial Informatics – volume: 48 start-page: 367 year: 2017 end-page: 375 ident: bib0052 article-title: Inverse optimal self-tuning PID control design for an autonomous underwater vehicle publication-title: International Journal of Systems Science – volume: 46 start-page: 8 year: 2018 end-page: 28 ident: bib0013 article-title: Reinforcement learning for control: Performance, stability, and deep approximators publication-title: Annual Reviews in Control – volume: 20 start-page: 559 year: 2015 end-page: 578 ident: bib0032 article-title: Modeling and control of autonomous underwater vehicle (AUV) in heading and depth attitude via self-adaptive fuzzy PID controller publication-title: Journal of Marine Science and Technology (Japan) – year: 2017 ident: bib0072 article-title: A deeper look at experience replay publication-title: Proceedings of the NIPS deep reinforcement learning symposium – volume: 16 year: 2016 ident: bib0031 article-title: Neural network-based self-Tuning PID control for underwater vehicles publication-title: Sensors (Basel, Switzerland) – volume: 49 start-page: 1013 year: 1989 end-page: 1032 ident: bib0017 article-title: Representations of non-linear systems: the NARMAX model publication-title: International Journal of Control – volume: 27 start-page: 599 year: 1991 end-page: 609 ident: bib0027 article-title: Industrial adaptive controllers based on frequency response techniques publication-title: Automatica – start-page: 385 year: 2011 end-page: 390 ident: bib0060 article-title: Autonomous robot motion control using fuzzy PID controller publication-title: High performance architecture and grid computing – volume: 71 start-page: 1561 year: 2008 end-page: 1565 ident: bib0070 article-title: Adaptive control of nonlinear PID-based analog neural networks for a nonholonomic mobile robot publication-title: Neurocomputing – start-page: 2094 year: 2016 end-page: 2100 ident: bib0066 article-title: Deep reinforcement learning with double Q-Learning publication-title: Proceedings of the 30th AAAI conference on artificial intelligence, AAAI – volume: 17 year: 2014 ident: bib0063 article-title: On the operational solutions of fuzzy fractional differential equations publication-title: Fractional Calculus and Applied Analysis – volume: 45 start-page: 2471 year: 2009 end-page: 2482 ident: bib0011 article-title: Natural actor-critic algorithms publication-title: Automatica – year: 2015 ident: bib0055 article-title: A deeper look at planning as learning from replay publication-title: Proceedings of the 2nd multidisciplinary conference on reinforcement learning and decision making – volume: 41 start-page: 4274 year: 2014 end-page: 4289 ident: bib0057 article-title: Performance analysis of fractional order fuzzy PID controllers applied to a robotic manipulator publication-title: Expert Systems with Applications – start-page: 160 year: 2005 end-page: 171 ident: bib0025 article-title: Learning to Drive and Simulate Autonomous Mobile Robots publication-title: Proceedings of the Robocup – start-page: 40 year: 2008 end-page: 46 ident: bib0024 article-title: Fuzzy logic PID based control design and performance for a pectoral fin propelled unmanned underwater vehicle publication-title: Proceedings of the international conference on control, automation and systems – volume: 24 start-page: 934 year: 2016 end-page: 950 ident: bib0001 article-title: A PC-based simulation platform for a quadcopter system with self-tuning fuzzy PID controllers publication-title: Computer Applications in Engineering Education – volume: 115 start-page: 220 year: 1993 ident: bib0076 article-title: Optimum settings for automatic controllers publication-title: Journal of Dynamic Systems, Measurement, and Control – volume: 11 start-page: 221 year: 2003 end-page: 232 ident: bib0006 article-title: A novel adaptive control law for underwater vehicles publication-title: IEEE Transactions on Control Systems Technology – year: 2002 ident: bib0023 article-title: Marine control systems : guidance, navigation and control of ships, rigs and underwater vehicles – volume: 8 start-page: 357 year: 2015 end-page: 362 ident: bib0064 article-title: PID controller optimization by soft computing techniques–A review publication-title: International Journal of Hybrid Information Technology – start-page: 92 year: 2013 end-page: 97 ident: bib0054 article-title: Fuzzy-PID hybrid controller for mobile robot using point cloud and low cost depth sensor publication-title: Proceedings of the international conference on individual and collective behaviors in robotics (ICBR) – volume: 40 start-page: 5188 year: 2013 end-page: 5196 ident: bib0014 article-title: A hybrid intelligent system for PID controller using in a steel rolling process publication-title: Expert Systems with Applications – start-page: 1 year: 2016 end-page: 6 ident: bib0048 article-title: Prediction of the hydrodynamic coefficients of an autonomous underwater vehicle publication-title: Proceedings of the 3rd IEEE/OES south american international symposium on oceanic engineering (SAISOE) – year: 1998 ident: bib0062 article-title: Reinforcement learning : An introduction – volume: 38 start-page: 243 year: 1999 end-page: 249 ident: bib0067 article-title: PID Tuning with exact gain and phase margins publication-title: ISA Transactions – volume: 64 start-page: 759 year: 1942 end-page: 768 ident: bib0075 article-title: Optimum settings for automatic controllers publication-title: Transactions of the ASME – volume: 55 start-page: 227 year: 2015 end-page: 233 ident: bib0037 article-title: Variable-order fuzzy fractional PID controller publication-title: ISA Transactions – start-page: 1658 year: 2015 end-page: 1663 ident: bib0010 article-title: An adaptive controller for autonomous underwater vehicles publication-title: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS) – volume: 4 start-page: 53 year: 1994 end-page: 66 ident: bib0002 article-title: An expert PID controller uses refined Ziegler and Nichols rules and fuzzy logic ideas publication-title: Applied Intelligence – volume: 80 start-page: 183 year: 2017 end-page: 199 ident: bib0015 article-title: Incremental Q-learning strategy for adaptive PID control of mobile robots publication-title: Expert Systems with Applications – volume: SMC-3 start-page: 28 year: 1973 end-page: 44 ident: bib0071 article-title: Outline of a new approach to the analysis of complex systems and decision processes publication-title: IEEE Transactions on Systems, Man, and Cybernetics – volume: 521 start-page: 436 year: 2015 end-page: 444 ident: bib0035 article-title: Deep learning publication-title: Nature – volume: 140 start-page: 216 year: 1993 ident: bib0074 article-title: Automatic tuning of optimum PID controllers publication-title: IEE Proceedings D Control Theory and Applications – volume: 38 start-page: 29 year: 2016 end-page: 41 ident: bib0056 article-title: Review of fractional PID controller publication-title: Mechatronics – start-page: 1 year: 2013 end-page: 6 ident: bib0059 article-title: Neural network based adaptive non linear PID controller for non-holonomic mobile robot publication-title: Proceedings of the international conference on control, automation, robotics and embedded systems (CARE) – start-page: 1 year: 2007 end-page: 8 ident: bib0065 article-title: Fitted Q iteration with CMACs publication-title: Proceedings of the IEEE international symposium on approximate dynamic programming and reinforcement learning – volume: 9 start-page: 756 year: 2001 end-page: 765 ident: bib0007 article-title: Adaptive control of an autonomous underwater vehicle: experimental results on ODIN publication-title: IEEE Transactions on Control Systems Technology – volume: 84 start-page: 745 year: 2016 end-page: 758 ident: bib0004 article-title: A reactive method for collision avoidance in industrial environments publication-title: Journal of Intelligent & Robotic Systems – volume: 2 start-page: 168 year: 2012 end-page: 176 ident: bib0009 article-title: PID Controller tuning techniques: A Review publication-title: Journal of Control Engineering and Technology – start-page: 898 year: 2012 end-page: 903 ident: bib0020 article-title: A neural network-based self-tuning PID controller of an autonomous underwater vehicle publication-title: Proceedings of the IEEE international conference on mechatronics and automation – volume: 101 start-page: 258 year: 2018 end-page: 270 ident: bib0026 article-title: Using deep Q-learning to understand the tax evasion behavior of risk-averse firms publication-title: Expert Systems with Applications – start-page: 540 year: 2013 end-page: 549 ident: bib0016 article-title: Adaptive tracking controller based on the PID for mobile robot path tracking publication-title: Intelligent robotics and applications – start-page: 2613 year: 2010 end-page: 2621 ident: bib0029 article-title: Double q-learning publication-title: Proceedings of the 23rd international conference on neural information processing systems - volume 2 – start-page: 867 year: 2017 end-page: 878 ident: bib0073 article-title: Design and simulation of a Self-adaptive Fuzzy-PID Controller for an autonomous underwater vehicle – volume: 42 start-page: 843 year: 1997 end-page: 847 ident: bib0053 article-title: The unfalsified control concept and learning publication-title: IEEE Transactions on Automatic Control – year: 1998 ident: bib0030 article-title: Neural networks: A comprehensive foundation – start-page: 99 year: 2015 end-page: 104 ident: bib0003 article-title: Attitude stabilization of quad-rotor (UAV) system using fuzzy PID controller (an experimental test) publication-title: Proceedings of the second international conference on computing technology and information management (ICCTIM) – start-page: 328 year: 1999 end-page: 333 ident: bib0040 article-title: Automatic PID tuning: an application of unfalsified control publication-title: Proceedings of the IEEE international symposium on computer aided control system design – year: 2010 ident: bib0042 article-title: Modern control engineering – year: 2018 ident: bib0046 article-title: Organizing experience: A deeper look at replay mechanisms for sample-based planning in continuous state domains publication-title: Technical Report – volume: 40 start-page: 1629 year: 2013 end-page: 1635 ident: bib0044 article-title: An efficient neural network approach to tracking control of an autonomous surface vehicle with unknown dynamics publication-title: Expert Systems with Applications – volume: 8 start-page: 279 year: 1992 end-page: 292 ident: bib0068 article-title: Technical note: Q-Learning publication-title: Machine Learning – start-page: 1 year: 2013 end-page: 4 ident: bib0058 article-title: Fuzzy adaptive PI control of omni-directional mobile robot publication-title: Proceedings of the 13th Iranian conference on fuzzy systems (IFSC) – volume: 116 start-page: 553 year: 1994 ident: bib0047 article-title: A new look at PID-controller tuning publication-title: Journal of Dynamic Systems, Measurement, and Control – volume: 32 start-page: 300 year: 2007 end-page: 312 ident: bib0005 article-title: On the use of adaptive/integral actions for six-degrees-of-freedom control of autonomous underwater vehicles publication-title: IEEE Journal of Oceanic Engineering – volume: 14 start-page: 635 year: 2004 end-page: 650 ident: bib0049 article-title: Revisiting the Ziegler–Nichols step response method for PID control publication-title: Journal of Process Control – volume: 12 start-page: 519 year: 2017 ident: bib0033 article-title: PID And fuzzy-PID control model for quadcopter attitude with disturbance parameter publication-title: International Journal of Computers Communications & Control – start-page: 3455 year: 2017 end-page: 3461 ident: bib0077 article-title: Weighted double Q-learning publication-title: Proceedings of the twenty-sixth international joint conference on artificial intelligence, IJCAI – volume: 590 start-page: 268 year: 2012 end-page: 271 ident: bib0036 article-title: Trajectory tracking control of mobile robot based on self-Adaptive PID combined with preview and fuzzy logic publication-title: Advanced Materials Research – year: 2010 ident: bib0012 article-title: Reinforcement learning and dynamic programming using function approximators – year: 1995 ident: bib0034 article-title: Automatic control systems – volume: 518 start-page: 529 year: 2015 end-page: 533 ident: bib0038 article-title: Human-level control through deep reinforcement learning publication-title: Nature – volume: 37 start-page: 427 year: 2004 end-page: 432 ident: bib0021 article-title: Neural network control system for underwater robots publication-title: IFAC Proceedings Volumes – start-page: 1177 year: 2008 end-page: 1184 ident: bib0041 article-title: Fitted Q-iteration by advantage weighted regression publication-title: Advances in neural information processing systems 21 – volume: 55 start-page: 227 year: 2015 ident: 10.1016/j.eswa.2019.06.066_bib0037 article-title: Variable-order fuzzy fractional PID controller publication-title: ISA Transactions doi: 10.1016/j.isatra.2014.09.012 – volume: 24 start-page: 934 issue: 6 year: 2016 ident: 10.1016/j.eswa.2019.06.066_bib0001 article-title: A PC-based simulation platform for a quadcopter system with self-tuning fuzzy PID controllers publication-title: Computer Applications in Engineering Education doi: 10.1002/cae.21769 – volume: 2 start-page: 168 issue: 4 year: 2012 ident: 10.1016/j.eswa.2019.06.066_bib0009 article-title: PID Controller tuning techniques: A Review publication-title: Journal of Control Engineering and Technology – volume: 37 start-page: 427 issue: 10 year: 2004 ident: 10.1016/j.eswa.2019.06.066_bib0021 article-title: Neural network control system for underwater robots publication-title: IFAC Proceedings Volumes doi: 10.1016/S1474-6670(17)31769-X – volume: 9 start-page: 756 issue: 5 year: 2001 ident: 10.1016/j.eswa.2019.06.066_bib0007 article-title: Adaptive control of an autonomous underwater vehicle: experimental results on ODIN publication-title: IEEE Transactions on Control Systems Technology doi: 10.1109/87.944470 – start-page: 1 year: 2016 ident: 10.1016/j.eswa.2019.06.066_bib0048 article-title: Prediction of the hydrodynamic coefficients of an autonomous underwater vehicle – volume: 16 issue: 9 year: 2016 ident: 10.1016/j.eswa.2019.06.066_bib0031 article-title: Neural network-based self-Tuning PID control for underwater vehicles publication-title: Sensors (Basel, Switzerland) doi: 10.3390/s16091429 – volume: 29 start-page: 2227 issue: 6 year: 2018 ident: 10.1016/j.eswa.2019.06.066_bib0045 article-title: Multisource transfer double DQN based on actor learning publication-title: IEEE Transactions on Neural Networks and Learning Systems doi: 10.1109/TNNLS.2018.2806087 – year: 2006 ident: 10.1016/j.eswa.2019.06.066_bib0050 – volume: 40 start-page: 1629 issue: 5 year: 2013 ident: 10.1016/j.eswa.2019.06.066_bib0044 article-title: An efficient neural network approach to tracking control of an autonomous surface vehicle with unknown dynamics publication-title: Expert Systems with Applications doi: 10.1016/j.eswa.2012.09.008 – start-page: 540 year: 2013 ident: 10.1016/j.eswa.2019.06.066_bib0016 article-title: Adaptive tracking controller based on the PID for mobile robot path tracking – start-page: 385 year: 2011 ident: 10.1016/j.eswa.2019.06.066_bib0060 article-title: Autonomous robot motion control using fuzzy PID controller – volume: 40 start-page: 5188 issue: 13 year: 2013 ident: 10.1016/j.eswa.2019.06.066_bib0014 article-title: A hybrid intelligent system for PID controller using in a steel rolling process publication-title: Expert Systems with Applications doi: 10.1016/j.eswa.2013.03.013 – start-page: 1375 year: 2016 ident: 10.1016/j.eswa.2019.06.066_bib0008 article-title: Vector directed path generation and tracking for autonomous unmanned aerial/ ground vehicles – start-page: 1177 year: 2008 ident: 10.1016/j.eswa.2019.06.066_bib0041 article-title: Fitted Q-iteration by advantage weighted regression – start-page: 99 year: 2015 ident: 10.1016/j.eswa.2019.06.066_bib0003 article-title: Attitude stabilization of quad-rotor (UAV) system using fuzzy PID controller (an experimental test) – start-page: 662 year: 2013 ident: 10.1016/j.eswa.2019.06.066_bib0022 article-title: An adaptive neuro PID for controlling the altitude of quadcopter robot – volume: 518 start-page: 529 issue: 7540 year: 2015 ident: 10.1016/j.eswa.2019.06.066_sbref0038 article-title: Human-level control through deep reinforcement learning publication-title: Nature doi: 10.1038/nature14236 – volume: 7 issue: 2 year: 2011 ident: 10.1016/j.eswa.2019.06.066_bib0043 article-title: Neural network assisted computationally simple PI d control of a quadrotor UAV publication-title: IEEE Transactions on Industrial Informatics – start-page: 3455 year: 2017 ident: 10.1016/j.eswa.2019.06.066_bib0077 article-title: Weighted double Q-learning – volume: 116 start-page: 553 issue: 3 year: 1994 ident: 10.1016/j.eswa.2019.06.066_bib0047 article-title: A new look at PID-controller tuning publication-title: Journal of Dynamic Systems, Measurement, and Control doi: 10.1115/1.2899252 – volume: 14 start-page: 635 issue: 6 year: 2004 ident: 10.1016/j.eswa.2019.06.066_bib0049 article-title: Revisiting the Ziegler–Nichols step response method for PID control publication-title: Journal of Process Control doi: 10.1016/j.jprocont.2004.01.002 – volume: 20 start-page: 559 issue: 3 year: 2015 ident: 10.1016/j.eswa.2019.06.066_bib0032 article-title: Modeling and control of autonomous underwater vehicle (AUV) in heading and depth attitude via self-adaptive fuzzy PID controller publication-title: Journal of Marine Science and Technology (Japan) doi: 10.1007/s00773-015-0312-7 – year: 2018 ident: 10.1016/j.eswa.2019.06.066_sbref0046 article-title: Organizing experience: A deeper look at replay mechanisms for sample-based planning in continuous state domains – volume: 64 start-page: 759 year: 1942 ident: 10.1016/j.eswa.2019.06.066_bib0075 article-title: Optimum settings for automatic controllers publication-title: Transactions of the ASME – volume: 84 start-page: 745 issue: 1 year: 2016 ident: 10.1016/j.eswa.2019.06.066_bib0004 article-title: A reactive method for collision avoidance in industrial environments publication-title: Journal of Intelligent & Robotic Systems doi: 10.1007/s10846-016-0359-7 – start-page: 867 year: 2017 ident: 10.1016/j.eswa.2019.06.066_bib0073 – volume: 8 start-page: 279 issue: 3/4 year: 1992 ident: 10.1016/j.eswa.2019.06.066_bib0068 article-title: Technical note: Q-Learning publication-title: Machine Learning – year: 2002 ident: 10.1016/j.eswa.2019.06.066_sbref0019 article-title: PID and fuzzy logic pitch attitude hold systems for a fighter jet AIAA-2002-4646 PID and fuzzy logic pitch attitude hold systems for a fighter jet – volume: 17 issue: 4 year: 2014 ident: 10.1016/j.eswa.2019.06.066_bib0063 article-title: On the operational solutions of fuzzy fractional differential equations publication-title: Fractional Calculus and Applied Analysis doi: 10.2478/s13540-014-0216-y – start-page: 92 year: 2013 ident: 10.1016/j.eswa.2019.06.066_bib0054 article-title: Fuzzy-PID hybrid controller for mobile robot using point cloud and low cost depth sensor – volume: 75 start-page: 827 issue: 1 year: 1953 ident: 10.1016/j.eswa.2019.06.066_bib0018 article-title: Theoretical consideration of retarded control publication-title: Transactions of ASME – volume: 41 start-page: 4274 issue: 9 year: 2014 ident: 10.1016/j.eswa.2019.06.066_bib0057 article-title: Performance analysis of fractional order fuzzy PID controllers applied to a robotic manipulator publication-title: Expert Systems with Applications doi: 10.1016/j.eswa.2013.12.030 – start-page: 1 year: 2013 ident: 10.1016/j.eswa.2019.06.066_bib0028 article-title: Application of reinforcement learning on self-tuning PID controller for soccer robot multi-agent system – volume: 27 start-page: 599 issue: 4 year: 1991 ident: 10.1016/j.eswa.2019.06.066_bib0027 article-title: Industrial adaptive controllers based on frequency response techniques publication-title: Automatica doi: 10.1016/0005-1098(91)90052-4 – volume: 38 start-page: 29 year: 2016 ident: 10.1016/j.eswa.2019.06.066_bib0056 article-title: Review of fractional PID controller publication-title: Mechatronics doi: 10.1016/j.mechatronics.2016.06.005 – volume: 38 start-page: 243 issue: 3 year: 1999 ident: 10.1016/j.eswa.2019.06.066_bib0067 article-title: PID Tuning with exact gain and phase margins publication-title: ISA Transactions doi: 10.1016/S0019-0578(99)00020-8 – volume: 4 start-page: 53 issue: 1 year: 1994 ident: 10.1016/j.eswa.2019.06.066_bib0002 article-title: An expert PID controller uses refined Ziegler and Nichols rules and fuzzy logic ideas publication-title: Applied Intelligence doi: 10.1007/BF00872055 – volume: 12 start-page: 519 issue: 4 year: 2017 ident: 10.1016/j.eswa.2019.06.066_bib0033 article-title: PID And fuzzy-PID control model for quadcopter attitude with disturbance parameter publication-title: International Journal of Computers Communications & Control doi: 10.15837/ijccc.2017.4.2962 – year: 1998 ident: 10.1016/j.eswa.2019.06.066_bib0030 – volume: 71 start-page: 1561 issue: 7 year: 2008 ident: 10.1016/j.eswa.2019.06.066_bib0070 article-title: Adaptive control of nonlinear PID-based analog neural networks for a nonholonomic mobile robot publication-title: Neurocomputing doi: 10.1016/j.neucom.2007.04.014 – volume: 11 start-page: 221 issue: 2 year: 2003 ident: 10.1016/j.eswa.2019.06.066_bib0006 article-title: A novel adaptive control law for underwater vehicles publication-title: IEEE Transactions on Control Systems Technology doi: 10.1109/TCST.2003.809244 – volume: 26 start-page: 1179 issue: 5 year: 2015 ident: 10.1016/j.eswa.2019.06.066_bib0051 article-title: Identification and control of nonlinear dynamics of a mobile robot in discrete time using an adaptive technique based on neural PID publication-title: Neural Computing and Applications doi: 10.1007/s00521-014-1805-8 – volume: SMC-3 start-page: 28 issue: 1 year: 1973 ident: 10.1016/j.eswa.2019.06.066_bib0071 article-title: Outline of a new approach to the analysis of complex systems and decision processes publication-title: IEEE Transactions on Systems, Man, and Cybernetics doi: 10.1109/TSMC.1973.5408575 – start-page: 2094 year: 2016 ident: 10.1016/j.eswa.2019.06.066_sbref0066 article-title: Deep reinforcement learning with double Q-Learning – volume: 80 start-page: 183 year: 2017 ident: 10.1016/j.eswa.2019.06.066_bib0015 article-title: Incremental Q-learning strategy for adaptive PID control of mobile robots publication-title: Expert Systems with Applications doi: 10.1016/j.eswa.2017.03.002 – start-page: 2613 year: 2010 ident: 10.1016/j.eswa.2019.06.066_bib0029 article-title: Double q-learning – start-page: 40 year: 2008 ident: 10.1016/j.eswa.2019.06.066_bib0024 article-title: Fuzzy logic PID based control design and performance for a pectoral fin propelled unmanned underwater vehicle – volume: 115 start-page: 220 issue: 2B year: 1993 ident: 10.1016/j.eswa.2019.06.066_bib0076 article-title: Optimum settings for automatic controllers publication-title: Journal of Dynamic Systems, Measurement, and Control doi: 10.1115/1.2899060 – volume: 28 start-page: 1 issue: 1 year: 1982 ident: 10.1016/j.eswa.2019.06.066_bib0039 article-title: State of the art—A survey of partially observable Markov decision processes: theory, models, and algorithms publication-title: Management Science doi: 10.1287/mnsc.28.1.1 – volume: 48 start-page: 367 issue: 2 year: 2017 ident: 10.1016/j.eswa.2019.06.066_bib0052 article-title: Inverse optimal self-tuning PID control design for an autonomous underwater vehicle publication-title: International Journal of Systems Science doi: 10.1080/00207721.2016.1186238 – volume: 140 start-page: 216 issue: 3 year: 1993 ident: 10.1016/j.eswa.2019.06.066_bib0074 article-title: Automatic tuning of optimum PID controllers publication-title: IEE Proceedings D Control Theory and Applications doi: 10.1049/ip-d.1993.0030 – year: 2010 ident: 10.1016/j.eswa.2019.06.066_bib0042 – start-page: 328 year: 1999 ident: 10.1016/j.eswa.2019.06.066_bib0040 article-title: Automatic PID tuning: an application of unfalsified control – year: 2002 ident: 10.1016/j.eswa.2019.06.066_bib0023 – volume: 8 start-page: 357 issue: 7 year: 2015 ident: 10.1016/j.eswa.2019.06.066_bib0064 article-title: PID controller optimization by soft computing techniques–A review publication-title: International Journal of Hybrid Information Technology doi: 10.14257/ijhit.2015.8.7.32 – year: 1995 ident: 10.1016/j.eswa.2019.06.066_bib0034 – year: 2017 ident: 10.1016/j.eswa.2019.06.066_bib0072 article-title: A deeper look at experience replay – start-page: 1 year: 2013 ident: 10.1016/j.eswa.2019.06.066_bib0058 article-title: Fuzzy adaptive PI control of omni-directional mobile robot – start-page: 898 year: 2012 ident: 10.1016/j.eswa.2019.06.066_bib0020 article-title: A neural network-based self-tuning PID controller of an autonomous underwater vehicle – year: 2015 ident: 10.1016/j.eswa.2019.06.066_bib0055 article-title: A deeper look at planning as learning from replay – start-page: 160 year: 2005 ident: 10.1016/j.eswa.2019.06.066_sbref0025 article-title: Learning to Drive and Simulate Autonomous Mobile Robots – year: 2010 ident: 10.1016/j.eswa.2019.06.066_bib0012 – year: 1998 ident: 10.1016/j.eswa.2019.06.066_bib0062 – volume: 1 start-page: 37 issue: 1 year: 2014 ident: 10.1016/j.eswa.2019.06.066_bib0061 article-title: Adaptive fuzzy controller for controlling the dynamics of robot motion publication-title: International Journal of Fuzzy Computation and Modelling doi: 10.1504/IJFCM.2014.064230 – start-page: 1 year: 2007 ident: 10.1016/j.eswa.2019.06.066_bib0065 article-title: Fitted Q iteration with CMACs – volume: 32 start-page: 300 issue: 2 year: 2007 ident: 10.1016/j.eswa.2019.06.066_bib0005 article-title: On the use of adaptive/integral actions for six-degrees-of-freedom control of autonomous underwater vehicles publication-title: IEEE Journal of Oceanic Engineering doi: 10.1109/JOE.2007.893685 – start-page: 1658 year: 2015 ident: 10.1016/j.eswa.2019.06.066_bib0010 article-title: An adaptive controller for autonomous underwater vehicles – year: 2015 ident: 10.1016/j.eswa.2019.06.066_bib0069 article-title: Development of a model based fuzzy-PID controller for the AeroQuad cyclone quad-copter – volume: 46 start-page: 8 year: 2018 ident: 10.1016/j.eswa.2019.06.066_bib0013 article-title: Reinforcement learning for control: Performance, stability, and deep approximators publication-title: Annual Reviews in Control doi: 10.1016/j.arcontrol.2018.09.005 – volume: 45 start-page: 2471 issue: 11 year: 2009 ident: 10.1016/j.eswa.2019.06.066_bib0011 article-title: Natural actor-critic algorithms publication-title: Automatica doi: 10.1016/j.automatica.2009.07.008 – volume: 101 start-page: 258 year: 2018 ident: 10.1016/j.eswa.2019.06.066_bib0026 article-title: Using deep Q-learning to understand the tax evasion behavior of risk-averse firms publication-title: Expert Systems with Applications doi: 10.1016/j.eswa.2018.01.039 – volume: 590 start-page: 268 year: 2012 ident: 10.1016/j.eswa.2019.06.066_sbref0036 article-title: Trajectory tracking control of mobile robot based on self-Adaptive PID combined with preview and fuzzy logic publication-title: Advanced Materials Research doi: 10.4028/www.scientific.net/AMR.590.268 – volume: 521 start-page: 436 issue: 7553 year: 2015 ident: 10.1016/j.eswa.2019.06.066_sbref0035 article-title: Deep learning publication-title: Nature doi: 10.1038/nature14539 – start-page: 1 year: 2013 ident: 10.1016/j.eswa.2019.06.066_bib0059 article-title: Neural network based adaptive non linear PID controller for non-holonomic mobile robot – volume: 42 start-page: 843 issue: 6 year: 1997 ident: 10.1016/j.eswa.2019.06.066_bib0053 article-title: The unfalsified control concept and learning publication-title: IEEE Transactions on Automatic Control doi: 10.1109/9.587340 – volume: 49 start-page: 1013 issue: 3 year: 1989 ident: 10.1016/j.eswa.2019.06.066_bib0017 article-title: Representations of non-linear systems: the NARMAX model publication-title: International Journal of Control doi: 10.1080/00207178908559683
SSID	ssj0017007
Score	2.50358
Snippet	•A model-free reinforcement learning algorithm for adaptive low-level PID control.•Double Q-learning provides excelent performance with low computational... Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems...
SourceID	proquest crossref elsevier
SourceType	Aggregation Database Enrichment Source Index Database Publisher
StartPage	292
SubjectTerms	Adaptive control Algorithms Applications programs Computer simulation Controllers Double Q-learning Double Q-PID Expert systems Incremental learning Machine learning Mobile robots Multi-platforms Proportional integral derivative Real time Reinforcement learning Robot control Robots Self adaptive control systems Space exploration Tuning
Title	Double Q-PID algorithm for mobile robot control
URI	https://dx.doi.org/10.1016/j.eswa.2019.06.066 https://www.proquest.com/docview/2306476042
Volume	137
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVESC databaseName: Baden-Württemberg Complete Freedom Collection (Elsevier) customDbUrl: eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: GBLVA dateStart: 20110101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: Elsevier ScienceDirect (LUT) customDbUrl: eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: ACRLP dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals [SCFCJ] customDbUrl: eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: AIKHN dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: Science Direct customDbUrl: eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: .~1 dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVLSH databaseName: Elsevier Journals customDbUrl: mediaType: online eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: AKRWK dateStart: 19900101 isFulltext: true providerName: Library Specific Holdings
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS8NAEF5KvXjxLT5q2YM3iclmH2mOpVpaxaJoobdlX9FK25S24s3f7m6yKSjSg5BLwm4Is7sz35BvvgHgkkqGirKPiCATkCySgRSMBiSW2EZsrA11hcIPA9YbkrsRHdVAp6qFcbRK7_tLn154a_8k9NYM5-Nx-GzBgQ2H9krdryPiivgISVwXg-uvNc3Dyc8lpd5eErjRvnCm5HiZ5afTHkJpoeFZKCX-GZx-ueki9nT3wI4HjbBdftc-qJnZAditGjJAfz4PQWjhsJwY-BQ89m-gmLzmNvV_m0ILTOE0l9YBwEUu8xX0BPUjMOzevnR6ge-IECgct1aBTTcilcYmjZkWSseC4EimWNmwrREVWZYRZSzCSbXr9xCLSBjJpNYiFUgSpfAxqM_ymTkBEEctRTVhGkkbyGxSJKiKM4UlSTPUSvQpQJUpuPJy4a5rxYRXvLB37szHnfm4I8cxdgqu1nPmpVjGxtG0sjD_seTcevON8xrVcnB_4Ja8yKQSZnfd2T9few623Z2jqiDaAPXV4sNcWMCxks1iRzXBVrt_3xt8Ax1I0ws
linkProvider	Elsevier
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07T8MwELYKDLDwRrzxwIZC4sR26hEVqgJtBaKVull-hYdKg9ogNn47duJUAiEGpEyJHUVn-7s75bvvADglkqKy7CPCyAQ4i2QgBSUBjmViPXaiDXGFwr0-7QzxzYiMGqBV18I4WqXH_grTS7T2d0JvzfDt-Tl8sMGBdYf2Yu7XEWYLYAmTOHUZ2PnnnOfh9OfSSnAvDdxwXzlTkbzM7MOJDyFWiniWUom_eqcfOF06n_Y6WPVRI7yoPmwDNMxkE6zVHRmgP6BbILTxsBwbeB_cXV9CMX7Mbe7_9AptZApfc2kRAE5zmRfQM9S3wbB9NWh1At8SIVBJ3CwCm29EisWGxVQLpWOBk0iyRFm_rRERWZZhZWyIw7Rr-BCLSBhJpdaCCSSxUskOWJzkE7MLYBI1FdGYaiStJ7NZkSAqzlQiMctQM9V7ANWm4Mrrhbu2FWNeE8NeuDMfd-bjjh1H6R44m895q9Qy_hxNagvzb2vOLZz_Oe-wXg7uT9yMl6lUSu222__na0_AcmfQ6_Ludf_2AKy4J463gsghWCym7-bIRh-FPC531xcq2tSg
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Double+Q-PID+algorithm+for+mobile+robot+control&rft.jtitle=Expert+systems+with+applications&rft.au=Carlucho%2C+Ignacio&rft.au=De+Paula%2C+Mariano&rft.au=Acosta%2C+Gerardo+G.&rft.date=2019-12-15&rft.issn=0957-4174&rft.volume=137&rft.spage=292&rft.epage=307&rft_id=info:doi/10.1016%2Fj.eswa.2019.06.066&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_eswa_2019_06_066
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0957-4174&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0957-4174&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0957-4174&client=summon