Double Q-PID algorithm for mobile robot control

•A model-free reinforcement learning algorithm for adaptive low-level PID control.•Double Q-learning provides excelent performance with low computational cost.•Incremental state and action spaces discretization for fast reinforcement learning.•The Double Q-PID algorithm is effectively applied to mul...

Full description

Saved in:
Bibliographic Details
Published inExpert systems with applications Vol. 137; pp. 292 - 307
Main Authors Carlucho, Ignacio, De Paula, Mariano, Acosta, Gerardo G.
Format Journal Article
LanguageEnglish
Published New York Elsevier Ltd 15.12.2019
Elsevier BV
Subjects
Online AccessGet full text
ISSN0957-4174
1873-6793
DOI10.1016/j.eswa.2019.06.066

Cover

Abstract •A model-free reinforcement learning algorithm for adaptive low-level PID control.•Double Q-learning provides excelent performance with low computational cost.•Incremental state and action spaces discretization for fast reinforcement learning.•The Double Q-PID algorithm is effectively applied to multiple robotic platforms.•The proposed algorithm effectively improves the adaptability of the PID controllers. Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems layers, developed for the tuning of the PID controllers, still require previous expert knowledge and high efficiency in algorithmic and software execution for real-time applications. To address these problems, in this paper we propose an expert agent-based system, based on a reinforcement learning agent, for self-adapting multiple low-level PID controllers in mobile robots. For the formulation of the artificial expert agent, we develop an incremental model-free algorithm version of the double Q-Learning algorithm for fast on-line adaptation of multiple low-level PID controllers. Fast learning and high on-line adaptability of the artificial expert agent is achieved by means of a proposed incremental active-learning exploration-exploitation procedure, for a non-uniform state space exploration, along with an experience replay mechanism for multiple value functions updates in the double Q-learning algorithm. A comprehensive comparative simulation study and experiments in a real mobile robot demonstrate the high performance of the proposed algorithm for a real-time simultaneous tuning of multiple adaptive low-level PID controllers of mobile robots in real world conditions.
AbstractList •A model-free reinforcement learning algorithm for adaptive low-level PID control.•Double Q-learning provides excelent performance with low computational cost.•Incremental state and action spaces discretization for fast reinforcement learning.•The Double Q-PID algorithm is effectively applied to multiple robotic platforms.•The proposed algorithm effectively improves the adaptability of the PID controllers. Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems layers, developed for the tuning of the PID controllers, still require previous expert knowledge and high efficiency in algorithmic and software execution for real-time applications. To address these problems, in this paper we propose an expert agent-based system, based on a reinforcement learning agent, for self-adapting multiple low-level PID controllers in mobile robots. For the formulation of the artificial expert agent, we develop an incremental model-free algorithm version of the double Q-Learning algorithm for fast on-line adaptation of multiple low-level PID controllers. Fast learning and high on-line adaptability of the artificial expert agent is achieved by means of a proposed incremental active-learning exploration-exploitation procedure, for a non-uniform state space exploration, along with an experience replay mechanism for multiple value functions updates in the double Q-learning algorithm. A comprehensive comparative simulation study and experiments in a real mobile robot demonstrate the high performance of the proposed algorithm for a real-time simultaneous tuning of multiple adaptive low-level PID controllers of mobile robots in real world conditions.
Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems layers, developed for the tuning of the PID controllers, still require previous expert knowledge and high efficiency in algorithmic and software execution for real-time applications. To address these problems, in this paper we propose an expert agent-based system, based on a reinforcement learning agent, for self-adapting multiple low-level PID controllers in mobile robots. For the formulation of the artificial expert agent, we develop an incremental model-free algorithm version of the double Q-Learning algorithm for fast on-line adaptation of multiple low-level PID controllers. Fast learning and high on-line adaptability of the artificial expert agent is achieved by means of a proposed incremental active-learning exploration-exploitation procedure, for a non-uniform state space exploration, along with an experience replay mechanism for multiple value functions updates in the double Q-learning algorithm. A comprehensive comparative simulation study and experiments in a real mobile robot demonstrate the high performance of the proposed algorithm for a real-time simultaneous tuning of multiple adaptive low-level PID controllers of mobile robots in real world conditions.
Author Acosta, Gerardo G.
De Paula, Mariano
Carlucho, Ignacio
Author_xml – sequence: 1
  givenname: Ignacio
  orcidid: 0000-0002-6262-480X
  surname: Carlucho
  fullname: Carlucho, Ignacio
  email: ignacio.carlucho@fio.unicen.edu.ar
– sequence: 2
  givenname: Mariano
  orcidid: 0000-0001-7582-9188
  surname: De Paula
  fullname: De Paula, Mariano
  email: mariano.depaula@fio.unicen.edu.ar
– sequence: 3
  givenname: Gerardo G.
  surname: Acosta
  fullname: Acosta, Gerardo G.
  email: ggacosta@fio.unicen.edu.ar
BookMark eNp9kEtLAzEUhYMo2Fb_gKsB1zO9eUymATfS-igUVNB1yCQZzTCd1CRV_PfOUFcuCgfu4p7vPs4Unfa-twhdYSgwYD5vCxu_VUEAiwL4IH6CJnhR0ZxXgp6iCYiyyhmu2DmaxtgC4AqgmqD5yu_rzmYv-fN6lanu3QeXPrZZ40O29bUbWsHXPmXa9yn47gKdNaqL9vKvztDb_d3r8jHfPD2sl7ebXFOySLngAFoQKwg3ShuiGIVaUI1LbHCpmqZh2nLGhCkpAaJA2ZrXxiihcM20pjN0fZi7C_5zb2OSrd-HflgpCQXOKg6MDK7FwaWDjzHYRmqXVHLjrcp1EoMc45GtHOORYzwS-CA-oOQfugtuq8LPcejmANnh9S9ng4za2V5b44LVSRrvjuG_hdx_3Q
CitedBy_id crossref_primary_10_1016_j_oceaneng_2024_117727
crossref_primary_10_1016_j_eswa_2022_117191
crossref_primary_10_21595_jmai_2023_23410
crossref_primary_10_1002_ird_2852
crossref_primary_10_1007_s12204_023_2631_x
crossref_primary_10_1016_j_ifacol_2023_10_1768
crossref_primary_10_1016_j_ifacol_2022_07_517
crossref_primary_10_31466_kfbd_1564085
crossref_primary_10_1016_j_autcon_2022_104576
crossref_primary_10_3390_electronics12183925
crossref_primary_10_1007_s10015_020_00629_z
crossref_primary_10_1080_10255842_2023_2241945
crossref_primary_10_3934_mbe_2023463
crossref_primary_10_3390_act11110330
crossref_primary_10_3390_w16121699
crossref_primary_10_1016_j_eswa_2023_121037
crossref_primary_10_1109_ACCESS_2020_3018026
crossref_primary_10_1002_rnc_5662
crossref_primary_10_1109_TIM_2025_3545523
crossref_primary_10_1109_TITS_2024_3382816
crossref_primary_10_3390_jmse7120443
crossref_primary_10_1016_j_oceaneng_2020_108053
crossref_primary_10_1016_j_isatra_2019_12_011
crossref_primary_10_1016_j_eswa_2020_114504
crossref_primary_10_1002_int_22345
crossref_primary_10_28979_jarnas_1119957
crossref_primary_10_1016_j_eswa_2021_114589
crossref_primary_10_1109_TIE_2024_3433532
crossref_primary_10_1007_s12555_023_0616_z
crossref_primary_10_1088_1742_6596_2213_1_012005
crossref_primary_10_1016_j_ast_2025_110053
crossref_primary_10_1016_j_eswa_2022_116714
crossref_primary_10_1049_cth2_12125
crossref_primary_10_1016_j_knosys_2023_110355
crossref_primary_10_1016_j_measurement_2024_116045
crossref_primary_10_33793_acperpro_04_01_22
crossref_primary_10_3390_act12080326
Cites_doi 10.1016/j.isatra.2014.09.012
10.1002/cae.21769
10.1016/S1474-6670(17)31769-X
10.1109/87.944470
10.3390/s16091429
10.1109/TNNLS.2018.2806087
10.1016/j.eswa.2012.09.008
10.1016/j.eswa.2013.03.013
10.1038/nature14236
10.1115/1.2899252
10.1016/j.jprocont.2004.01.002
10.1007/s00773-015-0312-7
10.1007/s10846-016-0359-7
10.2478/s13540-014-0216-y
10.1016/j.eswa.2013.12.030
10.1016/0005-1098(91)90052-4
10.1016/j.mechatronics.2016.06.005
10.1016/S0019-0578(99)00020-8
10.1007/BF00872055
10.15837/ijccc.2017.4.2962
10.1016/j.neucom.2007.04.014
10.1109/TCST.2003.809244
10.1007/s00521-014-1805-8
10.1109/TSMC.1973.5408575
10.1016/j.eswa.2017.03.002
10.1115/1.2899060
10.1287/mnsc.28.1.1
10.1080/00207721.2016.1186238
10.1049/ip-d.1993.0030
10.14257/ijhit.2015.8.7.32
10.1504/IJFCM.2014.064230
10.1109/JOE.2007.893685
10.1016/j.arcontrol.2018.09.005
10.1016/j.automatica.2009.07.008
10.1016/j.eswa.2018.01.039
10.4028/www.scientific.net/AMR.590.268
10.1038/nature14539
10.1109/9.587340
10.1080/00207178908559683
ContentType Journal Article
Copyright 2019 Elsevier Ltd
Copyright Elsevier BV Dec 15, 2019
Copyright_xml – notice: 2019 Elsevier Ltd
– notice: Copyright Elsevier BV Dec 15, 2019
DBID AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1016/j.eswa.2019.06.066
DatabaseName CrossRef
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList
Computer and Information Systems Abstracts
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1873-6793
EndPage 307
ExternalDocumentID 10_1016_j_eswa_2019_06_066
S0957417419304749
GroupedDBID --K
--M
.DC
.~1
0R~
13V
1B1
1RT
1~.
1~5
4.4
457
4G.
5GY
5VS
7-5
71M
8P~
9JN
9JO
AAAKF
AABNK
AACTN
AAEDT
AAEDW
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AARIN
AAXUO
AAYFN
ABBOA
ABFNM
ABMAC
ABMVD
ABUCO
ABYKQ
ACDAQ
ACGFS
ACHRH
ACNTT
ACRLP
ACZNC
ADBBV
ADEZE
ADTZH
AEBSH
AECPX
AEKER
AENEX
AFKWA
AFTJW
AGHFR
AGJBL
AGUBO
AGUMN
AGYEJ
AHHHB
AHJVU
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJOXV
ALEQD
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
APLSM
AXJTR
BJAXD
BKOJK
BLXMC
BNSAS
CS3
DU5
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FIRID
FNPLU
FYGXN
G-Q
GBLVA
GBOLZ
HAMUX
IHE
J1W
JJJVA
KOM
LG9
LY1
LY7
M41
MO0
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
PQQKQ
Q38
RIG
ROL
RPZ
SDF
SDG
SDP
SDS
SES
SPC
SPCBC
SSB
SSD
SSL
SST
SSV
SSZ
T5K
TN5
~G-
29G
AAAKG
AAQXK
AATTM
AAXKI
AAYWO
AAYXX
ABJNI
ABKBG
ABUFD
ABWVN
ABXDB
ACLOT
ACNNM
ACRPL
ACVFH
ADCNI
ADJOM
ADMUD
ADNMO
AEIPS
AEUPX
AFJKZ
AFPUW
AGQPQ
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
ASPBG
AVWKF
AZFZN
CITATION
EFKBS
FEDTE
FGOYB
G-2
HLZ
HVGLF
HZ~
R2-
SBC
SET
SEW
WUQ
XPP
ZMT
~HD
7SC
8FD
AFXIZ
AGCQF
AGRNS
JQ2
L7M
L~C
L~D
SSH
ID FETCH-LOGICAL-c328t-9600c92e926dacd2a430b93c151d15afff4ce6449d53202a0aeb6bdda9a1b4cc3
IEDL.DBID .~1
ISSN 0957-4174
IngestDate Mon Jul 14 09:26:22 EDT 2025
Thu Apr 24 22:54:16 EDT 2025
Sat Oct 25 06:20:46 EDT 2025
Fri Feb 23 02:24:25 EST 2024
IsPeerReviewed true
IsScholarly true
Keywords Double Q-learning
Incremental learning
Multi-platforms
Mobile robots
Reinforcement learning
Double Q-PID
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c328t-9600c92e926dacd2a430b93c151d15afff4ce6449d53202a0aeb6bdda9a1b4cc3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ORCID 0000-0002-6262-480X
0000-0001-7582-9188
PQID 2306476042
PQPubID 2045477
PageCount 16
ParticipantIDs proquest_journals_2306476042
crossref_citationtrail_10_1016_j_eswa_2019_06_066
crossref_primary_10_1016_j_eswa_2019_06_066
elsevier_sciencedirect_doi_10_1016_j_eswa_2019_06_066
PublicationCentury 2000
PublicationDate 2019-12-15
PublicationDateYYYYMMDD 2019-12-15
PublicationDate_xml – month: 12
  year: 2019
  text: 2019-12-15
  day: 15
PublicationDecade 2010
PublicationPlace New York
PublicationPlace_xml – name: New York
PublicationTitle Expert systems with applications
PublicationYear 2019
Publisher Elsevier Ltd
Elsevier BV
Publisher_xml – name: Elsevier Ltd
– name: Elsevier BV
References Pessen (bib0047) 1994; 116
Monahan (bib0039) 1982; 28
Chang, Jin (bib0016) 2013
Fatan, Sefidgari, Barenji (bib0022) 2013
Gloye, Göktekin, Egorova, Tenchio, Rojas (bib0025) 2005
Ogata (bib0042) 2010
Wei, Cohen (bib0069) 2015
Hägglund, Åström (bib0027) 1991; 27
Åström, Hägglund (bib0050) 2006
Li, He, Yin (bib0036) 2012; 590
Antonelli (bib0005) 2007; 32
Calvo-Rolle, Casteleiro-Roca, Quintián, Del Carmen Meizoso-Lopez (bib0014) 2013; 40
Arokiasami, Vadakkepa, Tan, Srinivasan (bib0008) 2016
Haykin (bib0030) 1998
Mnih, Kavukcuoglu, Silver, Rusu, Veness, Bellemare (bib0038) 2015; 518
Sood, Kumar (bib0061) 2014; 1
Zhuang, Atherton (bib0074) 1993; 140
Petit, Paulo, Carlucho, Menna, Paula (bib0048) 2016
Cohen, Bossert, Col (bib0019) 2002
Bhatnagar, Sutton, Ghavamzadeh, Lee (bib0011) 2009; 45
Kuantama, Vesselenyi, Dzitac, Tarca (bib0033) 2017; 12
Safonov, Tung-Ching Tsao (bib0053) 1997; 42
el Hakim, Hindersah, Rijanto (bib0028) 2013
Zhang, Zhiyuan, Kochenderfer (bib0077) 2017
Pan, Lai, Yang, Wu (bib0044) 2013; 40
Pan, Zaheer, White, Patterson, White (bib0046) 2018
MyungsooJun, Safonov (bib0040) 1999
Sood (bib0060) 2011
Cohen, Coon (bib0018) 1953; 75
Singh, Bisht, Padhy (bib0059) 2013
Åström, Hägglund (bib0049) 2004; 14
Sutton, Barto (bib0062) 1998
Hasselt (bib0029) 2010
Rout, Subudhi (bib0052) 2017; 48
Watkins, Dayan (bib0068) 1992; 8
Kuo (bib0034) 1995
Ziegler, Nichols (bib0076) 1993; 115
Wang, Fung, Zhang (bib0067) 1999; 38
Hernández-Alvarado, García-Valdovinos, Salgado-Jiménez, Gómez-Espinosa, Fonseca-Navarro (bib0031) 2016; 16
Liu, Pan, Xue (bib0037) 2015; 55
Chen, Billings (bib0017) 1989; 49
Ahmed, Kushsairy, Bakar, Hazry, Joyo (bib0003) 2015
Dong, Guo, Lin, Li, Wang (bib0020) 2012
Neumann, Peters (bib0041) 2008
Abu Rmilah, Hassan, Bin Mardi (bib0001) 2016; 24
Rossomando, Soria (bib0051) 2015; 26
Pan, Wang, Cheng, Yu (bib0045) 2018; 29
Ziegler, Nichols (bib0075) 1942; 64
Buçoniu, Babuska, Schutter, Ernst (bib0012) 2010
Busoniu, de Bruin, Toli, Kober, Palunko (bib0013) 2018; 46
Barbalata, De Carolis, Dunnigan, Petillot, Lane (bib0010) 2015
Tandan, Swarnkar (bib0064) 2015; 8
Acosta, Mayosky, Catalfo (bib0002) 1994; 4
LeCun, Bengio, Hinton, Y., Y., G. (bib0035) 2015; 521
Salhi, Alimi (bib0054) 2013
Bansal, Sharma, Shreeraman (bib0009) 2012; 2
Van Hasselt, Guez, Silver (bib0066) 2016
Antonelli, Chiaverini, Sarkar, West (bib0007) 2001; 9
Sharma, Rana, Kumar (bib0057) 2014; 41
Takači, Takači, Takači (bib0063) 2014; 17
Alejo, Cobano, Heredia, Ollero (bib0004) 2016; 84
Ye (bib0070) 2008; 71
Khodayari, Balochian (bib0032) 2015; 20
van Seijen, Sutton (bib0055) 2015
Timmer, Riedmiller (bib0065) 2007
Dyda, Os’kin (bib0021) 2004; 37
Zadeh (bib0071) 1973; SMC-3
Antonelli, Caccavale, Chiaverini, Fusco (bib0006) 2003; 11
Önder Efe (bib0043) 2011; 7
Fossen (bib0023) 2002
Carlucho, De Paula, Villar, Acosta (bib0015) 2017; 80
Zhang, Sutton (bib0072) 2017
Sheikhlar, Fakharian, Adhami-Mirhosseini (bib0058) 2013
Geder, Palmisano, Ramamurti, Sandberg, Ratna (bib0024) 2008
Goumagias, Hristu-Varsakelis, Assael (bib0026) 2018; 101
Zhao, Yi, Peng, Peng (bib0073) 2017
Shah, Agashe (bib0056) 2016; 38
Ye (10.1016/j.eswa.2019.06.066_bib0070) 2008; 71
Goumagias (10.1016/j.eswa.2019.06.066_bib0026) 2018; 101
Sharma (10.1016/j.eswa.2019.06.066_bib0057) 2014; 41
Önder Efe (10.1016/j.eswa.2019.06.066_bib0043) 2011; 7
Ahmed (10.1016/j.eswa.2019.06.066_bib0003) 2015
Watkins (10.1016/j.eswa.2019.06.066_bib0068) 1992; 8
Acosta (10.1016/j.eswa.2019.06.066_bib0002) 1994; 4
Geder (10.1016/j.eswa.2019.06.066_bib0024) 2008
Khodayari (10.1016/j.eswa.2019.06.066_bib0032) 2015; 20
Takači (10.1016/j.eswa.2019.06.066_bib0063) 2014; 17
Sood (10.1016/j.eswa.2019.06.066_bib0061) 2014; 1
LeCun (10.1016/j.eswa.2019.06.066_sbref0035) 2015; 521
Barbalata (10.1016/j.eswa.2019.06.066_bib0010) 2015
Kuo (10.1016/j.eswa.2019.06.066_bib0034) 1995
Monahan (10.1016/j.eswa.2019.06.066_bib0039) 1982; 28
Mnih (10.1016/j.eswa.2019.06.066_sbref0038) 2015; 518
Singh (10.1016/j.eswa.2019.06.066_bib0059) 2013
van Seijen (10.1016/j.eswa.2019.06.066_bib0055) 2015
Timmer (10.1016/j.eswa.2019.06.066_bib0065) 2007
Pessen (10.1016/j.eswa.2019.06.066_bib0047) 1994; 116
Tandan (10.1016/j.eswa.2019.06.066_bib0064) 2015; 8
Åström (10.1016/j.eswa.2019.06.066_bib0049) 2004; 14
Shah (10.1016/j.eswa.2019.06.066_bib0056) 2016; 38
Wang (10.1016/j.eswa.2019.06.066_bib0067) 1999; 38
Bansal (10.1016/j.eswa.2019.06.066_bib0009) 2012; 2
Dyda (10.1016/j.eswa.2019.06.066_bib0021) 2004; 37
el Hakim (10.1016/j.eswa.2019.06.066_bib0028) 2013
Zhang (10.1016/j.eswa.2019.06.066_bib0072) 2017
Zadeh (10.1016/j.eswa.2019.06.066_bib0071) 1973; SMC-3
Ziegler (10.1016/j.eswa.2019.06.066_bib0075) 1942; 64
Antonelli (10.1016/j.eswa.2019.06.066_bib0005) 2007; 32
Gloye (10.1016/j.eswa.2019.06.066_sbref0025) 2005
MyungsooJun (10.1016/j.eswa.2019.06.066_bib0040) 1999
Safonov (10.1016/j.eswa.2019.06.066_bib0053) 1997; 42
Cohen (10.1016/j.eswa.2019.06.066_sbref0019) 2002
Pan (10.1016/j.eswa.2019.06.066_bib0044) 2013; 40
Kuantama (10.1016/j.eswa.2019.06.066_bib0033) 2017; 12
Rout (10.1016/j.eswa.2019.06.066_bib0052) 2017; 48
Chen (10.1016/j.eswa.2019.06.066_bib0017) 1989; 49
Bhatnagar (10.1016/j.eswa.2019.06.066_bib0011) 2009; 45
Alejo (10.1016/j.eswa.2019.06.066_bib0004) 2016; 84
Carlucho (10.1016/j.eswa.2019.06.066_bib0015) 2017; 80
Fatan (10.1016/j.eswa.2019.06.066_bib0022) 2013
Salhi (10.1016/j.eswa.2019.06.066_bib0054) 2013
Sheikhlar (10.1016/j.eswa.2019.06.066_bib0058) 2013
Neumann (10.1016/j.eswa.2019.06.066_bib0041) 2008
Arokiasami (10.1016/j.eswa.2019.06.066_bib0008) 2016
Liu (10.1016/j.eswa.2019.06.066_bib0037) 2015; 55
Wei (10.1016/j.eswa.2019.06.066_bib0069) 2015
Van Hasselt (10.1016/j.eswa.2019.06.066_sbref0066) 2016
Busoniu (10.1016/j.eswa.2019.06.066_bib0013) 2018; 46
Antonelli (10.1016/j.eswa.2019.06.066_bib0006) 2003; 11
Zhuang (10.1016/j.eswa.2019.06.066_bib0074) 1993; 140
Petit (10.1016/j.eswa.2019.06.066_bib0048) 2016
Antonelli (10.1016/j.eswa.2019.06.066_bib0007) 2001; 9
Chang (10.1016/j.eswa.2019.06.066_bib0016) 2013
Zhang (10.1016/j.eswa.2019.06.066_bib0077) 2017
Hägglund (10.1016/j.eswa.2019.06.066_bib0027) 1991; 27
Rossomando (10.1016/j.eswa.2019.06.066_bib0051) 2015; 26
Sood (10.1016/j.eswa.2019.06.066_bib0060) 2011
Abu Rmilah (10.1016/j.eswa.2019.06.066_bib0001) 2016; 24
Fossen (10.1016/j.eswa.2019.06.066_bib0023) 2002
Hernández-Alvarado (10.1016/j.eswa.2019.06.066_bib0031) 2016; 16
Ziegler (10.1016/j.eswa.2019.06.066_bib0076) 1993; 115
Hasselt (10.1016/j.eswa.2019.06.066_bib0029) 2010
Li (10.1016/j.eswa.2019.06.066_sbref0036) 2012; 590
Dong (10.1016/j.eswa.2019.06.066_bib0020) 2012
Ogata (10.1016/j.eswa.2019.06.066_bib0042) 2010
Pan (10.1016/j.eswa.2019.06.066_bib0045) 2018; 29
Åström (10.1016/j.eswa.2019.06.066_bib0050) 2006
Cohen (10.1016/j.eswa.2019.06.066_bib0018) 1953; 75
Pan (10.1016/j.eswa.2019.06.066_sbref0046) 2018
Sutton (10.1016/j.eswa.2019.06.066_bib0062) 1998
Zhao (10.1016/j.eswa.2019.06.066_bib0073) 2017
Buçoniu (10.1016/j.eswa.2019.06.066_bib0012) 2010
Haykin (10.1016/j.eswa.2019.06.066_bib0030) 1998
Calvo-Rolle (10.1016/j.eswa.2019.06.066_bib0014) 2013; 40
References_xml – volume: 75
  start-page: 827
  year: 1953
  end-page: 834
  ident: bib0018
  article-title: Theoretical consideration of retarded control
  publication-title: Transactions of ASME
– start-page: 662
  year: 2013
  end-page: 665
  ident: bib0022
  article-title: An adaptive neuro PID for controlling the altitude of quadcopter robot
  publication-title: Proceedings of the 18th international conference on methods & models in automation & robotics (MMAR)
– year: 2006
  ident: bib0050
  article-title: Advanced PID control
– volume: 1
  start-page: 37
  year: 2014
  ident: bib0061
  article-title: Adaptive fuzzy controller for controlling the dynamics of robot motion
  publication-title: International Journal of Fuzzy Computation and Modelling
– start-page: 1375
  year: 2016
  end-page: 1381
  ident: bib0008
  article-title: Vector directed path generation and tracking for autonomous unmanned aerial/ ground vehicles
  publication-title: Proceedings of the ieee congress on evolutionary computation (CEC)
– start-page: 1
  year: 2013
  end-page: 6
  ident: bib0028
  article-title: Application of reinforcement learning on self-tuning PID controller for soccer robot multi-agent system
  publication-title: Proceedings of the joint international conference on rural information & communication technology and electric-vehicle technology (RICT & ICEV-T)
– volume: 26
  start-page: 1179
  year: 2015
  end-page: 1191
  ident: bib0051
  article-title: Identification and control of nonlinear dynamics of a mobile robot in discrete time using an adaptive technique based on neural PID
  publication-title: Neural Computing and Applications
– volume: 29
  start-page: 2227
  year: 2018
  end-page: 2238
  ident: bib0045
  article-title: Multisource transfer double DQN based on actor learning
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
– volume: 28
  start-page: 1
  year: 1982
  end-page: 16
  ident: bib0039
  article-title: State of the art—A survey of partially observable Markov decision processes: theory, models, and algorithms
  publication-title: Management Science
– year: 2015
  ident: bib0069
  article-title: Development of a model based fuzzy-PID controller for the AeroQuad cyclone quad-copter
  publication-title: Proceedings of the AIAA infotech @ aerospace
– year: 2002
  ident: bib0019
  article-title: PID and fuzzy logic pitch attitude hold systems for a fighter jet AIAA-2002-4646 PID and fuzzy logic pitch attitude hold systems for a fighter jet
  publication-title: Proceedings of the AIAA guidance, navigation, and control conference and exhibit
– volume: 7
  year: 2011
  ident: bib0043
  article-title: Neural network assisted computationally simple PI d control of a quadrotor UAV
  publication-title: IEEE Transactions on Industrial Informatics
– volume: 48
  start-page: 367
  year: 2017
  end-page: 375
  ident: bib0052
  article-title: Inverse optimal self-tuning PID control design for an autonomous underwater vehicle
  publication-title: International Journal of Systems Science
– volume: 46
  start-page: 8
  year: 2018
  end-page: 28
  ident: bib0013
  article-title: Reinforcement learning for control: Performance, stability, and deep approximators
  publication-title: Annual Reviews in Control
– volume: 20
  start-page: 559
  year: 2015
  end-page: 578
  ident: bib0032
  article-title: Modeling and control of autonomous underwater vehicle (AUV) in heading and depth attitude via self-adaptive fuzzy PID controller
  publication-title: Journal of Marine Science and Technology (Japan)
– year: 2017
  ident: bib0072
  article-title: A deeper look at experience replay
  publication-title: Proceedings of the NIPS deep reinforcement learning symposium
– volume: 16
  year: 2016
  ident: bib0031
  article-title: Neural network-based self-Tuning PID control for underwater vehicles
  publication-title: Sensors (Basel, Switzerland)
– volume: 49
  start-page: 1013
  year: 1989
  end-page: 1032
  ident: bib0017
  article-title: Representations of non-linear systems: the NARMAX model
  publication-title: International Journal of Control
– volume: 27
  start-page: 599
  year: 1991
  end-page: 609
  ident: bib0027
  article-title: Industrial adaptive controllers based on frequency response techniques
  publication-title: Automatica
– start-page: 385
  year: 2011
  end-page: 390
  ident: bib0060
  article-title: Autonomous robot motion control using fuzzy PID controller
  publication-title: High performance architecture and grid computing
– volume: 71
  start-page: 1561
  year: 2008
  end-page: 1565
  ident: bib0070
  article-title: Adaptive control of nonlinear PID-based analog neural networks for a nonholonomic mobile robot
  publication-title: Neurocomputing
– start-page: 2094
  year: 2016
  end-page: 2100
  ident: bib0066
  article-title: Deep reinforcement learning with double Q-Learning
  publication-title: Proceedings of the 30th AAAI conference on artificial intelligence, AAAI
– volume: 17
  year: 2014
  ident: bib0063
  article-title: On the operational solutions of fuzzy fractional differential equations
  publication-title: Fractional Calculus and Applied Analysis
– volume: 45
  start-page: 2471
  year: 2009
  end-page: 2482
  ident: bib0011
  article-title: Natural actor-critic algorithms
  publication-title: Automatica
– year: 2015
  ident: bib0055
  article-title: A deeper look at planning as learning from replay
  publication-title: Proceedings of the 2nd multidisciplinary conference on reinforcement learning and decision making
– volume: 41
  start-page: 4274
  year: 2014
  end-page: 4289
  ident: bib0057
  article-title: Performance analysis of fractional order fuzzy PID controllers applied to a robotic manipulator
  publication-title: Expert Systems with Applications
– start-page: 160
  year: 2005
  end-page: 171
  ident: bib0025
  article-title: Learning to Drive and Simulate Autonomous Mobile Robots
  publication-title: Proceedings of the Robocup
– start-page: 40
  year: 2008
  end-page: 46
  ident: bib0024
  article-title: Fuzzy logic PID based control design and performance for a pectoral fin propelled unmanned underwater vehicle
  publication-title: Proceedings of the international conference on control, automation and systems
– volume: 24
  start-page: 934
  year: 2016
  end-page: 950
  ident: bib0001
  article-title: A PC-based simulation platform for a quadcopter system with self-tuning fuzzy PID controllers
  publication-title: Computer Applications in Engineering Education
– volume: 115
  start-page: 220
  year: 1993
  ident: bib0076
  article-title: Optimum settings for automatic controllers
  publication-title: Journal of Dynamic Systems, Measurement, and Control
– volume: 11
  start-page: 221
  year: 2003
  end-page: 232
  ident: bib0006
  article-title: A novel adaptive control law for underwater vehicles
  publication-title: IEEE Transactions on Control Systems Technology
– year: 2002
  ident: bib0023
  article-title: Marine control systems : guidance, navigation and control of ships, rigs and underwater vehicles
– volume: 8
  start-page: 357
  year: 2015
  end-page: 362
  ident: bib0064
  article-title: PID controller optimization by soft computing techniques–A review
  publication-title: International Journal of Hybrid Information Technology
– start-page: 92
  year: 2013
  end-page: 97
  ident: bib0054
  article-title: Fuzzy-PID hybrid controller for mobile robot using point cloud and low cost depth sensor
  publication-title: Proceedings of the international conference on individual and collective behaviors in robotics (ICBR)
– volume: 40
  start-page: 5188
  year: 2013
  end-page: 5196
  ident: bib0014
  article-title: A hybrid intelligent system for PID controller using in a steel rolling process
  publication-title: Expert Systems with Applications
– start-page: 1
  year: 2016
  end-page: 6
  ident: bib0048
  article-title: Prediction of the hydrodynamic coefficients of an autonomous underwater vehicle
  publication-title: Proceedings of the 3rd IEEE/OES south american international symposium on oceanic engineering (SAISOE)
– year: 1998
  ident: bib0062
  article-title: Reinforcement learning : An introduction
– volume: 38
  start-page: 243
  year: 1999
  end-page: 249
  ident: bib0067
  article-title: PID Tuning with exact gain and phase margins
  publication-title: ISA Transactions
– volume: 64
  start-page: 759
  year: 1942
  end-page: 768
  ident: bib0075
  article-title: Optimum settings for automatic controllers
  publication-title: Transactions of the ASME
– volume: 55
  start-page: 227
  year: 2015
  end-page: 233
  ident: bib0037
  article-title: Variable-order fuzzy fractional PID controller
  publication-title: ISA Transactions
– start-page: 1658
  year: 2015
  end-page: 1663
  ident: bib0010
  article-title: An adaptive controller for autonomous underwater vehicles
  publication-title: Proceedings of the IEEE/RSJ international conference on intelligent robots and systems (IROS)
– volume: 4
  start-page: 53
  year: 1994
  end-page: 66
  ident: bib0002
  article-title: An expert PID controller uses refined Ziegler and Nichols rules and fuzzy logic ideas
  publication-title: Applied Intelligence
– volume: 80
  start-page: 183
  year: 2017
  end-page: 199
  ident: bib0015
  article-title: Incremental Q-learning strategy for adaptive PID control of mobile robots
  publication-title: Expert Systems with Applications
– volume: SMC-3
  start-page: 28
  year: 1973
  end-page: 44
  ident: bib0071
  article-title: Outline of a new approach to the analysis of complex systems and decision processes
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics
– volume: 521
  start-page: 436
  year: 2015
  end-page: 444
  ident: bib0035
  article-title: Deep learning
  publication-title: Nature
– volume: 140
  start-page: 216
  year: 1993
  ident: bib0074
  article-title: Automatic tuning of optimum PID controllers
  publication-title: IEE Proceedings D Control Theory and Applications
– volume: 38
  start-page: 29
  year: 2016
  end-page: 41
  ident: bib0056
  article-title: Review of fractional PID controller
  publication-title: Mechatronics
– start-page: 1
  year: 2013
  end-page: 6
  ident: bib0059
  article-title: Neural network based adaptive non linear PID controller for non-holonomic mobile robot
  publication-title: Proceedings of the international conference on control, automation, robotics and embedded systems (CARE)
– start-page: 1
  year: 2007
  end-page: 8
  ident: bib0065
  article-title: Fitted Q iteration with CMACs
  publication-title: Proceedings of the IEEE international symposium on approximate dynamic programming and reinforcement learning
– volume: 9
  start-page: 756
  year: 2001
  end-page: 765
  ident: bib0007
  article-title: Adaptive control of an autonomous underwater vehicle: experimental results on ODIN
  publication-title: IEEE Transactions on Control Systems Technology
– volume: 84
  start-page: 745
  year: 2016
  end-page: 758
  ident: bib0004
  article-title: A reactive method for collision avoidance in industrial environments
  publication-title: Journal of Intelligent & Robotic Systems
– volume: 2
  start-page: 168
  year: 2012
  end-page: 176
  ident: bib0009
  article-title: PID Controller tuning techniques: A Review
  publication-title: Journal of Control Engineering and Technology
– start-page: 898
  year: 2012
  end-page: 903
  ident: bib0020
  article-title: A neural network-based self-tuning PID controller of an autonomous underwater vehicle
  publication-title: Proceedings of the IEEE international conference on mechatronics and automation
– volume: 101
  start-page: 258
  year: 2018
  end-page: 270
  ident: bib0026
  article-title: Using deep Q-learning to understand the tax evasion behavior of risk-averse firms
  publication-title: Expert Systems with Applications
– start-page: 540
  year: 2013
  end-page: 549
  ident: bib0016
  article-title: Adaptive tracking controller based on the PID for mobile robot path tracking
  publication-title: Intelligent robotics and applications
– start-page: 2613
  year: 2010
  end-page: 2621
  ident: bib0029
  article-title: Double q-learning
  publication-title: Proceedings of the 23rd international conference on neural information processing systems - volume 2
– start-page: 867
  year: 2017
  end-page: 878
  ident: bib0073
  article-title: Design and simulation of a Self-adaptive Fuzzy-PID Controller for an autonomous underwater vehicle
– volume: 42
  start-page: 843
  year: 1997
  end-page: 847
  ident: bib0053
  article-title: The unfalsified control concept and learning
  publication-title: IEEE Transactions on Automatic Control
– year: 1998
  ident: bib0030
  article-title: Neural networks: A comprehensive foundation
– start-page: 99
  year: 2015
  end-page: 104
  ident: bib0003
  article-title: Attitude stabilization of quad-rotor (UAV) system using fuzzy PID controller (an experimental test)
  publication-title: Proceedings of the second international conference on computing technology and information management (ICCTIM)
– start-page: 328
  year: 1999
  end-page: 333
  ident: bib0040
  article-title: Automatic PID tuning: an application of unfalsified control
  publication-title: Proceedings of the IEEE international symposium on computer aided control system design
– year: 2010
  ident: bib0042
  article-title: Modern control engineering
– year: 2018
  ident: bib0046
  article-title: Organizing experience: A deeper look at replay mechanisms for sample-based planning in continuous state domains
  publication-title: Technical Report
– volume: 40
  start-page: 1629
  year: 2013
  end-page: 1635
  ident: bib0044
  article-title: An efficient neural network approach to tracking control of an autonomous surface vehicle with unknown dynamics
  publication-title: Expert Systems with Applications
– volume: 8
  start-page: 279
  year: 1992
  end-page: 292
  ident: bib0068
  article-title: Technical note: Q-Learning
  publication-title: Machine Learning
– start-page: 1
  year: 2013
  end-page: 4
  ident: bib0058
  article-title: Fuzzy adaptive PI control of omni-directional mobile robot
  publication-title: Proceedings of the 13th Iranian conference on fuzzy systems (IFSC)
– volume: 116
  start-page: 553
  year: 1994
  ident: bib0047
  article-title: A new look at PID-controller tuning
  publication-title: Journal of Dynamic Systems, Measurement, and Control
– volume: 32
  start-page: 300
  year: 2007
  end-page: 312
  ident: bib0005
  article-title: On the use of adaptive/integral actions for six-degrees-of-freedom control of autonomous underwater vehicles
  publication-title: IEEE Journal of Oceanic Engineering
– volume: 14
  start-page: 635
  year: 2004
  end-page: 650
  ident: bib0049
  article-title: Revisiting the Ziegler–Nichols step response method for PID control
  publication-title: Journal of Process Control
– volume: 12
  start-page: 519
  year: 2017
  ident: bib0033
  article-title: PID And fuzzy-PID control model for quadcopter attitude with disturbance parameter
  publication-title: International Journal of Computers Communications & Control
– start-page: 3455
  year: 2017
  end-page: 3461
  ident: bib0077
  article-title: Weighted double Q-learning
  publication-title: Proceedings of the twenty-sixth international joint conference on artificial intelligence, IJCAI
– volume: 590
  start-page: 268
  year: 2012
  end-page: 271
  ident: bib0036
  article-title: Trajectory tracking control of mobile robot based on self-Adaptive PID combined with preview and fuzzy logic
  publication-title: Advanced Materials Research
– year: 2010
  ident: bib0012
  article-title: Reinforcement learning and dynamic programming using function approximators
– year: 1995
  ident: bib0034
  article-title: Automatic control systems
– volume: 518
  start-page: 529
  year: 2015
  end-page: 533
  ident: bib0038
  article-title: Human-level control through deep reinforcement learning
  publication-title: Nature
– volume: 37
  start-page: 427
  year: 2004
  end-page: 432
  ident: bib0021
  article-title: Neural network control system for underwater robots
  publication-title: IFAC Proceedings Volumes
– start-page: 1177
  year: 2008
  end-page: 1184
  ident: bib0041
  article-title: Fitted Q-iteration by advantage weighted regression
  publication-title: Advances in neural information processing systems 21
– volume: 55
  start-page: 227
  year: 2015
  ident: 10.1016/j.eswa.2019.06.066_bib0037
  article-title: Variable-order fuzzy fractional PID controller
  publication-title: ISA Transactions
  doi: 10.1016/j.isatra.2014.09.012
– volume: 24
  start-page: 934
  issue: 6
  year: 2016
  ident: 10.1016/j.eswa.2019.06.066_bib0001
  article-title: A PC-based simulation platform for a quadcopter system with self-tuning fuzzy PID controllers
  publication-title: Computer Applications in Engineering Education
  doi: 10.1002/cae.21769
– volume: 2
  start-page: 168
  issue: 4
  year: 2012
  ident: 10.1016/j.eswa.2019.06.066_bib0009
  article-title: PID Controller tuning techniques: A Review
  publication-title: Journal of Control Engineering and Technology
– volume: 37
  start-page: 427
  issue: 10
  year: 2004
  ident: 10.1016/j.eswa.2019.06.066_bib0021
  article-title: Neural network control system for underwater robots
  publication-title: IFAC Proceedings Volumes
  doi: 10.1016/S1474-6670(17)31769-X
– volume: 9
  start-page: 756
  issue: 5
  year: 2001
  ident: 10.1016/j.eswa.2019.06.066_bib0007
  article-title: Adaptive control of an autonomous underwater vehicle: experimental results on ODIN
  publication-title: IEEE Transactions on Control Systems Technology
  doi: 10.1109/87.944470
– start-page: 1
  year: 2016
  ident: 10.1016/j.eswa.2019.06.066_bib0048
  article-title: Prediction of the hydrodynamic coefficients of an autonomous underwater vehicle
– volume: 16
  issue: 9
  year: 2016
  ident: 10.1016/j.eswa.2019.06.066_bib0031
  article-title: Neural network-based self-Tuning PID control for underwater vehicles
  publication-title: Sensors (Basel, Switzerland)
  doi: 10.3390/s16091429
– volume: 29
  start-page: 2227
  issue: 6
  year: 2018
  ident: 10.1016/j.eswa.2019.06.066_bib0045
  article-title: Multisource transfer double DQN based on actor learning
  publication-title: IEEE Transactions on Neural Networks and Learning Systems
  doi: 10.1109/TNNLS.2018.2806087
– year: 2006
  ident: 10.1016/j.eswa.2019.06.066_bib0050
– volume: 40
  start-page: 1629
  issue: 5
  year: 2013
  ident: 10.1016/j.eswa.2019.06.066_bib0044
  article-title: An efficient neural network approach to tracking control of an autonomous surface vehicle with unknown dynamics
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2012.09.008
– start-page: 540
  year: 2013
  ident: 10.1016/j.eswa.2019.06.066_bib0016
  article-title: Adaptive tracking controller based on the PID for mobile robot path tracking
– start-page: 385
  year: 2011
  ident: 10.1016/j.eswa.2019.06.066_bib0060
  article-title: Autonomous robot motion control using fuzzy PID controller
– volume: 40
  start-page: 5188
  issue: 13
  year: 2013
  ident: 10.1016/j.eswa.2019.06.066_bib0014
  article-title: A hybrid intelligent system for PID controller using in a steel rolling process
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2013.03.013
– start-page: 1375
  year: 2016
  ident: 10.1016/j.eswa.2019.06.066_bib0008
  article-title: Vector directed path generation and tracking for autonomous unmanned aerial/ ground vehicles
– start-page: 1177
  year: 2008
  ident: 10.1016/j.eswa.2019.06.066_bib0041
  article-title: Fitted Q-iteration by advantage weighted regression
– start-page: 99
  year: 2015
  ident: 10.1016/j.eswa.2019.06.066_bib0003
  article-title: Attitude stabilization of quad-rotor (UAV) system using fuzzy PID controller (an experimental test)
– start-page: 662
  year: 2013
  ident: 10.1016/j.eswa.2019.06.066_bib0022
  article-title: An adaptive neuro PID for controlling the altitude of quadcopter robot
– volume: 518
  start-page: 529
  issue: 7540
  year: 2015
  ident: 10.1016/j.eswa.2019.06.066_sbref0038
  article-title: Human-level control through deep reinforcement learning
  publication-title: Nature
  doi: 10.1038/nature14236
– volume: 7
  issue: 2
  year: 2011
  ident: 10.1016/j.eswa.2019.06.066_bib0043
  article-title: Neural network assisted computationally simple PI d control of a quadrotor UAV
  publication-title: IEEE Transactions on Industrial Informatics
– start-page: 3455
  year: 2017
  ident: 10.1016/j.eswa.2019.06.066_bib0077
  article-title: Weighted double Q-learning
– volume: 116
  start-page: 553
  issue: 3
  year: 1994
  ident: 10.1016/j.eswa.2019.06.066_bib0047
  article-title: A new look at PID-controller tuning
  publication-title: Journal of Dynamic Systems, Measurement, and Control
  doi: 10.1115/1.2899252
– volume: 14
  start-page: 635
  issue: 6
  year: 2004
  ident: 10.1016/j.eswa.2019.06.066_bib0049
  article-title: Revisiting the Ziegler–Nichols step response method for PID control
  publication-title: Journal of Process Control
  doi: 10.1016/j.jprocont.2004.01.002
– volume: 20
  start-page: 559
  issue: 3
  year: 2015
  ident: 10.1016/j.eswa.2019.06.066_bib0032
  article-title: Modeling and control of autonomous underwater vehicle (AUV) in heading and depth attitude via self-adaptive fuzzy PID controller
  publication-title: Journal of Marine Science and Technology (Japan)
  doi: 10.1007/s00773-015-0312-7
– year: 2018
  ident: 10.1016/j.eswa.2019.06.066_sbref0046
  article-title: Organizing experience: A deeper look at replay mechanisms for sample-based planning in continuous state domains
– volume: 64
  start-page: 759
  year: 1942
  ident: 10.1016/j.eswa.2019.06.066_bib0075
  article-title: Optimum settings for automatic controllers
  publication-title: Transactions of the ASME
– volume: 84
  start-page: 745
  issue: 1
  year: 2016
  ident: 10.1016/j.eswa.2019.06.066_bib0004
  article-title: A reactive method for collision avoidance in industrial environments
  publication-title: Journal of Intelligent & Robotic Systems
  doi: 10.1007/s10846-016-0359-7
– start-page: 867
  year: 2017
  ident: 10.1016/j.eswa.2019.06.066_bib0073
– volume: 8
  start-page: 279
  issue: 3/4
  year: 1992
  ident: 10.1016/j.eswa.2019.06.066_bib0068
  article-title: Technical note: Q-Learning
  publication-title: Machine Learning
– year: 2002
  ident: 10.1016/j.eswa.2019.06.066_sbref0019
  article-title: PID and fuzzy logic pitch attitude hold systems for a fighter jet AIAA-2002-4646 PID and fuzzy logic pitch attitude hold systems for a fighter jet
– volume: 17
  issue: 4
  year: 2014
  ident: 10.1016/j.eswa.2019.06.066_bib0063
  article-title: On the operational solutions of fuzzy fractional differential equations
  publication-title: Fractional Calculus and Applied Analysis
  doi: 10.2478/s13540-014-0216-y
– start-page: 92
  year: 2013
  ident: 10.1016/j.eswa.2019.06.066_bib0054
  article-title: Fuzzy-PID hybrid controller for mobile robot using point cloud and low cost depth sensor
– volume: 75
  start-page: 827
  issue: 1
  year: 1953
  ident: 10.1016/j.eswa.2019.06.066_bib0018
  article-title: Theoretical consideration of retarded control
  publication-title: Transactions of ASME
– volume: 41
  start-page: 4274
  issue: 9
  year: 2014
  ident: 10.1016/j.eswa.2019.06.066_bib0057
  article-title: Performance analysis of fractional order fuzzy PID controllers applied to a robotic manipulator
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2013.12.030
– start-page: 1
  year: 2013
  ident: 10.1016/j.eswa.2019.06.066_bib0028
  article-title: Application of reinforcement learning on self-tuning PID controller for soccer robot multi-agent system
– volume: 27
  start-page: 599
  issue: 4
  year: 1991
  ident: 10.1016/j.eswa.2019.06.066_bib0027
  article-title: Industrial adaptive controllers based on frequency response techniques
  publication-title: Automatica
  doi: 10.1016/0005-1098(91)90052-4
– volume: 38
  start-page: 29
  year: 2016
  ident: 10.1016/j.eswa.2019.06.066_bib0056
  article-title: Review of fractional PID controller
  publication-title: Mechatronics
  doi: 10.1016/j.mechatronics.2016.06.005
– volume: 38
  start-page: 243
  issue: 3
  year: 1999
  ident: 10.1016/j.eswa.2019.06.066_bib0067
  article-title: PID Tuning with exact gain and phase margins
  publication-title: ISA Transactions
  doi: 10.1016/S0019-0578(99)00020-8
– volume: 4
  start-page: 53
  issue: 1
  year: 1994
  ident: 10.1016/j.eswa.2019.06.066_bib0002
  article-title: An expert PID controller uses refined Ziegler and Nichols rules and fuzzy logic ideas
  publication-title: Applied Intelligence
  doi: 10.1007/BF00872055
– volume: 12
  start-page: 519
  issue: 4
  year: 2017
  ident: 10.1016/j.eswa.2019.06.066_bib0033
  article-title: PID And fuzzy-PID control model for quadcopter attitude with disturbance parameter
  publication-title: International Journal of Computers Communications & Control
  doi: 10.15837/ijccc.2017.4.2962
– year: 1998
  ident: 10.1016/j.eswa.2019.06.066_bib0030
– volume: 71
  start-page: 1561
  issue: 7
  year: 2008
  ident: 10.1016/j.eswa.2019.06.066_bib0070
  article-title: Adaptive control of nonlinear PID-based analog neural networks for a nonholonomic mobile robot
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2007.04.014
– volume: 11
  start-page: 221
  issue: 2
  year: 2003
  ident: 10.1016/j.eswa.2019.06.066_bib0006
  article-title: A novel adaptive control law for underwater vehicles
  publication-title: IEEE Transactions on Control Systems Technology
  doi: 10.1109/TCST.2003.809244
– volume: 26
  start-page: 1179
  issue: 5
  year: 2015
  ident: 10.1016/j.eswa.2019.06.066_bib0051
  article-title: Identification and control of nonlinear dynamics of a mobile robot in discrete time using an adaptive technique based on neural PID
  publication-title: Neural Computing and Applications
  doi: 10.1007/s00521-014-1805-8
– volume: SMC-3
  start-page: 28
  issue: 1
  year: 1973
  ident: 10.1016/j.eswa.2019.06.066_bib0071
  article-title: Outline of a new approach to the analysis of complex systems and decision processes
  publication-title: IEEE Transactions on Systems, Man, and Cybernetics
  doi: 10.1109/TSMC.1973.5408575
– start-page: 2094
  year: 2016
  ident: 10.1016/j.eswa.2019.06.066_sbref0066
  article-title: Deep reinforcement learning with double Q-Learning
– volume: 80
  start-page: 183
  year: 2017
  ident: 10.1016/j.eswa.2019.06.066_bib0015
  article-title: Incremental Q-learning strategy for adaptive PID control of mobile robots
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2017.03.002
– start-page: 2613
  year: 2010
  ident: 10.1016/j.eswa.2019.06.066_bib0029
  article-title: Double q-learning
– start-page: 40
  year: 2008
  ident: 10.1016/j.eswa.2019.06.066_bib0024
  article-title: Fuzzy logic PID based control design and performance for a pectoral fin propelled unmanned underwater vehicle
– volume: 115
  start-page: 220
  issue: 2B
  year: 1993
  ident: 10.1016/j.eswa.2019.06.066_bib0076
  article-title: Optimum settings for automatic controllers
  publication-title: Journal of Dynamic Systems, Measurement, and Control
  doi: 10.1115/1.2899060
– volume: 28
  start-page: 1
  issue: 1
  year: 1982
  ident: 10.1016/j.eswa.2019.06.066_bib0039
  article-title: State of the art—A survey of partially observable Markov decision processes: theory, models, and algorithms
  publication-title: Management Science
  doi: 10.1287/mnsc.28.1.1
– volume: 48
  start-page: 367
  issue: 2
  year: 2017
  ident: 10.1016/j.eswa.2019.06.066_bib0052
  article-title: Inverse optimal self-tuning PID control design for an autonomous underwater vehicle
  publication-title: International Journal of Systems Science
  doi: 10.1080/00207721.2016.1186238
– volume: 140
  start-page: 216
  issue: 3
  year: 1993
  ident: 10.1016/j.eswa.2019.06.066_bib0074
  article-title: Automatic tuning of optimum PID controllers
  publication-title: IEE Proceedings D Control Theory and Applications
  doi: 10.1049/ip-d.1993.0030
– year: 2010
  ident: 10.1016/j.eswa.2019.06.066_bib0042
– start-page: 328
  year: 1999
  ident: 10.1016/j.eswa.2019.06.066_bib0040
  article-title: Automatic PID tuning: an application of unfalsified control
– year: 2002
  ident: 10.1016/j.eswa.2019.06.066_bib0023
– volume: 8
  start-page: 357
  issue: 7
  year: 2015
  ident: 10.1016/j.eswa.2019.06.066_bib0064
  article-title: PID controller optimization by soft computing techniques–A review
  publication-title: International Journal of Hybrid Information Technology
  doi: 10.14257/ijhit.2015.8.7.32
– year: 1995
  ident: 10.1016/j.eswa.2019.06.066_bib0034
– year: 2017
  ident: 10.1016/j.eswa.2019.06.066_bib0072
  article-title: A deeper look at experience replay
– start-page: 1
  year: 2013
  ident: 10.1016/j.eswa.2019.06.066_bib0058
  article-title: Fuzzy adaptive PI control of omni-directional mobile robot
– start-page: 898
  year: 2012
  ident: 10.1016/j.eswa.2019.06.066_bib0020
  article-title: A neural network-based self-tuning PID controller of an autonomous underwater vehicle
– year: 2015
  ident: 10.1016/j.eswa.2019.06.066_bib0055
  article-title: A deeper look at planning as learning from replay
– start-page: 160
  year: 2005
  ident: 10.1016/j.eswa.2019.06.066_sbref0025
  article-title: Learning to Drive and Simulate Autonomous Mobile Robots
– year: 2010
  ident: 10.1016/j.eswa.2019.06.066_bib0012
– year: 1998
  ident: 10.1016/j.eswa.2019.06.066_bib0062
– volume: 1
  start-page: 37
  issue: 1
  year: 2014
  ident: 10.1016/j.eswa.2019.06.066_bib0061
  article-title: Adaptive fuzzy controller for controlling the dynamics of robot motion
  publication-title: International Journal of Fuzzy Computation and Modelling
  doi: 10.1504/IJFCM.2014.064230
– start-page: 1
  year: 2007
  ident: 10.1016/j.eswa.2019.06.066_bib0065
  article-title: Fitted Q iteration with CMACs
– volume: 32
  start-page: 300
  issue: 2
  year: 2007
  ident: 10.1016/j.eswa.2019.06.066_bib0005
  article-title: On the use of adaptive/integral actions for six-degrees-of-freedom control of autonomous underwater vehicles
  publication-title: IEEE Journal of Oceanic Engineering
  doi: 10.1109/JOE.2007.893685
– start-page: 1658
  year: 2015
  ident: 10.1016/j.eswa.2019.06.066_bib0010
  article-title: An adaptive controller for autonomous underwater vehicles
– year: 2015
  ident: 10.1016/j.eswa.2019.06.066_bib0069
  article-title: Development of a model based fuzzy-PID controller for the AeroQuad cyclone quad-copter
– volume: 46
  start-page: 8
  year: 2018
  ident: 10.1016/j.eswa.2019.06.066_bib0013
  article-title: Reinforcement learning for control: Performance, stability, and deep approximators
  publication-title: Annual Reviews in Control
  doi: 10.1016/j.arcontrol.2018.09.005
– volume: 45
  start-page: 2471
  issue: 11
  year: 2009
  ident: 10.1016/j.eswa.2019.06.066_bib0011
  article-title: Natural actor-critic algorithms
  publication-title: Automatica
  doi: 10.1016/j.automatica.2009.07.008
– volume: 101
  start-page: 258
  year: 2018
  ident: 10.1016/j.eswa.2019.06.066_bib0026
  article-title: Using deep Q-learning to understand the tax evasion behavior of risk-averse firms
  publication-title: Expert Systems with Applications
  doi: 10.1016/j.eswa.2018.01.039
– volume: 590
  start-page: 268
  year: 2012
  ident: 10.1016/j.eswa.2019.06.066_sbref0036
  article-title: Trajectory tracking control of mobile robot based on self-Adaptive PID combined with preview and fuzzy logic
  publication-title: Advanced Materials Research
  doi: 10.4028/www.scientific.net/AMR.590.268
– volume: 521
  start-page: 436
  issue: 7553
  year: 2015
  ident: 10.1016/j.eswa.2019.06.066_sbref0035
  article-title: Deep learning
  publication-title: Nature
  doi: 10.1038/nature14539
– start-page: 1
  year: 2013
  ident: 10.1016/j.eswa.2019.06.066_bib0059
  article-title: Neural network based adaptive non linear PID controller for non-holonomic mobile robot
– volume: 42
  start-page: 843
  issue: 6
  year: 1997
  ident: 10.1016/j.eswa.2019.06.066_bib0053
  article-title: The unfalsified control concept and learning
  publication-title: IEEE Transactions on Automatic Control
  doi: 10.1109/9.587340
– volume: 49
  start-page: 1013
  issue: 3
  year: 1989
  ident: 10.1016/j.eswa.2019.06.066_bib0017
  article-title: Representations of non-linear systems: the NARMAX model
  publication-title: International Journal of Control
  doi: 10.1080/00207178908559683
SSID ssj0017007
Score 2.50358
Snippet •A model-free reinforcement learning algorithm for adaptive low-level PID control.•Double Q-learning provides excelent performance with low computational...
Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems...
SourceID proquest
crossref
elsevier
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 292
SubjectTerms Adaptive control
Algorithms
Applications programs
Computer simulation
Controllers
Double Q-learning
Double Q-PID
Expert systems
Incremental learning
Machine learning
Mobile robots
Multi-platforms
Proportional integral derivative
Real time
Reinforcement learning
Robot control
Robots
Self adaptive control systems
Space exploration
Tuning
Title Double Q-PID algorithm for mobile robot control
URI https://dx.doi.org/10.1016/j.eswa.2019.06.066
https://www.proquest.com/docview/2306476042
Volume 137
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Baden-Württemberg Complete Freedom Collection (Elsevier)
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: GBLVA
  dateStart: 20110101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: Elsevier ScienceDirect (LUT)
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: ACRLP
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals [SCFCJ]
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: AIKHN
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: Science Direct
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: .~1
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVLSH
  databaseName: Elsevier Journals
  customDbUrl:
  mediaType: online
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: AKRWK
  dateStart: 19900101
  isFulltext: true
  providerName: Library Specific Holdings
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS8NAEF5KvXjxLT5q2YM3iclmH2mOpVpaxaJoobdlX9FK25S24s3f7m6yKSjSg5BLwm4Is7sz35BvvgHgkkqGirKPiCATkCySgRSMBiSW2EZsrA11hcIPA9YbkrsRHdVAp6qFcbRK7_tLn154a_8k9NYM5-Nx-GzBgQ2H9krdryPiivgISVwXg-uvNc3Dyc8lpd5eErjRvnCm5HiZ5afTHkJpoeFZKCX-GZx-ueki9nT3wI4HjbBdftc-qJnZAditGjJAfz4PQWjhsJwY-BQ89m-gmLzmNvV_m0ILTOE0l9YBwEUu8xX0BPUjMOzevnR6ge-IECgct1aBTTcilcYmjZkWSseC4EimWNmwrREVWZYRZSzCSbXr9xCLSBjJpNYiFUgSpfAxqM_ymTkBEEctRTVhGkkbyGxSJKiKM4UlSTPUSvQpQJUpuPJy4a5rxYRXvLB37szHnfm4I8cxdgqu1nPmpVjGxtG0sjD_seTcevON8xrVcnB_4Ja8yKQSZnfd2T9few623Z2jqiDaAPXV4sNcWMCxks1iRzXBVrt_3xt8Ax1I0ws
linkProvider Elsevier
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07T8MwELYKDLDwRrzxwIZC4sR26hEVqgJtBaKVull-hYdKg9ogNn47duJUAiEGpEyJHUVn-7s75bvvADglkqKy7CPCyAQ4i2QgBSUBjmViPXaiDXGFwr0-7QzxzYiMGqBV18I4WqXH_grTS7T2d0JvzfDt-Tl8sMGBdYf2Yu7XEWYLYAmTOHUZ2PnnnOfh9OfSSnAvDdxwXzlTkbzM7MOJDyFWiniWUom_eqcfOF06n_Y6WPVRI7yoPmwDNMxkE6zVHRmgP6BbILTxsBwbeB_cXV9CMX7Mbe7_9AptZApfc2kRAE5zmRfQM9S3wbB9NWh1At8SIVBJ3CwCm29EisWGxVQLpWOBk0iyRFm_rRERWZZhZWyIw7Rr-BCLSBhJpdaCCSSxUskOWJzkE7MLYBI1FdGYaiStJ7NZkSAqzlQiMctQM9V7ANWm4Mrrhbu2FWNeE8NeuDMfd-bjjh1H6R44m895q9Qy_hxNagvzb2vOLZz_Oe-wXg7uT9yMl6lUSu222__na0_AcmfQ6_Ludf_2AKy4J463gsghWCym7-bIRh-FPC531xcq2tSg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Double+Q-PID+algorithm+for+mobile+robot+control&rft.jtitle=Expert+systems+with+applications&rft.au=Carlucho%2C+Ignacio&rft.au=De+Paula%2C+Mariano&rft.au=Acosta%2C+Gerardo+G.&rft.date=2019-12-15&rft.issn=0957-4174&rft.volume=137&rft.spage=292&rft.epage=307&rft_id=info:doi/10.1016%2Fj.eswa.2019.06.066&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_eswa_2019_06_066
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0957-4174&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0957-4174&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0957-4174&client=summon