Tabu Temporal Difference Learning for Robot Path Planning in Uncertain Environments

This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such a...

Full description

Saved in:
Bibliographic Details
Published inTowards Autonomous Robotic Systems Vol. 10965; pp. 123 - 134
Main Authors Wei, Changyun, Ni, Fusheng
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2018
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN9783319967271
3319967274
ISSN0302-9743
1611-3349
DOI10.1007/978-3-319-96728-8_11

Cover

Abstract This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such as ocean exploration by underwater vehicles, and package transportation in a warehouse by mobile robots. The novel feature of this paper is that we propose a Tabu methodology consisting of an Adaptive Action Selection Rule and a Tabu Action Elimination Strategy to improve the classic Temporal Difference (TD) learning approach. Furthermore, two classic TD learning algorithms (i.e., Q-learning and SASRA) are revised by the proposed Tabu methodology for optimizing learning performance. We use a simulated environment to evaluate the proposed algorithms. The results show that the proposed approach can provide an effective solution for generating collision-free and safety paths for robots in uncertain environments.
AbstractList This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such as ocean exploration by underwater vehicles, and package transportation in a warehouse by mobile robots. The novel feature of this paper is that we propose a Tabu methodology consisting of an Adaptive Action Selection Rule and a Tabu Action Elimination Strategy to improve the classic Temporal Difference (TD) learning approach. Furthermore, two classic TD learning algorithms (i.e., Q-learning and SASRA) are revised by the proposed Tabu methodology for optimizing learning performance. We use a simulated environment to evaluate the proposed algorithms. The results show that the proposed approach can provide an effective solution for generating collision-free and safety paths for robots in uncertain environments.
Author Ni, Fusheng
Wei, Changyun
Author_xml – sequence: 1
  givenname: Changyun
  surname: Wei
  fullname: Wei, Changyun
  email: weichangyun@hotmail.com
  organization: College of Mechanical and Electrical Engineering, Hohai University, Changzhou, China
– sequence: 2
  givenname: Fusheng
  surname: Ni
  fullname: Ni, Fusheng
  organization: College of Mechanical and Electrical Engineering, Hohai University, Changzhou, China
BookMark eNo1kN1OAjEQhauiEZA38GJfoNppS7u9NIg_CYlE4bpply6sQru2i89vAb2ZmZye05x8A9TzwTuEboHcASHyXskSM8xAYSUkLXGpAc7QgGXlKNBz1AcBgBnj6gKNsv__TUIP9QkjFCvJ2RUaAOGCCAlKXqNRSp-EEEqYolT00cfC2H2xcLs2RLMtHpu6dtH5yhUzZ6Jv_LqoQyzegw1dMTfdpphvjT_qjS-W2Rg7k6-p_2li8Dvnu3SDLmuzTW70t4do-TRdTF7w7O35dfIww-vcv8OWGEOBUVvZiuY5zjW55GqloLa8psSsFOd5ApQlOOpWwiqhTEVdtlLDhoie_k1tzIVc1DaEr6SB6ANDnZlopjMVfUSmDwxziJ9CbQzfe5c67Q6pKhfPAKqNaTsXkxZUjilTGrjUwMbsF2Lycao
ContentType Book Chapter
Copyright Springer International Publishing AG, part of Springer Nature 2018
Copyright_xml – notice: Springer International Publishing AG, part of Springer Nature 2018
DBID FFUUA
DOI 10.1007/978-3-319-96728-8_11
DatabaseName ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISBN 3319967282
9783319967288
EISSN 1611-3349
Editor Giuliani, Manuel
Assaf, Tareq
Giannaccini, Maria Elena
Editor_xml – sequence: 1
  givenname: Manuel
  orcidid: 0000-0003-3781-7623
  surname: Giuliani
  fullname: Giuliani, Manuel
  email: manuel.giuliani@brl.ac.uk
– sequence: 2
  givenname: Tareq
  surname: Assaf
  fullname: Assaf, Tareq
  email: t.assaf@bath.ac.uk
– sequence: 3
  givenname: Maria Elena
  surname: Giannaccini
  fullname: Giannaccini, Maria Elena
  email: maria.elena.giannaccini@brl.ac.uk
EndPage 134
ExternalDocumentID EBC6275239_147_135
GroupedDBID 0D6
0DA
38.
AABBV
ACOUV
AEDXK
AEJLV
AEKFX
AEZAY
ALMA_UNASSIGNED_HOLDINGS
ANXHU
BBABE
BICGV
BJAWL
BUBNW
CVGDX
CZZ
EDOXC
FFUUA
FOYMO
I4C
IEZ
NQNQZ
OEBZI
SBO
TPJZQ
TSXQS
Z5O
Z7R
Z7S
Z7U
Z7V
Z7W
Z7X
Z7Y
Z7Z
Z81
Z82
Z83
Z84
Z85
Z87
Z88
-DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ABMNI
ACGFS
ADCXD
AEFIE
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RNI
RSU
SVGTG
VI1
~02
ID FETCH-LOGICAL-g282t-b0aa2132bcbc22bc50464749d91fb4f20ad9440ad11881e2ed6b969ac2e5042a3
ISBN 9783319967271
3319967274
ISSN 0302-9743
IngestDate Wed Sep 17 03:14:55 EDT 2025
Thu May 29 16:30:24 EDT 2025
IsPeerReviewed true
IsScholarly true
LCCallNum QA75.5-76.95
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-g282t-b0aa2132bcbc22bc50464749d91fb4f20ad9440ad11881e2ed6b969ac2e5042a3
OCLC 1046067197
PQID EBC6275239_147_135
PageCount 12
ParticipantIDs springer_books_10_1007_978_3_319_96728_8_11
proquest_ebookcentralchapters_6275239_147_135
PublicationCentury 2000
PublicationDate 2018
PublicationDateYYYYMMDD 2018-01-01
PublicationDate_xml – year: 2018
  text: 2018
PublicationDecade 2010
PublicationPlace Switzerland
PublicationPlace_xml – name: Switzerland
– name: Cham
PublicationSeriesSubtitle Lecture Notes in Artificial Intelligence
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSeriesTitleAlternate Lect.Notes Computer
PublicationSubtitle 19th Annual Conference, TAROS 2018, Bristol, UK July 25-27, 2018, Proceedings
PublicationTitle Towards Autonomous Robotic Systems
PublicationYear 2018
Publisher Springer International Publishing AG
Springer International Publishing
Publisher_xml – name: Springer International Publishing AG
– name: Springer International Publishing
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Steffen, Bernhard
Pandu Rangan, C.
Kanade, Takeo
Kittler, Josef
Weikum, Gerhard
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, United Kingdom
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, United Kingdom
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Dept Applied Math & Computer Science, Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology Madras, Chennai, India
– sequence: 9
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: TU Dortmund University, Dortmund, Germany
– sequence: 10
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 11
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 12
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max Planck Institute for Informatics, Saarbrücken, Germany
SSID ssj0002039226
ssj0002792
Score 2.1072583
Snippet This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or...
SourceID springer
proquest
SourceType Publisher
StartPage 123
SubjectTerms Path planning
Reinforcement learning
Uncertain environments
Title Tabu Temporal Difference Learning for Robot Path Planning in Uncertain Environments
URI http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6275239&ppg=135
http://link.springer.com/10.1007/978-3-319-96728-8_11
Volume 10965
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLegXIADY4AYMOQDtyiotpM0OXCYUKdpKj1Ai3az7NTZdkklkhy2v573_JG6ZZdxsSrLrV2_n579vn4m5EtW6LqpmEpNga4blU9TZUqTNsaUdTVDjioscP6xLC7W2eVVfrULxdjqkl5_re8frCv5H6lCH8gVq2QfIdnxR6EDPoN8oQUJQ3tw-d13szoB24TXLjkbeixMwFTWn1u9RQLWmIbcRl1ufWS9vb4bRjQsbe_50N0Yf3wF6Cg9JCvHWYVKsQlctIvgR8HcRDsZUvzfjC8fofNkDQNtlkEyj2ronPJCUuXu28KHLZbb3maDJeFliaBoYk8EKw88EcETeeDL3LnT9kxXITD_GW5PLNJ4AtQzGDhO4xmnkQvkWRSO19RrWcZFdGAz5w395yyI0z-wVAtnK9NSYiX4U1jAhDw7m18ufo8uOT6FyyLeRv1BjtyKLgjlVoWlQWHVjlMy-hdRWeZDU-4ZMAcxd3uVWR2Rl1jeQrHuBPbvNXli2mPyKoiAehEckxcRU-Ub8gsxQQMm6A4TNGCCAiaoxQRFTNCACXrb0hETNMbEW7I-n6--X6T-PY70GgzzPtVTpTgTXNe65tDmGBafZdWmYo3OGj5VmyrLoAWjtWSGm02hq6JSNTcwlCvxjkzabWveE1qAGYDUhbwwsyzPhBIwskEHiYLvKnFC0rBd0mYN-FTl2m1OJ5Fcm4sKLNeZZCI_IUnYU4nDOxnouEEYUkgQhrTCkCiMD48a_ZE836H9E5n0fwZzCjfRXn_2CPoL80yBOQ
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Towards+Autonomous+Robotic+Systems&rft.au=Wei%2C+Changyun&rft.au=Ni%2C+Fusheng&rft.atitle=Tabu+Temporal+Difference+Learning+for+Robot+Path+Planning+in+Uncertain+Environments&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2018-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783319967271&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=123&rft.epage=134&rft_id=info:doi/10.1007%2F978-3-319-96728-8_11
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6275239-l.jpg