Tabu Temporal Difference Learning for Robot Path Planning in Uncertain Environments

This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such a...

Full description

Saved in:

Bibliographic Details
Published in	Towards Autonomous Robotic Systems Vol. 10965; pp. 123 - 134
Main Authors	Wei, Changyun, Ni, Fusheng
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 2018 Springer International Publishing
Series	Lecture Notes in Computer Science
Subjects	Path planning Reinforcement learning Uncertain environments
Online Access	Get full text
ISBN	9783319967271 3319967274
ISSN	0302-9743 1611-3349
DOI	10.1007/978-3-319-96728-8_11

Cover

Abstract	This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such as ocean exploration by underwater vehicles, and package transportation in a warehouse by mobile robots. The novel feature of this paper is that we propose a Tabu methodology consisting of an Adaptive Action Selection Rule and a Tabu Action Elimination Strategy to improve the classic Temporal Difference (TD) learning approach. Furthermore, two classic TD learning algorithms (i.e., Q-learning and SASRA) are revised by the proposed Tabu methodology for optimizing learning performance. We use a simulated environment to evaluate the proposed algorithms. The results show that the proposed approach can provide an effective solution for generating collision-free and safety paths for robots in uncertain environments.
AbstractList	This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such as ocean exploration by underwater vehicles, and package transportation in a warehouse by mobile robots. The novel feature of this paper is that we propose a Tabu methodology consisting of an Adaptive Action Selection Rule and a Tabu Action Elimination Strategy to improve the classic Temporal Difference (TD) learning approach. Furthermore, two classic TD learning algorithms (i.e., Q-learning and SASRA) are revised by the proposed Tabu methodology for optimizing learning performance. We use a simulated environment to evaluate the proposed algorithms. The results show that the proposed approach can provide an effective solution for generating collision-free and safety paths for robots in uncertain environments.
Author	Ni, Fusheng Wei, Changyun
Author_xml	– sequence: 1 givenname: Changyun surname: Wei fullname: Wei, Changyun email: weichangyun@hotmail.com organization: College of Mechanical and Electrical Engineering, Hohai University, Changzhou, China – sequence: 2 givenname: Fusheng surname: Ni fullname: Ni, Fusheng organization: College of Mechanical and Electrical Engineering, Hohai University, Changzhou, China
BookMark	eNo1kN1OAjEQhauiEZA38GJfoNppS7u9NIg_CYlE4bpply6sQru2i89vAb2ZmZye05x8A9TzwTuEboHcASHyXskSM8xAYSUkLXGpAc7QgGXlKNBz1AcBgBnj6gKNsv__TUIP9QkjFCvJ2RUaAOGCCAlKXqNRSp-EEEqYolT00cfC2H2xcLs2RLMtHpu6dtH5yhUzZ6Jv_LqoQyzegw1dMTfdpphvjT_qjS-W2Rg7k6-p_2li8Dvnu3SDLmuzTW70t4do-TRdTF7w7O35dfIww-vcv8OWGEOBUVvZiuY5zjW55GqloLa8psSsFOd5ApQlOOpWwiqhTEVdtlLDhoie_k1tzIVc1DaEr6SB6ANDnZlopjMVfUSmDwxziJ9CbQzfe5c67Q6pKhfPAKqNaTsXkxZUjilTGrjUwMbsF2Lycao
ContentType	Book Chapter
Copyright	Springer International Publishing AG, part of Springer Nature 2018
Copyright_xml	– notice: Springer International Publishing AG, part of Springer Nature 2018
DBID	FFUUA
DOI	10.1007/978-3-319-96728-8_11
DatabaseName	ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering Computer Science
EISBN	3319967282 9783319967288
EISSN	1611-3349
Editor	Giuliani, Manuel Assaf, Tareq Giannaccini, Maria Elena
Editor_xml	– sequence: 1 givenname: Manuel orcidid: 0000-0003-3781-7623 surname: Giuliani fullname: Giuliani, Manuel email: manuel.giuliani@brl.ac.uk – sequence: 2 givenname: Tareq surname: Assaf fullname: Assaf, Tareq email: t.assaf@bath.ac.uk – sequence: 3 givenname: Maria Elena surname: Giannaccini fullname: Giannaccini, Maria Elena email: maria.elena.giannaccini@brl.ac.uk
EndPage	134
ExternalDocumentID	EBC6275239_147_135
GroupedDBID	0D6 0DA 38. AABBV ACOUV AEDXK AEJLV AEKFX AEZAY ALMA_UNASSIGNED_HOLDINGS ANXHU BBABE BICGV BJAWL BUBNW CVGDX CZZ EDOXC FFUUA FOYMO I4C IEZ NQNQZ OEBZI SBO TPJZQ TSXQS Z5O Z7R Z7S Z7U Z7V Z7W Z7X Z7Y Z7Z Z81 Z82 Z83 Z84 Z85 Z87 Z88 -DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ABMNI ACGFS ADCXD AEFIE EJD F5P FEDTE HVGLF LAS LDH P2P RNI RSU SVGTG VI1 ~02
ID	FETCH-LOGICAL-g282t-b0aa2132bcbc22bc50464749d91fb4f20ad9440ad11881e2ed6b969ac2e5042a3
ISBN	9783319967271 3319967274
ISSN	0302-9743
IngestDate	Wed Sep 17 03:14:55 EDT 2025 Thu May 29 16:30:24 EDT 2025
IsPeerReviewed	true
IsScholarly	true
LCCallNum	QA75.5-76.95
Language	English
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-g282t-b0aa2132bcbc22bc50464749d91fb4f20ad9440ad11881e2ed6b969ac2e5042a3
OCLC	1046067197
PQID	EBC6275239_147_135
PageCount	12
ParticipantIDs	springer_books_10_1007_978_3_319_96728_8_11 proquest_ebookcentralchapters_6275239_147_135
PublicationCentury	2000
PublicationDate	2018
PublicationDateYYYYMMDD	2018-01-01
PublicationDate_xml	– year: 2018 text: 2018
PublicationDecade	2010
PublicationPlace	Switzerland
PublicationPlace_xml	– name: Switzerland – name: Cham
PublicationSeriesSubtitle	Lecture Notes in Artificial Intelligence
PublicationSeriesTitle	Lecture Notes in Computer Science
PublicationSeriesTitleAlternate	Lect.Notes Computer
PublicationSubtitle	19th Annual Conference, TAROS 2018, Bristol, UK July 25-27, 2018, Proceedings
PublicationTitle	Towards Autonomous Robotic Systems
PublicationYear	2018
Publisher	Springer International Publishing AG Springer International Publishing
Publisher_xml	– name: Springer International Publishing AG – name: Springer International Publishing
RelatedPersons	Kleinberg, Jon M. Mattern, Friedemann Naor, Moni Mitchell, John C. Terzopoulos, Demetri Steffen, Bernhard Pandu Rangan, C. Kanade, Takeo Kittler, Josef Weikum, Gerhard Hutchison, David Tygar, Doug
RelatedPersons_xml	– sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David organization: Lancaster University, Lancaster, United Kingdom – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo organization: Carnegie Mellon University, Pittsburgh, USA – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef organization: University of Surrey, Guildford, United Kingdom – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. organization: Cornell University, Ithaca, USA – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann organization: ETH Zurich, Zurich, Switzerland – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. organization: Stanford University, Stanford, USA – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni organization: Dept Applied Math & Computer Science, Weizmann Institute of Science, Rehovot, Israel – sequence: 8 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. organization: Indian Institute of Technology Madras, Chennai, India – sequence: 9 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: TU Dortmund University, Dortmund, Germany – sequence: 10 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri organization: University of California, Los Angeles, USA – sequence: 11 givenname: Doug surname: Tygar fullname: Tygar, Doug organization: University of California, Berkeley, USA – sequence: 12 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard organization: Max Planck Institute for Informatics, Saarbrücken, Germany
SSID	ssj0002039226 ssj0002792
Score	2.1072583
Snippet	This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or...
SourceID	springer proquest
SourceType	Publisher
StartPage	123
SubjectTerms	Path planning Reinforcement learning Uncertain environments
Title	Tabu Temporal Difference Learning for Robot Path Planning in Uncertain Environments
URI	http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6275239&ppg=135 http://link.springer.com/10.1007/978-3-319-96728-8_11
Volume	10965
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLegXIADY4AYMOQDtyiotpM0OXCYUKdpKj1Ai3az7NTZdkklkhy2v573_JG6ZZdxsSrLrV2_n579vn4m5EtW6LqpmEpNga4blU9TZUqTNsaUdTVDjioscP6xLC7W2eVVfrULxdjqkl5_re8frCv5H6lCH8gVq2QfIdnxR6EDPoN8oQUJQ3tw-d13szoB24TXLjkbeixMwFTWn1u9RQLWmIbcRl1ufWS9vb4bRjQsbe_50N0Yf3wF6Cg9JCvHWYVKsQlctIvgR8HcRDsZUvzfjC8fofNkDQNtlkEyj2ronPJCUuXu28KHLZbb3maDJeFliaBoYk8EKw88EcETeeDL3LnT9kxXITD_GW5PLNJ4AtQzGDhO4xmnkQvkWRSO19RrWcZFdGAz5w395yyI0z-wVAtnK9NSYiX4U1jAhDw7m18ufo8uOT6FyyLeRv1BjtyKLgjlVoWlQWHVjlMy-hdRWeZDU-4ZMAcxd3uVWR2Rl1jeQrHuBPbvNXli2mPyKoiAehEckxcRU-Ub8gsxQQMm6A4TNGCCAiaoxQRFTNCACXrb0hETNMbEW7I-n6--X6T-PY70GgzzPtVTpTgTXNe65tDmGBafZdWmYo3OGj5VmyrLoAWjtWSGm02hq6JSNTcwlCvxjkzabWveE1qAGYDUhbwwsyzPhBIwskEHiYLvKnFC0rBd0mYN-FTl2m1OJ5Fcm4sKLNeZZCI_IUnYU4nDOxnouEEYUkgQhrTCkCiMD48a_ZE836H9E5n0fwZzCjfRXn_2CPoL80yBOQ
linkProvider	Library Specific Holdings
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Towards+Autonomous+Robotic+Systems&rft.au=Wei%2C+Changyun&rft.au=Ni%2C+Fusheng&rft.atitle=Tabu+Temporal+Difference+Learning+for+Robot+Path+Planning+in+Uncertain+Environments&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2018-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783319967271&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=123&rft.epage=134&rft_id=info:doi/10.1007%2F978-3-319-96728-8_11
thumbnail_s	http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6275239-l.jpg