Tabu Temporal Difference Learning for Robot Path Planning in Uncertain Environments
This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such a...
        Saved in:
      
    
          | Published in | Towards Autonomous Robotic Systems Vol. 10965; pp. 123 - 134 | 
|---|---|
| Main Authors | , | 
| Format | Book Chapter | 
| Language | English | 
| Published | 
        Switzerland
          Springer International Publishing AG
    
        2018
     Springer International Publishing  | 
| Series | Lecture Notes in Computer Science | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 9783319967271 3319967274  | 
| ISSN | 0302-9743 1611-3349  | 
| DOI | 10.1007/978-3-319-96728-8_11 | 
Cover
| Abstract | This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such as ocean exploration by underwater vehicles, and package transportation in a warehouse by mobile robots. The novel feature of this paper is that we propose a Tabu methodology consisting of an Adaptive Action Selection Rule and a Tabu Action Elimination Strategy to improve the classic Temporal Difference (TD) learning approach. Furthermore, two classic TD learning algorithms (i.e., Q-learning and SASRA) are revised by the proposed Tabu methodology for optimizing learning performance. We use a simulated environment to evaluate the proposed algorithms. The results show that the proposed approach can provide an effective solution for generating collision-free and safety paths for robots in uncertain environments. | 
    
|---|---|
| AbstractList | This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or obstacles, as well as rectify actuation errors caused by environmental disturbances. This problem is motivated by many practical applications, such as ocean exploration by underwater vehicles, and package transportation in a warehouse by mobile robots. The novel feature of this paper is that we propose a Tabu methodology consisting of an Adaptive Action Selection Rule and a Tabu Action Elimination Strategy to improve the classic Temporal Difference (TD) learning approach. Furthermore, two classic TD learning algorithms (i.e., Q-learning and SASRA) are revised by the proposed Tabu methodology for optimizing learning performance. We use a simulated environment to evaluate the proposed algorithms. The results show that the proposed approach can provide an effective solution for generating collision-free and safety paths for robots in uncertain environments. | 
    
| Author | Ni, Fusheng Wei, Changyun  | 
    
| Author_xml | – sequence: 1 givenname: Changyun surname: Wei fullname: Wei, Changyun email: weichangyun@hotmail.com organization: College of Mechanical and Electrical Engineering, Hohai University, Changzhou, China – sequence: 2 givenname: Fusheng surname: Ni fullname: Ni, Fusheng organization: College of Mechanical and Electrical Engineering, Hohai University, Changzhou, China  | 
    
| BookMark | eNo1kN1OAjEQhauiEZA38GJfoNppS7u9NIg_CYlE4bpply6sQru2i89vAb2ZmZye05x8A9TzwTuEboHcASHyXskSM8xAYSUkLXGpAc7QgGXlKNBz1AcBgBnj6gKNsv__TUIP9QkjFCvJ2RUaAOGCCAlKXqNRSp-EEEqYolT00cfC2H2xcLs2RLMtHpu6dtH5yhUzZ6Jv_LqoQyzegw1dMTfdpphvjT_qjS-W2Rg7k6-p_2li8Dvnu3SDLmuzTW70t4do-TRdTF7w7O35dfIww-vcv8OWGEOBUVvZiuY5zjW55GqloLa8psSsFOd5ApQlOOpWwiqhTEVdtlLDhoie_k1tzIVc1DaEr6SB6ANDnZlopjMVfUSmDwxziJ9CbQzfe5c67Q6pKhfPAKqNaTsXkxZUjilTGrjUwMbsF2Lycao | 
    
| ContentType | Book Chapter | 
    
| Copyright | Springer International Publishing AG, part of Springer Nature 2018 | 
    
| Copyright_xml | – notice: Springer International Publishing AG, part of Springer Nature 2018 | 
    
| DBID | FFUUA | 
    
| DOI | 10.1007/978-3-319-96728-8_11 | 
    
| DatabaseName | ProQuest Ebook Central - Book Chapters - Demo use only | 
    
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Engineering Computer Science  | 
    
| EISBN | 3319967282 9783319967288  | 
    
| EISSN | 1611-3349 | 
    
| Editor | Giuliani, Manuel Assaf, Tareq Giannaccini, Maria Elena  | 
    
| Editor_xml | – sequence: 1 givenname: Manuel orcidid: 0000-0003-3781-7623 surname: Giuliani fullname: Giuliani, Manuel email: manuel.giuliani@brl.ac.uk – sequence: 2 givenname: Tareq surname: Assaf fullname: Assaf, Tareq email: t.assaf@bath.ac.uk – sequence: 3 givenname: Maria Elena surname: Giannaccini fullname: Giannaccini, Maria Elena email: maria.elena.giannaccini@brl.ac.uk  | 
    
| EndPage | 134 | 
    
| ExternalDocumentID | EBC6275239_147_135 | 
    
| GroupedDBID | 0D6 0DA 38. AABBV ACOUV AEDXK AEJLV AEKFX AEZAY ALMA_UNASSIGNED_HOLDINGS ANXHU BBABE BICGV BJAWL BUBNW CVGDX CZZ EDOXC FFUUA FOYMO I4C IEZ NQNQZ OEBZI SBO TPJZQ TSXQS Z5O Z7R Z7S Z7U Z7V Z7W Z7X Z7Y Z7Z Z81 Z82 Z83 Z84 Z85 Z87 Z88 -DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ABMNI ACGFS ADCXD AEFIE EJD F5P FEDTE HVGLF LAS LDH P2P RNI RSU SVGTG VI1 ~02  | 
    
| ID | FETCH-LOGICAL-g282t-b0aa2132bcbc22bc50464749d91fb4f20ad9440ad11881e2ed6b969ac2e5042a3 | 
    
| ISBN | 9783319967271 3319967274  | 
    
| ISSN | 0302-9743 | 
    
| IngestDate | Wed Sep 17 03:14:55 EDT 2025 Thu May 29 16:30:24 EDT 2025  | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| LCCallNum | QA75.5-76.95 | 
    
| Language | English | 
    
| LinkModel | OpenURL | 
    
| MergedId | FETCHMERGED-LOGICAL-g282t-b0aa2132bcbc22bc50464749d91fb4f20ad9440ad11881e2ed6b969ac2e5042a3 | 
    
| OCLC | 1046067197 | 
    
| PQID | EBC6275239_147_135 | 
    
| PageCount | 12 | 
    
| ParticipantIDs | springer_books_10_1007_978_3_319_96728_8_11 proquest_ebookcentralchapters_6275239_147_135  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2018 | 
    
| PublicationDateYYYYMMDD | 2018-01-01 | 
    
| PublicationDate_xml | – year: 2018 text: 2018  | 
    
| PublicationDecade | 2010 | 
    
| PublicationPlace | Switzerland | 
    
| PublicationPlace_xml | – name: Switzerland – name: Cham  | 
    
| PublicationSeriesSubtitle | Lecture Notes in Artificial Intelligence | 
    
| PublicationSeriesTitle | Lecture Notes in Computer Science | 
    
| PublicationSeriesTitleAlternate | Lect.Notes Computer | 
    
| PublicationSubtitle | 19th Annual Conference, TAROS 2018, Bristol, UK July 25-27, 2018, Proceedings | 
    
| PublicationTitle | Towards Autonomous Robotic Systems | 
    
| PublicationYear | 2018 | 
    
| Publisher | Springer International Publishing AG Springer International Publishing  | 
    
| Publisher_xml | – name: Springer International Publishing AG – name: Springer International Publishing  | 
    
| RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Naor, Moni Mitchell, John C. Terzopoulos, Demetri Steffen, Bernhard Pandu Rangan, C. Kanade, Takeo Kittler, Josef Weikum, Gerhard Hutchison, David Tygar, Doug  | 
    
| RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David organization: Lancaster University, Lancaster, United Kingdom – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo organization: Carnegie Mellon University, Pittsburgh, USA – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef organization: University of Surrey, Guildford, United Kingdom – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. organization: Cornell University, Ithaca, USA – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann organization: ETH Zurich, Zurich, Switzerland – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. organization: Stanford University, Stanford, USA – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni organization: Dept Applied Math & Computer Science, Weizmann Institute of Science, Rehovot, Israel – sequence: 8 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. organization: Indian Institute of Technology Madras, Chennai, India – sequence: 9 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: TU Dortmund University, Dortmund, Germany – sequence: 10 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri organization: University of California, Los Angeles, USA – sequence: 11 givenname: Doug surname: Tygar fullname: Tygar, Doug organization: University of California, Berkeley, USA – sequence: 12 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard organization: Max Planck Institute for Informatics, Saarbrücken, Germany  | 
    
| SSID | ssj0002039226 ssj0002792  | 
    
| Score | 2.1072583 | 
    
| Snippet | This paper addresses the robot path planning problem in uncertain environments, where the robot has to avoid potential collisions with other agents or... | 
    
| SourceID | springer proquest  | 
    
| SourceType | Publisher | 
    
| StartPage | 123 | 
    
| SubjectTerms | Path planning Reinforcement learning Uncertain environments  | 
    
| Title | Tabu Temporal Difference Learning for Robot Path Planning in Uncertain Environments | 
    
| URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6275239&ppg=135 http://link.springer.com/10.1007/978-3-319-96728-8_11  | 
    
| Volume | 10965 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLegXIADY4AYMOQDtyiotpM0OXCYUKdpKj1Ai3az7NTZdkklkhy2v573_JG6ZZdxsSrLrV2_n579vn4m5EtW6LqpmEpNga4blU9TZUqTNsaUdTVDjioscP6xLC7W2eVVfrULxdjqkl5_re8frCv5H6lCH8gVq2QfIdnxR6EDPoN8oQUJQ3tw-d13szoB24TXLjkbeixMwFTWn1u9RQLWmIbcRl1ufWS9vb4bRjQsbe_50N0Yf3wF6Cg9JCvHWYVKsQlctIvgR8HcRDsZUvzfjC8fofNkDQNtlkEyj2ronPJCUuXu28KHLZbb3maDJeFliaBoYk8EKw88EcETeeDL3LnT9kxXITD_GW5PLNJ4AtQzGDhO4xmnkQvkWRSO19RrWcZFdGAz5w395yyI0z-wVAtnK9NSYiX4U1jAhDw7m18ufo8uOT6FyyLeRv1BjtyKLgjlVoWlQWHVjlMy-hdRWeZDU-4ZMAcxd3uVWR2Rl1jeQrHuBPbvNXli2mPyKoiAehEckxcRU-Ub8gsxQQMm6A4TNGCCAiaoxQRFTNCACXrb0hETNMbEW7I-n6--X6T-PY70GgzzPtVTpTgTXNe65tDmGBafZdWmYo3OGj5VmyrLoAWjtWSGm02hq6JSNTcwlCvxjkzabWveE1qAGYDUhbwwsyzPhBIwskEHiYLvKnFC0rBd0mYN-FTl2m1OJ5Fcm4sKLNeZZCI_IUnYU4nDOxnouEEYUkgQhrTCkCiMD48a_ZE836H9E5n0fwZzCjfRXn_2CPoL80yBOQ | 
    
| linkProvider | Library Specific Holdings | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Towards+Autonomous+Robotic+Systems&rft.au=Wei%2C+Changyun&rft.au=Ni%2C+Fusheng&rft.atitle=Tabu+Temporal+Difference+Learning+for+Robot+Path+Planning+in+Uncertain+Environments&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2018-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783319967271&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=123&rft.epage=134&rft_id=info:doi/10.1007%2F978-3-319-96728-8_11 | 
    
| thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6275239-l.jpg |