Robot Navigation in Crowds Environment Base Deep Reinforcement Learning with POMDP
With the development of deep learning technology, the navigation technology of mobile robot based on deep reinforcement learning is developing rapidly. But, navigation policy based on deep reinforcement learning still needs to be improved in crowds environment. The motion intention of pedestrians in...
        Saved in:
      
    
          | Published in | Multimedia Technology and Enhanced Learning Vol. 446; pp. 675 - 685 | 
|---|---|
| Main Authors | , , , | 
| Format | Book Chapter | 
| Language | English | 
| Published | 
        Switzerland
          Springer
    
        2022
     Springer Nature Switzerland  | 
| Series | Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 3031181220 9783031181221  | 
| ISSN | 1867-8211 1867-822X  | 
| DOI | 10.1007/978-3-031-18123-8_53 | 
Cover
| Abstract | With the development of deep learning technology, the navigation technology of mobile robot based on deep reinforcement learning is developing rapidly. But, navigation policy based on deep reinforcement learning still needs to be improved in crowds environment. The motion intention of pedestrians in crowds environment is variable, and the current motion intention information of pedestrian cannot be judged by only relying on a single frame of sensor sensing information. Therefore, in the case of only one frame of input, the pedestrian motion state information is partially observable. To dealing with this problem, we present the P-RL algorithm in this paper. The algorithm replaces traditional deep reinforcement learning Markov Decision Process model with a Partially Observable Markov Decision Process model, and introduces the LSTM neural network into the deep reinforcement learning algorithm. The LSTM neural network has the ability to process time series information, so that makes the algorithm has the ability to perceive the relationship between the observation data of each frame, which enhances the robustness of the algorithm. Experimental results show our algorithm is superior to other algorithms in time overhead and navigation success rate in crowds environment. | 
    
|---|---|
| AbstractList | With the development of deep learning technology, the navigation technology of mobile robot based on deep reinforcement learning is developing rapidly. But, navigation policy based on deep reinforcement learning still needs to be improved in crowds environment. The motion intention of pedestrians in crowds environment is variable, and the current motion intention information of pedestrian cannot be judged by only relying on a single frame of sensor sensing information. Therefore, in the case of only one frame of input, the pedestrian motion state information is partially observable. To dealing with this problem, we present the P-RL algorithm in this paper. The algorithm replaces traditional deep reinforcement learning Markov Decision Process model with a Partially Observable Markov Decision Process model, and introduces the LSTM neural network into the deep reinforcement learning algorithm. The LSTM neural network has the ability to process time series information, so that makes the algorithm has the ability to perceive the relationship between the observation data of each frame, which enhances the robustness of the algorithm. Experimental results show our algorithm is superior to other algorithms in time overhead and navigation success rate in crowds environment. | 
    
| Author | Li, Haiming Li, Qinghua Wang, Jiahui Feng, Chao  | 
    
| Author_xml | – sequence: 1 givenname: Qinghua surname: Li fullname: Li, Qinghua – sequence: 2 givenname: Haiming surname: Li fullname: Li, Haiming – sequence: 3 givenname: Jiahui surname: Wang fullname: Wang, Jiahui – sequence: 4 givenname: Chao surname: Feng fullname: Feng, Chao email: cfeng@qlu.edu.cn  | 
    
| BookMark | eNpFkMtOwzAQRc1TtNA_YOEfCNjjJHaW0PKSykMVSOws20xooNjBDvT3SVsEmsVI9-qO5p4h2fXBIyHHnJ1wxuRpJVUmMiZ4xhUHkSldiC0yFL2yFqptMuCqlJkCeN75N4Dt_hmc75MhF7kqK4A8PyCjlN4YYyAFVFIOyGwWbOjonfluXk3XBE8bT8cxLF8SvfDfTQz-A31Hz01COkFs6QwbX4focK1P0UTf-Fe6bLo5fbi_nTwckb3aLBKOfvchebq8eBxfZ9P7q5vx2TRreaFEZkpEiY5j4ZizsmI5stqql8oCgHIWLJfgamdQ9JNL63IljTUWBDNYKnFIYHM3tbH_AKO2IbwnzZle8dM9Py10D0WvcekVvz6Ub0JtDJ9fmDqNq5Tru0SzcHPTdhiTlpxXOYAuS6FLVYgfFNtxQw | 
    
| ContentType | Book Chapter | 
    
| Copyright | ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2022 | 
    
| Copyright_xml | – notice: ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2022 | 
    
| DBID | FFUUA | 
    
| DEWEY | 371.334 | 
    
| DOI | 10.1007/978-3-031-18123-8_53 | 
    
| DatabaseName | ProQuest Ebook Central - Book Chapters - Demo use only | 
    
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Education Computer Science  | 
    
| EISBN | 3031181239 9783031181238  | 
    
| EISSN | 1867-822X | 
    
| Editor | Zhang, Yu-Dong Wang, Shui-Hua  | 
    
| Editor_xml | – sequence: 1 fullname: Zhang, Yu-Dong – sequence: 2 fullname: Wang, Shui-Hua  | 
    
| EndPage | 685 | 
    
| ExternalDocumentID | EBC7119422_663_685 | 
    
| GroupedDBID | 38. AABBV AAZWU ABSVR ABTHU ABVND ACBPT ACHZO ACPMC ADNVS AEJLV AEKFX AHVRR AIYYB AJQNT ALMA_UNASSIGNED_HOLDINGS BBABE CZZ FFUUA IEZ SBO TPJZQ Z81 Z83  | 
    
| ID | FETCH-LOGICAL-p1583-a6ee7ec1e5c0cb7904e0fb8d9b2228cb2b172cfcae3e3e47bc487abab230ae683 | 
    
| ISBN | 3031181220 9783031181221  | 
    
| ISSN | 1867-8211 | 
    
| IngestDate | Tue Jul 29 20:28:40 EDT 2025 Thu May 29 01:10:12 EDT 2025  | 
    
| IsPeerReviewed | false | 
    
| IsScholarly | false | 
    
| LCCallNum | LB1028.43-1028.75 | 
    
| Language | English | 
    
| LinkModel | OpenURL | 
    
| MergedId | FETCHMERGED-LOGICAL-p1583-a6ee7ec1e5c0cb7904e0fb8d9b2228cb2b172cfcae3e3e47bc487abab230ae683 | 
    
| OCLC | 1348692244 | 
    
| PQID | EBC7119422_663_685 | 
    
| PageCount | 11 | 
    
| ParticipantIDs | springer_books_10_1007_978_3_031_18123_8_53 proquest_ebookcentralchapters_7119422_663_685  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2022 | 
    
| PublicationDateYYYYMMDD | 2022-01-01 | 
    
| PublicationDate_xml | – year: 2022 text: 2022  | 
    
| PublicationDecade | 2020 | 
    
| PublicationPlace | Switzerland | 
    
| PublicationPlace_xml | – name: Switzerland – name: Cham  | 
    
| PublicationSeriesTitle | Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering | 
    
| PublicationSeriesTitleAlternate | Lect.Notes Social.Inform. | 
    
| PublicationSubtitle | 4th EAI International Conference, ICMTEL 2022, Virtual Event, April 15-16, 2022, Proceedings | 
    
| PublicationTitle | Multimedia Technology and Enhanced Learning | 
    
| PublicationYear | 2022 | 
    
| Publisher | Springer Springer Nature Switzerland  | 
    
| Publisher_xml | – name: Springer – name: Springer Nature Switzerland  | 
    
| RelatedPersons | Jia, Xiaohua Dressler, Falko Ferrari, Domenico Kobayashi, Hisashi Shen, Xuemin Cao, Jiannong Palazzo, Sergio Coulson, Geoffrey Stan, Mircea Gerla, Mario Akan, Ozgur Bellavista, Paolo Zomaya, Albert Y. Sahni, Sartaj  | 
    
| RelatedPersons_xml | – sequence: 1 givenname: Ozgur surname: Akan fullname: Akan, Ozgur – sequence: 2 givenname: Paolo surname: Bellavista fullname: Bellavista, Paolo – sequence: 3 givenname: Jiannong surname: Cao fullname: Cao, Jiannong – sequence: 4 givenname: Geoffrey surname: Coulson fullname: Coulson, Geoffrey – sequence: 5 givenname: Falko surname: Dressler fullname: Dressler, Falko – sequence: 6 givenname: Domenico surname: Ferrari fullname: Ferrari, Domenico – sequence: 7 givenname: Mario surname: Gerla fullname: Gerla, Mario – sequence: 8 givenname: Hisashi surname: Kobayashi fullname: Kobayashi, Hisashi – sequence: 9 givenname: Sergio surname: Palazzo fullname: Palazzo, Sergio – sequence: 10 givenname: Sartaj surname: Sahni fullname: Sahni, Sartaj – sequence: 11 givenname: Xuemin orcidid: 0000-0002-4140-287X surname: Shen fullname: Shen, Xuemin – sequence: 12 givenname: Mircea surname: Stan fullname: Stan, Mircea – sequence: 13 givenname: Xiaohua surname: Jia fullname: Jia, Xiaohua – sequence: 14 givenname: Albert Y. surname: Zomaya fullname: Zomaya, Albert Y.  | 
    
| SSID | ssj0002732977 ssib023167542 ssj0000608566  | 
    
| Score | 1.6578498 | 
    
| Snippet | With the development of deep learning technology, the navigation technology of mobile robot based on deep reinforcement learning is developing rapidly. But,... | 
    
| SourceID | springer proquest  | 
    
| SourceType | Publisher | 
    
| StartPage | 675 | 
    
| SubjectTerms | Deep reinforcement learning Partially observable Markov decision process Robot navigation  | 
    
| Title | Robot Navigation in Crowds Environment Base Deep Reinforcement Learning with POMDP | 
    
| URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=7119422&ppg=685 http://link.springer.com/10.1007/978-3-031-18123-8_53  | 
    
| Volume | 446 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3JbtswECUc91L0kK5ouoGH5mSokEmthx5S14VhJOnmtLkRJEPF7sEKYruHfE2-Jf_Tf-gMF1l2c0lhQDAI2qJmHkbD4bwZQt7GvExkqfrouaVRUhVxJNO8jMo8ZTnTqaoMbhSPjrPRSTI-TU87nT-trKXVUr3TV7fySv5HqzAGekWW7B002_wpDMB30C9cQcNw3XJ-N8OsrsMQpgJa4kcrPm7PAobzqTvX98VTz5usG3t0_xVGpiu5OTiS2N6rmfnTh5HHMzldzdb-ohsdTGXdxtq3WtVLMNS_bb0Omzy5P2D7B_EANvlni95wzabrfYDXJlg5cwGatVVbtQ1QNmu1kWH36y-fjz46SgBK0yzeH_oDj-N66WrlutTPkO6AGZOhS0UwWhamnoPsqVdNYeoJ9gBqE2QW7fKM7XgIY1vxkBAPxWRvXNF3WPWVI05vbJ3h1Y2cW-b42d76F_DWKJi3_qY9Znu-ByufuWYv3mHIXM-hf95F7fQTuFmEd-NRIVK-Q3ZgAV1y72A4PvwRzB_DogShH7FzJMAf9t72L1d5iJW2l2izUGQrhQfxFczWD9Ziit62io091VYagPWuJg_JA2TcUKTCgOYekY6ZPya7QZPUa_IJmVic0TXO6Gx-c-0wRlsYo4gxihijGxijAWMUMXZzbfH1lJx8Gk4Go8h3BYku-mnBI5kZkxvdN6mOtcrLODFxpYqzUmEwUyumwCfXlZaGwyfJlYY9uVRSwWZbmqzgz0h3Xs_Nc0JZpuIySY3kBiZliTSKZ0Ulk4RXkrNqj0RBQsLmLviEae3ksRB5v18mjAlw2wXAYI_0ghgFTl-IUBQc5C-4APkLK3-B8n9xp9kvyf012l-R7vJyZV6DP7xUbzyO_gLa7K8y | 
    
| linkProvider | Library Specific Holdings | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Multimedia+Technology+and+Enhanced+Learning&rft.au=Li%2C+Qinghua&rft.au=Li%2C+Haiming&rft.au=Wang%2C+Jiahui&rft.au=Feng%2C+Chao&rft.atitle=Robot+Navigation+in%C2%A0Crowds+Environment+Base+Deep+Reinforcement+Learning+with%C2%A0POMDP&rft.series=Lecture+Notes+of+the+Institute+for+Computer+Sciences%2C+Social+Informatics+and+Telecommunications+Engineering&rft.date=2022-01-01&rft.pub=Springer+Nature+Switzerland&rft.isbn=9783031181221&rft.issn=1867-8211&rft.eissn=1867-822X&rft.spage=675&rft.epage=685&rft_id=info:doi/10.1007%2F978-3-031-18123-8_53 | 
    
| thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F7119422-l.jpg |