Robot Navigation in Crowds Environment Base Deep Reinforcement Learning with POMDP

With the development of deep learning technology, the navigation technology of mobile robot based on deep reinforcement learning is developing rapidly. But, navigation policy based on deep reinforcement learning still needs to be improved in crowds environment. The motion intention of pedestrians in...

Full description

Saved in:
Bibliographic Details
Published inMultimedia Technology and Enhanced Learning Vol. 446; pp. 675 - 685
Main Authors Li, Qinghua, Li, Haiming, Wang, Jiahui, Feng, Chao
Format Book Chapter
LanguageEnglish
Published Switzerland Springer 2022
Springer Nature Switzerland
SeriesLecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
Subjects
Online AccessGet full text
ISBN3031181220
9783031181221
ISSN1867-8211
1867-822X
DOI10.1007/978-3-031-18123-8_53

Cover

Abstract With the development of deep learning technology, the navigation technology of mobile robot based on deep reinforcement learning is developing rapidly. But, navigation policy based on deep reinforcement learning still needs to be improved in crowds environment. The motion intention of pedestrians in crowds environment is variable, and the current motion intention information of pedestrian cannot be judged by only relying on a single frame of sensor sensing information. Therefore, in the case of only one frame of input, the pedestrian motion state information is partially observable. To dealing with this problem, we present the P-RL algorithm in this paper. The algorithm replaces traditional deep reinforcement learning Markov Decision Process model with a Partially Observable Markov Decision Process model, and introduces the LSTM neural network into the deep reinforcement learning algorithm. The LSTM neural network has the ability to process time series information, so that makes the algorithm has the ability to perceive the relationship between the observation data of each frame, which enhances the robustness of the algorithm. Experimental results show our algorithm is superior to other algorithms in time overhead and navigation success rate in crowds environment.
AbstractList With the development of deep learning technology, the navigation technology of mobile robot based on deep reinforcement learning is developing rapidly. But, navigation policy based on deep reinforcement learning still needs to be improved in crowds environment. The motion intention of pedestrians in crowds environment is variable, and the current motion intention information of pedestrian cannot be judged by only relying on a single frame of sensor sensing information. Therefore, in the case of only one frame of input, the pedestrian motion state information is partially observable. To dealing with this problem, we present the P-RL algorithm in this paper. The algorithm replaces traditional deep reinforcement learning Markov Decision Process model with a Partially Observable Markov Decision Process model, and introduces the LSTM neural network into the deep reinforcement learning algorithm. The LSTM neural network has the ability to process time series information, so that makes the algorithm has the ability to perceive the relationship between the observation data of each frame, which enhances the robustness of the algorithm. Experimental results show our algorithm is superior to other algorithms in time overhead and navigation success rate in crowds environment.
Author Li, Haiming
Li, Qinghua
Wang, Jiahui
Feng, Chao
Author_xml – sequence: 1
  givenname: Qinghua
  surname: Li
  fullname: Li, Qinghua
– sequence: 2
  givenname: Haiming
  surname: Li
  fullname: Li, Haiming
– sequence: 3
  givenname: Jiahui
  surname: Wang
  fullname: Wang, Jiahui
– sequence: 4
  givenname: Chao
  surname: Feng
  fullname: Feng, Chao
  email: cfeng@qlu.edu.cn
BookMark eNpFkMtOwzAQRc1TtNA_YOEfCNjjJHaW0PKSykMVSOws20xooNjBDvT3SVsEmsVI9-qO5p4h2fXBIyHHnJ1wxuRpJVUmMiZ4xhUHkSldiC0yFL2yFqptMuCqlJkCeN75N4Dt_hmc75MhF7kqK4A8PyCjlN4YYyAFVFIOyGwWbOjonfluXk3XBE8bT8cxLF8SvfDfTQz-A31Hz01COkFs6QwbX4focK1P0UTf-Fe6bLo5fbi_nTwckb3aLBKOfvchebq8eBxfZ9P7q5vx2TRreaFEZkpEiY5j4ZizsmI5stqql8oCgHIWLJfgamdQ9JNL63IljTUWBDNYKnFIYHM3tbH_AKO2IbwnzZle8dM9Py10D0WvcekVvz6Ub0JtDJ9fmDqNq5Tru0SzcHPTdhiTlpxXOYAuS6FLVYgfFNtxQw
ContentType Book Chapter
Copyright ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2022
Copyright_xml – notice: ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2022
DBID FFUUA
DEWEY 371.334
DOI 10.1007/978-3-031-18123-8_53
DatabaseName ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Education
Computer Science
EISBN 3031181239
9783031181238
EISSN 1867-822X
Editor Zhang, Yu-Dong
Wang, Shui-Hua
Editor_xml – sequence: 1
  fullname: Zhang, Yu-Dong
– sequence: 2
  fullname: Wang, Shui-Hua
EndPage 685
ExternalDocumentID EBC7119422_663_685
GroupedDBID 38.
AABBV
AAZWU
ABSVR
ABTHU
ABVND
ACBPT
ACHZO
ACPMC
ADNVS
AEJLV
AEKFX
AHVRR
AIYYB
AJQNT
ALMA_UNASSIGNED_HOLDINGS
BBABE
CZZ
FFUUA
IEZ
SBO
TPJZQ
Z81
Z83
ID FETCH-LOGICAL-p1583-a6ee7ec1e5c0cb7904e0fb8d9b2228cb2b172cfcae3e3e47bc487abab230ae683
ISBN 3031181220
9783031181221
ISSN 1867-8211
IngestDate Tue Jul 29 20:28:40 EDT 2025
Thu May 29 01:10:12 EDT 2025
IsPeerReviewed false
IsScholarly false
LCCallNum LB1028.43-1028.75
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-p1583-a6ee7ec1e5c0cb7904e0fb8d9b2228cb2b172cfcae3e3e47bc487abab230ae683
OCLC 1348692244
PQID EBC7119422_663_685
PageCount 11
ParticipantIDs springer_books_10_1007_978_3_031_18123_8_53
proquest_ebookcentralchapters_7119422_663_685
PublicationCentury 2000
PublicationDate 2022
PublicationDateYYYYMMDD 2022-01-01
PublicationDate_xml – year: 2022
  text: 2022
PublicationDecade 2020
PublicationPlace Switzerland
PublicationPlace_xml – name: Switzerland
– name: Cham
PublicationSeriesTitle Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
PublicationSeriesTitleAlternate Lect.Notes Social.Inform.
PublicationSubtitle 4th EAI International Conference, ICMTEL 2022, Virtual Event, April 15-16, 2022, Proceedings
PublicationTitle Multimedia Technology and Enhanced Learning
PublicationYear 2022
Publisher Springer
Springer Nature Switzerland
Publisher_xml – name: Springer
– name: Springer Nature Switzerland
RelatedPersons Jia, Xiaohua
Dressler, Falko
Ferrari, Domenico
Kobayashi, Hisashi
Shen, Xuemin
Cao, Jiannong
Palazzo, Sergio
Coulson, Geoffrey
Stan, Mircea
Gerla, Mario
Akan, Ozgur
Bellavista, Paolo
Zomaya, Albert Y.
Sahni, Sartaj
RelatedPersons_xml – sequence: 1
  givenname: Ozgur
  surname: Akan
  fullname: Akan, Ozgur
– sequence: 2
  givenname: Paolo
  surname: Bellavista
  fullname: Bellavista, Paolo
– sequence: 3
  givenname: Jiannong
  surname: Cao
  fullname: Cao, Jiannong
– sequence: 4
  givenname: Geoffrey
  surname: Coulson
  fullname: Coulson, Geoffrey
– sequence: 5
  givenname: Falko
  surname: Dressler
  fullname: Dressler, Falko
– sequence: 6
  givenname: Domenico
  surname: Ferrari
  fullname: Ferrari, Domenico
– sequence: 7
  givenname: Mario
  surname: Gerla
  fullname: Gerla, Mario
– sequence: 8
  givenname: Hisashi
  surname: Kobayashi
  fullname: Kobayashi, Hisashi
– sequence: 9
  givenname: Sergio
  surname: Palazzo
  fullname: Palazzo, Sergio
– sequence: 10
  givenname: Sartaj
  surname: Sahni
  fullname: Sahni, Sartaj
– sequence: 11
  givenname: Xuemin
  orcidid: 0000-0002-4140-287X
  surname: Shen
  fullname: Shen, Xuemin
– sequence: 12
  givenname: Mircea
  surname: Stan
  fullname: Stan, Mircea
– sequence: 13
  givenname: Xiaohua
  surname: Jia
  fullname: Jia, Xiaohua
– sequence: 14
  givenname: Albert Y.
  surname: Zomaya
  fullname: Zomaya, Albert Y.
SSID ssj0002732977
ssib023167542
ssj0000608566
Score 1.6578498
Snippet With the development of deep learning technology, the navigation technology of mobile robot based on deep reinforcement learning is developing rapidly. But,...
SourceID springer
proquest
SourceType Publisher
StartPage 675
SubjectTerms Deep reinforcement learning
Partially observable Markov decision process
Robot navigation
Title Robot Navigation in Crowds Environment Base Deep Reinforcement Learning with POMDP
URI http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=7119422&ppg=685
http://link.springer.com/10.1007/978-3-031-18123-8_53
Volume 446
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3JbtswECUc91L0kK5ouoGH5mSokEmthx5S14VhJOnmtLkRJEPF7sEKYruHfE2-Jf_Tf-gMF1l2c0lhQDAI2qJmHkbD4bwZQt7GvExkqfrouaVRUhVxJNO8jMo8ZTnTqaoMbhSPjrPRSTI-TU87nT-trKXVUr3TV7fySv5HqzAGekWW7B002_wpDMB30C9cQcNw3XJ-N8OsrsMQpgJa4kcrPm7PAobzqTvX98VTz5usG3t0_xVGpiu5OTiS2N6rmfnTh5HHMzldzdb-ohsdTGXdxtq3WtVLMNS_bb0Omzy5P2D7B_EANvlni95wzabrfYDXJlg5cwGatVVbtQ1QNmu1kWH36y-fjz46SgBK0yzeH_oDj-N66WrlutTPkO6AGZOhS0UwWhamnoPsqVdNYeoJ9gBqE2QW7fKM7XgIY1vxkBAPxWRvXNF3WPWVI05vbJ3h1Y2cW-b42d76F_DWKJi3_qY9Znu-ByufuWYv3mHIXM-hf95F7fQTuFmEd-NRIVK-Q3ZgAV1y72A4PvwRzB_DogShH7FzJMAf9t72L1d5iJW2l2izUGQrhQfxFczWD9Ziit62io091VYagPWuJg_JA2TcUKTCgOYekY6ZPya7QZPUa_IJmVic0TXO6Gx-c-0wRlsYo4gxihijGxijAWMUMXZzbfH1lJx8Gk4Go8h3BYku-mnBI5kZkxvdN6mOtcrLODFxpYqzUmEwUyumwCfXlZaGwyfJlYY9uVRSwWZbmqzgz0h3Xs_Nc0JZpuIySY3kBiZliTSKZ0Ulk4RXkrNqj0RBQsLmLviEae3ksRB5v18mjAlw2wXAYI_0ghgFTl-IUBQc5C-4APkLK3-B8n9xp9kvyf012l-R7vJyZV6DP7xUbzyO_gLa7K8y
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Multimedia+Technology+and+Enhanced+Learning&rft.au=Li%2C+Qinghua&rft.au=Li%2C+Haiming&rft.au=Wang%2C+Jiahui&rft.au=Feng%2C+Chao&rft.atitle=Robot+Navigation+in%C2%A0Crowds+Environment+Base+Deep+Reinforcement+Learning+with%C2%A0POMDP&rft.series=Lecture+Notes+of+the+Institute+for+Computer+Sciences%2C+Social+Informatics+and+Telecommunications+Engineering&rft.date=2022-01-01&rft.pub=Springer+Nature+Switzerland&rft.isbn=9783031181221&rft.issn=1867-8211&rft.eissn=1867-822X&rft.spage=675&rft.epage=685&rft_id=info:doi/10.1007%2F978-3-031-18123-8_53
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F7119422-l.jpg