Robot Navigation in Crowds Environment Base Deep Reinforcement Learning with POMDP

With the development of deep learning, mobile robot navigation based on deep reinforcement learning is advancing rapidly. However, navigation policies based on deep reinforcement learning still need improvement in crowded environments. The motion intention of pedestrians in...

Bibliographic Details
Published in	Multimedia Technology and Enhanced Learning Vol. 446; pp. 675 - 685
Main Authors	Li, Qinghua; Li, Haiming; Wang, Jiahui; Feng, Chao
Format	Book Chapter
Language	English
Published	Switzerland Springer 2022 Springer Nature Switzerland
Series	Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
Subjects	Deep reinforcement learning; Partially observable Markov decision process; Robot navigation
ISBN	3031181220 9783031181221
ISSN	1867-8211 1867-822X
DOI	10.1007/978-3-031-18123-8_53

Abstract	With the development of deep learning, mobile robot navigation based on deep reinforcement learning is advancing rapidly. However, navigation policies based on deep reinforcement learning still need improvement in crowded environments. The motion intention of pedestrians in a crowd is variable, and a pedestrian's current motion intention cannot be judged from a single frame of sensor information alone. Therefore, when only one frame is given as input, the pedestrian motion state is only partially observable. To deal with this problem, we present the P-RL algorithm in this paper. The algorithm replaces the Markov Decision Process model of traditional deep reinforcement learning with a Partially Observable Markov Decision Process model, and introduces an LSTM neural network into the deep reinforcement learning algorithm. Because the LSTM network can process time-series information, the algorithm can perceive the relationship between the observation data of successive frames, which enhances its robustness. Experimental results show that our algorithm outperforms other algorithms in time overhead and navigation success rate in crowded environments.
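The abstract's central idea, that one sensor frame cannot reveal where a pedestrian is heading while a recurrent network over several frames can, may be illustrated with a small sketch. This is not the authors' P-RL implementation; the record gives no architecture details. The class `TinyLSTMCell`, the helper `encode_history`, the random weights, and the two-dimensional "relative pedestrian position" observations are all assumptions made for illustration, using only the Python standard library.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTMCell:
    """Hypothetical single LSTM cell with random, untrained weights."""

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        n = input_size + hidden_size
        # One weight matrix and bias vector per gate:
        # i = input gate, f = forget gate, g = candidate, o = output gate.
        self.W = {
            gate: [[rng.uniform(-0.5, 0.5) for _ in range(n)]
                   for _ in range(hidden_size)]
            for gate in "ifgo"
        }
        self.b = {gate: [0.0] * hidden_size for gate in "ifgo"}

    def _affine(self, gate, z):
        return [sum(w * v for w, v in zip(row, z)) + bias
                for row, bias in zip(self.W[gate], self.b[gate])]

    def step(self, obs, h, c):
        """Consume one observation frame and update (hidden, cell) state."""
        z = list(obs) + list(h)
        i = [sigmoid(v) for v in self._affine("i", z)]
        f = [sigmoid(v) for v in self._affine("f", z)]
        g = [math.tanh(v) for v in self._affine("g", z)]
        o = [sigmoid(v) for v in self._affine("o", z)]
        c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
        h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
        return h_new, c_new


def encode_history(cell, frames):
    """Fold a sequence of frames into one hidden-state vector."""
    h = [0.0] * cell.hidden_size
    c = [0.0] * cell.hidden_size
    for obs in frames:
        h, c = cell.step(obs, h, c)
    return h


cell = TinyLSTMCell(input_size=2, hidden_size=4)

# Two made-up pedestrian tracks (relative x, y) that end in the same frame
# but arrive there from opposite directions.
approaching = [(3.0, 0.0), (2.0, 0.0), (1.0, 0.0)]
receding = [(-1.0, 0.0), (0.0, 0.0), (1.0, 0.0)]

# With only the last frame, the two situations are indistinguishable.
single_a = encode_history(cell, approaching[-1:])
single_r = encode_history(cell, receding[-1:])

# With the full history, the recurrent state differs between them.
h_a = encode_history(cell, approaching)
h_r = encode_history(cell, receding)
```

In a POMDP-style agent, a vector such as `h_a` would stand in for the unobservable pedestrian state and be passed to the policy or value network; how P-RL wires this up is not stated in this record.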
Author	Li, Haiming; Li, Qinghua; Wang, Jiahui; Feng, Chao
Author_xml	– sequence: 1 givenname: Qinghua surname: Li fullname: Li, Qinghua – sequence: 2 givenname: Haiming surname: Li fullname: Li, Haiming – sequence: 3 givenname: Jiahui surname: Wang fullname: Wang, Jiahui – sequence: 4 givenname: Chao surname: Feng fullname: Feng, Chao email: cfeng@qlu.edu.cn
ContentType	Book Chapter
Copyright	ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2022
Copyright_xml	– notice: ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2022
DBID	FFUUA
DEWEY	371.334
DOI	10.1007/978-3-031-18123-8_53
DatabaseName	ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
Discipline	Education Computer Science
EISBN	3031181239 9783031181238
EISSN	1867-822X
Editor	Zhang, Yu-Dong Wang, Shui-Hua
Editor_xml	– sequence: 1 fullname: Zhang, Yu-Dong – sequence: 2 fullname: Wang, Shui-Hua
EndPage	685
ExternalDocumentID	EBC7119422_663_685
GroupedDBID	38. AABBV AAZWU ABSVR ABTHU ABVND ACBPT ACHZO ACPMC ADNVS AEJLV AEKFX AHVRR AIYYB AJQNT ALMA_UNASSIGNED_HOLDINGS BBABE CZZ FFUUA IEZ SBO TPJZQ Z81 Z83
ID	FETCH-LOGICAL-p1583-a6ee7ec1e5c0cb7904e0fb8d9b2228cb2b172cfcae3e3e47bc487abab230ae683
ISBN	3031181220 9783031181221
ISSN	1867-8211
IngestDate	Tue Jul 29 20:28:40 EDT 2025 Thu May 29 01:10:12 EDT 2025
IsPeerReviewed	false
IsScholarly	false
LCCallNum	LB1028.43-1028.75
Language	English
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-p1583-a6ee7ec1e5c0cb7904e0fb8d9b2228cb2b172cfcae3e3e47bc487abab230ae683
OCLC	1348692244
PQID	EBC7119422_663_685
PageCount	11
ParticipantIDs	springer_books_10_1007_978_3_031_18123_8_53 proquest_ebookcentralchapters_7119422_663_685
PublicationCentury	2000
PublicationDate	2022
PublicationDateYYYYMMDD	2022-01-01
PublicationDate_xml	– year: 2022 text: 2022
PublicationDecade	2020
PublicationPlace	Switzerland
PublicationPlace_xml	– name: Switzerland – name: Cham
PublicationSeriesTitle	Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
PublicationSeriesTitleAlternate	Lect.Notes Social.Inform.
PublicationSubtitle	4th EAI International Conference, ICMTEL 2022, Virtual Event, April 15-16, 2022, Proceedings
PublicationTitle	Multimedia Technology and Enhanced Learning
PublicationYear	2022
Publisher	Springer Springer Nature Switzerland
Publisher_xml	– name: Springer – name: Springer Nature Switzerland
RelatedPersons	Jia, Xiaohua Dressler, Falko Ferrari, Domenico Kobayashi, Hisashi Shen, Xuemin Cao, Jiannong Palazzo, Sergio Coulson, Geoffrey Stan, Mircea Gerla, Mario Akan, Ozgur Bellavista, Paolo Zomaya, Albert Y. Sahni, Sartaj
RelatedPersons_xml	– sequence: 1 givenname: Ozgur surname: Akan fullname: Akan, Ozgur – sequence: 2 givenname: Paolo surname: Bellavista fullname: Bellavista, Paolo – sequence: 3 givenname: Jiannong surname: Cao fullname: Cao, Jiannong – sequence: 4 givenname: Geoffrey surname: Coulson fullname: Coulson, Geoffrey – sequence: 5 givenname: Falko surname: Dressler fullname: Dressler, Falko – sequence: 6 givenname: Domenico surname: Ferrari fullname: Ferrari, Domenico – sequence: 7 givenname: Mario surname: Gerla fullname: Gerla, Mario – sequence: 8 givenname: Hisashi surname: Kobayashi fullname: Kobayashi, Hisashi – sequence: 9 givenname: Sergio surname: Palazzo fullname: Palazzo, Sergio – sequence: 10 givenname: Sartaj surname: Sahni fullname: Sahni, Sartaj – sequence: 11 givenname: Xuemin orcidid: 0000-0002-4140-287X surname: Shen fullname: Shen, Xuemin – sequence: 12 givenname: Mircea surname: Stan fullname: Stan, Mircea – sequence: 13 givenname: Xiaohua surname: Jia fullname: Jia, Xiaohua – sequence: 14 givenname: Albert Y. surname: Zomaya fullname: Zomaya, Albert Y.
SSID	ssj0002732977 ssib023167542 ssj0000608566
Score	1.6578498
SourceID	springer proquest
SourceType	Publisher
StartPage	675
SubjectTerms	Deep reinforcement learning; Partially observable Markov decision process; Robot navigation
Title	Robot Navigation in Crowds Environment Base Deep Reinforcement Learning with POMDP
URI	http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=7119422&ppg=685 http://link.springer.com/10.1007/978-3-031-18123-8_53
Volume	446
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3JbtswECUc91L0kK5ouoGH5mSokEmthx5S14VhJOnmtLkRJEPF7sEKYruHfE2-Jf_Tf-gMF1l2c0lhQDAI2qJmHkbD4bwZQt7GvExkqfrouaVRUhVxJNO8jMo8ZTnTqaoMbhSPjrPRSTI-TU87nT-trKXVUr3TV7fySv5HqzAGekWW7B002_wpDMB30C9cQcNw3XJ-N8OsrsMQpgJa4kcrPm7PAobzqTvX98VTz5usG3t0_xVGpiu5OTiS2N6rmfnTh5HHMzldzdb-ohsdTGXdxtq3WtVLMNS_bb0Omzy5P2D7B_EANvlni95wzabrfYDXJlg5cwGatVVbtQ1QNmu1kWH36y-fjz46SgBK0yzeH_oDj-N66WrlutTPkO6AGZOhS0UwWhamnoPsqVdNYeoJ9gBqE2QW7fKM7XgIY1vxkBAPxWRvXNF3WPWVI05vbJ3h1Y2cW-b42d76F_DWKJi3_qY9Znu-ByufuWYv3mHIXM-hf95F7fQTuFmEd-NRIVK-Q3ZgAV1y72A4PvwRzB_DogShH7FzJMAf9t72L1d5iJW2l2izUGQrhQfxFczWD9Ziit62io091VYagPWuJg_JA2TcUKTCgOYekY6ZPya7QZPUa_IJmVic0TXO6Gx-c-0wRlsYo4gxihijGxijAWMUMXZzbfH1lJx8Gk4Go8h3BYku-mnBI5kZkxvdN6mOtcrLODFxpYqzUmEwUyumwCfXlZaGwyfJlYY9uVRSwWZbmqzgz0h3Xs_Nc0JZpuIySY3kBiZliTSKZ0Ulk4RXkrNqj0RBQsLmLviEae3ksRB5v18mjAlw2wXAYI_0ghgFTl-IUBQc5C-4APkLK3-B8n9xp9kvyf012l-R7vJyZV6DP7xUbzyO_gLa7K8y
linkProvider	Library Specific Holdings
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Multimedia+Technology+and+Enhanced+Learning&rft.au=Li%2C+Qinghua&rft.au=Li%2C+Haiming&rft.au=Wang%2C+Jiahui&rft.au=Feng%2C+Chao&rft.atitle=Robot+Navigation+in%C2%A0Crowds+Environment+Base+Deep+Reinforcement+Learning+with%C2%A0POMDP&rft.series=Lecture+Notes+of+the+Institute+for+Computer+Sciences%2C+Social+Informatics+and+Telecommunications+Engineering&rft.date=2022-01-01&rft.pub=Springer+Nature+Switzerland&rft.isbn=9783031181221&rft.issn=1867-8211&rft.eissn=1867-822X&rft.spage=675&rft.epage=685&rft_id=info:doi/10.1007%2F978-3-031-18123-8_53
thumbnail_s	http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F7119422-l.jpg