A Real-Time Path Planning Algorithm Based on the Markov Decision Process in a Dynamic Environment for Wheeled Mobile Robots

Bibliographic Details
Published in	Actuators Vol. 12; no. 4; p. 166
Main Authors	Chen, Yu-Ju, Jhong, Bing-Gang, Chen, Mei-Yung
Format	Journal Article
Language	English
Published	Basel MDPI AG 01.04.2023
Subjects	A algorithm; Algorithms; Barriers; Decision making; Design; Heuristic; Markov analysis; Markov decision process; Markov processes; Methods; Path planning; Real time; reward cost function; Robotics industry; Robots
ISSN	2076-0825
DOI	10.3390/act12040166

Summary:	This paper proposes a real-time path planning algorithm based on the Markov decision process (MDP). The algorithm can guide a wheeled mobile robot to its goal in dynamic environments. Path planning consists of two phases: the utility update phase and the policy update phase. In the utility update phase, the utility value is updated based on information from the observable environment. Obstacles and walls reduce the utility value, pushing the agent away from these impassable areas. The utility value of the goal is constant and is always the largest. In the policy update phase, a series of policies is obtained by maximizing the long-term total reward, and this series eventually forms a path towards the goal, regardless of where the agent is located. The simulations and experiments show that finding the first path takes longer because the utility values change substantially at the start, but once a path is planned, responding to environmental changes takes little time. Therefore, the proposed path planning algorithm has an advantage in dynamic environments where obstacles move in unpredictable ways.
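
The two phases described in the summary can be illustrated with a minimal sketch. This is not the authors' implementation: it uses textbook value iteration on a 4-connected grid with deterministic moves, and every name and number in it (`FREE`, `OBSTACLE`, `GOAL`, `update_utility`, `update_policy`, `follow_policy`, the discount `gamma`, the step cost, and the goal and obstacle values) is an assumption chosen for illustration. The record does not give the paper's reward cost function, so a simple constant step cost stands in for it here.

```python
import numpy as np

FREE, OBSTACLE, GOAL = 0, 1, 2              # hypothetical cell labels
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def neighbors(grid, r, c):
    """Yield the in-bounds 4-connected neighbors of cell (r, c)."""
    rows, cols = grid.shape
    for dr, dc in MOVES:
        nr, nc = int(r) + dr, int(c) + dc
        if 0 <= nr < rows and 0 <= nc < cols:
            yield nr, nc


def update_utility(grid, utility, sweeps=1, gamma=0.9, step_cost=-1.0,
                   goal_value=100.0, obstacle_value=-100.0):
    """Utility update phase (assumed form): synchronous value-iteration sweeps.

    The goal is pinned to a constant maximum and obstacles to a low value,
    so free cells near obstacles never prefer stepping into them.
    """
    utility = utility.copy()
    for _ in range(sweeps):
        utility[grid == GOAL] = goal_value
        utility[grid == OBSTACLE] = obstacle_value
        new = utility.copy()
        for r, c in zip(*np.nonzero(grid == FREE)):
            best = max((utility[n] for n in neighbors(grid, r, c)),
                       default=utility[r, c])
            new[r, c] = step_cost + gamma * best
        utility = new
    return utility


def update_policy(grid, utility):
    """Policy update phase (assumed form): greedy move to the best neighbor."""
    policy = {}
    for r, c in zip(*np.nonzero(grid == FREE)):
        options = [n for n in neighbors(grid, r, c) if grid[n] != OBSTACLE]
        if options:
            policy[(int(r), int(c))] = max(options, key=lambda n: utility[n])
    return policy


def follow_policy(grid, policy, start):
    """Chain per-cell policies from `start` until the goal or a dead end."""
    path = [start]
    while grid[path[-1]] != GOAL and len(path) <= grid.size:
        nxt = policy.get(path[-1])
        if nxt is None:
            break
        path.append(nxt)
    return path


# Plan once on a small map with a wall segment.
grid = np.zeros((5, 5), dtype=int)
grid[4, 4] = GOAL
grid[1, 1:4] = OBSTACLE
utility = update_utility(grid, np.zeros(grid.shape), sweeps=50)
path = follow_policy(grid, update_policy(grid, utility), (0, 0))

# A new obstacle appears; reuse the previous utilities as a warm start.
grid[2, 4] = OBSTACLE
utility = update_utility(grid, utility, sweeps=50)
new_path = follow_policy(grid, update_policy(grid, utility), (0, 0))
```

Because the policy is defined for every free cell, a path to the goal can be read off from any start position, which mirrors the summary's remark that the policies form a path "regardless of where the agent is located". The fixed sweep count is a simplification; a real-time planner would more likely stop sweeping once the utilities change by less than some tolerance.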