Robust Multi-agent Patrolling Strategies Using Reinforcement Learning
Patrolling an environment involves a team of agents whose goal usually consists in continuously visiting the most relevant areas as fast as possible. In this paper, we follow up on the work by Santana et al. who formulated this problem in terms of a reinforcement learning problem, where agents indiv...
Saved in:
| Published in | Swarm Intelligence Based Optimization pp. 157 - 165 |
|---|---|
| Main Authors | , |
| Format | Book Chapter |
| Language | English |
| Published |
Cham
Springer International Publishing
01.01.2014
|
| Series | Lecture Notes in Computer Science |
| Subjects | |
| Online Access | Get full text |
| ISBN | 3319129694 9783319129693 |
| ISSN | 0302-9743 1611-3349 |
| DOI | 10.1007/978-3-319-12970-9_17 |
Cover
| Summary: | Patrolling an environment involves a team of agents whose goal usually consists in continuously visiting the most relevant areas as fast as possible. In this paper, we follow up on the work by Santana et al. who formulated this problem in terms of a reinforcement learning problem, where agents individually learn an MDP using Q-Learning to patrol their environment. We propose another definition of the state space and of the reward function associated with the MDP of an agent. Experimental evaluation shows that our approach substantially improves the previous RL method in several instances (graph topology and number of agents). Moreover, it is observed that such an RL approach is robust as it can efficiently cope with most of the situations caused by the removal of agents during a patrolling simulation. |
|---|---|
| ISBN: | 3319129694 9783319129693 |
| ISSN: | 0302-9743 1611-3349 |
| DOI: | 10.1007/978-3-319-12970-9_17 |