Robust Multi-agent Patrolling Strategies Using Reinforcement Learning

Patrolling an environment involves a team of agents whose goal usually consists in continuously visiting the most relevant areas as fast as possible. In this paper, we follow up on the work by Santana et al. who formulated this problem in terms of a reinforcement learning problem, where agents indiv...

Full description

Saved in:
Bibliographic Details
Published inSwarm Intelligence Based Optimization pp. 157 - 165
Main Authors Lauri, Fabrice, Koukam, Abderrafiaa
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing 01.01.2014
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3319129694
9783319129693
ISSN0302-9743
1611-3349
DOI10.1007/978-3-319-12970-9_17

Cover

More Information
Summary:Patrolling an environment involves a team of agents whose goal usually consists in continuously visiting the most relevant areas as fast as possible. In this paper, we follow up on the work by Santana et al. who formulated this problem in terms of a reinforcement learning problem, where agents individually learn an MDP using Q-Learning to patrol their environment. We propose another definition of the state space and of the reward function associated with the MDP of an agent. Experimental evaluation shows that our approach substantially improves the previous RL method in several instances (graph topology and number of agents). Moreover, it is observed that such an RL approach is robust as it can efficiently cope with most of the situations caused by the removal of agents during a patrolling simulation.
ISBN:3319129694
9783319129693
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-319-12970-9_17