Implicit dual control based on particle filtering and forward dynamic programming
This paper develops a sampling‐based approach to implicit dual control. Implicit dual control methods synthesize stochastic control policies by systematically approximating the stochastic dynamic programming equations of Bellman, in contrast to explicit dual control methods that artificially induce...
Saved in:
Published in | International journal of adaptive control and signal processing Vol. 24; no. 3; pp. 155 - 177 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Chichester, UK
John Wiley & Sons, Ltd
01.03.2010
|
Subjects | |
Online Access | Get full text |
ISSN | 0890-6327 1099-1115 |
DOI | 10.1002/acs.1094 |
Cover
Summary: | This paper develops a sampling‐based approach to implicit dual control. Implicit dual control methods synthesize stochastic control policies by systematically approximating the stochastic dynamic programming equations of Bellman, in contrast to explicit dual control methods that artificially induce probing into the control law by modifying the cost function to include a term that rewards learning. The proposed implicit dual control approach is novel in that it combines a particle filter with a policy‐iteration method for forward dynamic programming. The integration of the two methods provides a complete sampling‐based approach to the problem. Implementation of the approach is simplified by making use of a specific architecture denoted as a H‐block. Practical suggestions are given for reducing computational loads within the H‐block for real‐time applications. As an example, the method is applied to the control of a stochastic pendulum model having unknown mass, length, initial position and velocity, and unknown sign of its dc gain. Simulation results indicate that active controllers based on the described method can systematically improve closed‐loop performance with respect to other more common stochastic control approaches. Copyright © 2008 John Wiley & Sons, Ltd. |
---|---|
Bibliography: | National Institute of Health - No. GM068968; No. EB005803 ark:/67375/WNG-ZD7MMPN1-Z istex:78D59F30D181710C7B3B20FBF7288DC9615B5A39 ArticleID:ACS1094 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 ObjectType-Article-2 ObjectType-Feature-1 |
ISSN: | 0890-6327 1099-1115 |
DOI: | 10.1002/acs.1094 |