Whizbang Labs East

Publication List Details

Period

2000 - 2000

Number

1

Co-Authors

Stochastic Optimization of Controlled Partially Observable Markov Decision Processes (2000)

Peter L. Bartlett, Jonathan Baxter, Whizbang Labs East

We introduce an on-line algorithm for finding local maxima of the average reward in a Partially Observable Markov Decision Process (POMPD) controlled by a parameterized policy. Optimization is over...