Q-learning with infinite state space


Convergence of Q-learning with linear function approximation
Francisco S. Melo and M. Isabel Ribeiro
Institute for Systems and Robotics
[fmelo,mir]@isr.ist.utl.pt
European Control Conference, Kos, Greece, July 2007
July 4th, 2007

Outline of the presentation
  • Motivation and problem formulation
  • Background
  • Related work
  • Q-learning with LFA
  • Some results
  • Concluding remarks

Motivation
  • Markov decision processes provide useful models to address discrete-time stochastic control problems;
  • Many powerful methods are available (e.g., TD(λ), Q-lear
