Q-learning with infinite state space
Convergence of Q-learning with
linear function approximation
Francisco S. Melo and M. Isabel Ribeiro
Institute for Systems and Robotics
[fmelo,mir]@isr.ist.utl.pt
European Control Conference,
Kos, Greece, July 2007
July 4th, 2007 Slide 1
Outline of the presentation
• Motivation and problem formulation
• Background
• Related work
• Q-learning with LFA
• Some results
• Concluding remarks
Motivation
• Markov decision processes provide useful models to address
discrete-time stochastic control problems;
• Many powerful methods are available (e.g., TD(λ), Q-lear