- 5
- 0
- 约27.05万字
- 约 26页
- 2017-10-15 发布于上海
- 举报
Approximate policy iteration a survey and some new methods-近似策略迭代一项调查和一些新方法
_,cDn加Z 201l
Z抛D拶A印Z 9(3)310-335
DOI
▲ ■ J 1■ ■』 J● ■
lteratlon:aannS0me
ADDr0XlmateD0llCy SUrVey
newmethodS
DiIni仕iP.BERTSEKAS
Dcpam∞ntofElecmcalEn萄necringandC伽叩utersci即ce,M舔sachu∞ttsI璐timte
considertlleclassicalite:rationmemodof
Abstl鼍ct:Wb policv dvnaITlicpmg瑚mmjng(DP),wherea1,proximations
of
肌dsimulanonareusedtodealwithmecurSeof anumberof andrate
dimensionality.Wbsurvey issues:convergence
evaluation tosimuladonnoiseof evaIu·
conve理enceofappmximatepoIicy methods,singul州西andsusceptibiliH policy
ande划嘲nced osciIlationand
ation,exDlora止ionissues,cons仃ained policyitemtion,policy chanering,andoptiIIlistic如d
dis砸butedite瑚lion.Ourdiscussionof evaluationiscouchedin temsandaimsto available
policv policy general unify山e
of andto metwomain evaluation
me山odsinttlelightrecentresearchdevelopmentscomp硼_e policy approaches:projected
aIId contextof山ese mrodiff.erem
equationsteInporaldin.erences(TD),andaggregation.IIl山e appmaches,wesun,ey
of inversion as
tvpessimulanon-basedalgorimms:matrixmethods,suchleast—squarestemporaldiffbrence(LSTD),柚d
iterative
您可能关注的文档
- A descriptor system approach to robust stability of uncertain degenerate systems with discrete and distribute delays-描述了离散和分布时滞不确定退化系统鲁棒稳定性的描述系统.pdf
- A Fast Ambient Occlusion Method for Real-Time Plant Rendering-一种快速的环境遮挡法,用于实时的植物渲染.pdf
- A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs-在gpu上迭代模板计算的混合循环队列方法.pdf
- A new adaptive mutative scale chaos optimization algorithm and its application-一种新的自适应混沌尺度混沌优化算法及其应用.pdf
- A minimal-energy driving strategy for high-speed electric train-一种高速电动列车的最低能量驱动策略.pdf
- A New Approach to Graph Recognition and Applications to Distance-Hereditary Graphs-一种将图像识别和应用于距离遗传图的新方法.pdf
- A new approach for automated image segmentation based on simplified PCNN-一种基于简化PCNN的自动图像分割方法.pdf
- A new multisensor fusion SLAM approach for mobile robots-一种新的移动机器人多传感器融合技术.pdf
- A Multi-Threaded Semantic Focused Crawler-一个多线程的语义聚焦爬虫.pdf
- A Geometric Approach for Multi-Degree Spline-多学位样条的几何方法.pdf
最近下载
- 贝纳利BJ250维修手册.pdf VIP
- PasswortD A1 听力原文-德语学习资料.pdf VIP
- 一体化污水处理设备施工工艺.docx VIP
- 自动可调螺杆机组触摸屏说明书_SCC60-TP-V2.05.doc VIP
- 学堂在线 雨课堂 学堂云 如何写好科研论文 章节测试答案.docx VIP
- 人教版八年级数学下册基础知识专项讲练 专题17.20 勾股定理(中考真题专练)(巩固篇)(专项练习).docx VIP
- 教育实习鉴定实习内容.docx VIP
- 《GBT11616-2013-同步带传动节距型号MXL、XXL、XL、L、H、XH和XXH同步带尺寸》.pdf
- 离婚协议书(无子女版).docx VIP
- pluronic系列产品指标.pptx VIP
原创力文档

文档评论(0)