TH广诚广场烂尾楼重建.ppt

  • About 5,020 characters
  • About 26 pages
  • Published 2017-06-17 in Tianjin

Deep Reinforcement Learning
Ph.D. Student Wangyu (王宇)

Deep Q-Network
  • Published in Nature.
  • A CNN trained with a variant of Q-learning.
  • Uses Atari games as a testbed.
  • Uses raw pixels as input.
  • Is not provided with any game-specific information or hand-designed visual features.

V. Mnih, K. Kavukcuoglu, D. Silver, et al. Human-level control through deep reinforcement learning. Nature, 518(7540):529–533, 2015.

Contribution
This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging …
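
The slides describe the network as trained with "a variant of Q-learning". As a rough illustration only, the sketch below shows the plain tabular Q-learning update that such variants build on. It is not the paper's method: the Deep Q-Network replaces the table with a CNN over raw pixels, and the function name, the toy state and action counts, and the `alpha` and `gamma` values here are all assumptions chosen for the example.

```python
import numpy as np


def q_learning_update(q, state, action, reward, next_state, done,
                      alpha=0.1, gamma=0.99):
    """Apply one tabular Q-learning update in place and return the TD error.

    Hypothetical helper for illustration; alpha (step size) and gamma
    (discount factor) are assumed values, not taken from the slides.
    """
    # Bootstrapped target: the reward plus the discounted value of the best
    # next action, or just the reward when the episode has ended.
    target = reward if done else reward + gamma * np.max(q[next_state])
    td_error = target - q[state, action]
    # Move the current estimate a fraction alpha toward the target.
    q[state, action] += alpha * td_error
    return td_error


# Toy problem (assumed): 3 states, 2 actions, all values start at zero.
q_table = np.zeros((3, 2))
err = q_learning_update(q_table, state=0, action=1, reward=1.0,
                        next_state=2, done=False)
```

With an all-zero table, the target for this first transition equals the reward, so the updated entry moves from 0 toward 1 by a fraction `alpha`. In a deep variant, the same target would instead drive a gradient step on the network's weights.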
