新規登録 | ログイン | FAQ      [?] 

タグ: policy_gradient [8 articles]

Recent papers classified by the tag policy_gradient.
  • Policy gradient methods for reinforcement learning with function approximation
    Vol. 12 (2000), pp. 1057-1063.
    by Richard S Sutton, David Mcallester, Satinder Singh, Yishay Mansour
    posted to policy_gradient reinforcement_learning by tdahl on 2008-08-26 09:40:39 as *****
  • Hierarchical policy gradient algorithms
    Vol. 18 (2003), pp. 226-233.
    by Mohammad Ghavamzadeh, Sridhar Mahadevan
    posted to hierarchical policy_gradient reinforcement_learning by tdahl on 2008-08-26 09:43:02 as ****
  • Policy Gradient Critics
    Machine Learning: ECML 2007 (2007), pp. 466-477.
    by Daan Wierstra, Jürgen Schmidhuber
    posted to action_selection actor-critic policy_gradient by Cavadini on 2008-02-11 07:11:11 as read
  • Policy Gradient Methods for Robotics
    Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on (2006), pp. 2219-2225.
    by Jan Peters, Stefan Schaal
    posted to actor-critic policy_gradient by Cavadini on 2008-02-11 07:13:30 as read
  • Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark
    Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on (2007), pp. 254-261.
    posted to experimental policy_gradient by Cavadini on 2008-02-05 11:28:12 as read along with 1 person schaul
  • Policy Gradient Methods for Reinforcement Learning with Function Approximation
    (1999)
    posted to actor-critic policy_gradient by Cavadini on 2008-02-11 07:05:12 as read
  • Policy Gradient Methods for Reinforcement Learning with Function Approximation
    (1999)
  • A natural policy gradient
    Advances in Neural Information Processing Systems, Vol. 14 (2002)
    by Sham Kakade
    posted to natural_gradient policy_gradient reinforcement_learning by bsilverthorn on 2008-01-25 16:20:39 as read
  • 注: このページを引用する時は次のURLでどうぞ: http://www.citeulike.org/tag/policy_gradient

    RIS BibTeX
    CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.