Policy and value iteration

Loading...
Просмотреть программу курса

Рецензии

4.1 (оценок: 272)
  • 5 stars
    147 ratings
  • 4 stars
    68 ratings
  • 3 stars
    26 ratings
  • 2 stars
    11 ratings
  • 1 star
    20 ratings
VO

Mar 17, 2019

Well Prepared and taught course.. Will highly recommend as the primer for reinforcement learning

JJ

Sep 15, 2019

Fantastic class if you don't mind to overcome some code issues in the homework.

Из урока
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

Преподаватели

  • Pavel Shvechikov

    Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Alexander Panin

    Lecturer

Ознакомьтесь с нашим каталогом

Присоединяйтесь бесплатно и получайте персонализированные рекомендации, обновления и предложения.