Footnote: Monte-Carlo vs Temporal Difference

Reviews

4.1 (249 ratings)
  • 5 stars
    129 ratings
  • 4 stars
    63 ratings
  • 3 stars
    26 ratings
  • 2 stars
    11 ratings
  • 1 star
    20 ratings
VO

Mar 17, 2019

Well-prepared and well-taught course. Will highly recommend as a primer for reinforcement learning.

AH

Aug 17, 2018

Learned a lot. The pace is quick and the assignments are sometimes challenging.

From the lesson
Model-free methods
This week we'll find out how to apply last week's ideas to real-world problems: ones where you don't have a perfect model of your environment.

Instructors

  • Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Lecturer
