Footnote: Monte-Carlo vs Temporal Difference

Reviews

4.2 (397 ratings)
  • 5 stars: 57.17%
  • 4 stars: 23.67%
  • 3 stars: 8.81%
  • 2 stars: 4.53%
  • 1 star: 5.79%
FZ
Feb 13, 2019

A great course with very practical assignments to help you learn how to implement RL algorithms. But it also has some stupid quiz questions which makes you feel confusing.

LJ
Oct 6, 2019

Challenging (unlike many other courses on Coursera, it does not baby you and does not seem to be targeting as high a pass rate as possible), but very very rewarding.

From the lesson
Model-free methods
This week we'll find out how to apply last week's ideas to real-world problems: ones where you don't have a perfect model of your environment.

Instructors

  • Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Lecturer
