Policy gradient formalism

Reviews

4.3 (436 ratings)

  • 5 stars: 58.25%
  • 4 stars: 23.39%
  • 3 stars: 8.94%
  • 2 stars: 4.12%
  • 1 star: 5.27%

SF

Apr 8, 2020

Rating: 5/5 stars

At times it felt like a bit more video material would be helpful to better understand the subject/gain deeper understanding.

And fixing some of the notebooks would be helpful.

FZ

Feb 13, 2019

Rating: 5/5 stars

A great course with very practical assignments to help you learn how to implement RL algorithms. But it also has some stupid quiz questions which make you feel confused.

From the lesson

Policy-based methods

We spent the previous 3 modules working on value-based methods: learning state values, action values and whatnot. Now it's time to see an alternative approach that doesn't require you to predict all future rewards to learn something.

Instructors

  • Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab

  • Alexander Panin

    Lecturer
