Measuring Policy Optimality

Loading...
Из курса от партнера National Research University Higher School of Economics
Practical Reinforcement Learning
57 оценки
National Research University Higher School of Economics
57 оценки
Курс 4 из 7 — Specialization Advanced Machine Learning
Из урока
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

Познакомьтесь с преподавателями

  • Pavel Shvechikov
    Pavel Shvechikov
    Researcher at HSE and Sberbank AI Lab
    HSE Faculty of Computer Science
  • Alexander Panin
    Alexander Panin
    Lecturer
    HSE Faculty of Computer Science

Ознакомьтесь с нашим каталогом

Присоединяйтесь бесплатно и получайте персонализированные рекомендации, обновления и предложения.