Вернуться к Prediction and Control with Function Approximation

звезд

Оценки: 670

•

Рецензии: 123

In this course, you will learn how to solve problems with large, high-dimensional, and potentially infinite state spaces. You will see that estimating value functions can be cast as a supervised learning problem---function approximation---allowing you to build agents that carefully balance generalization and discrimination in order to maximize reward. We will begin this journey by investigating how our policy evaluation or prediction methods like Monte Carlo and TD can be extended to the function approximation setting. You will learn about feature construction techniques for RL, and representation learning via neural networks and backprop. We conclude this course with a deep-dive into policy gradient methods; a way to learn policies directly without learning a value function. In this course you will solve two continuous-state control tasks and investigate the benefits of policy gradient methods in a continuous-action environment.
Prerequisites: This course strongly builds on the fundamentals of Courses 1 and 2, and learners should have completed these before starting this course. Learners should also be comfortable with probabilities & expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), and implementing algorithms from pseudocode.
By the end of this course, you will be able to:
-Understand how to use supervised learning approaches to approximate value functions
-Understand objectives for prediction (value estimation) under function approximation
-Implement TD with function approximation (state aggregation), on an environment with an infinite state space (continuous state space)
-Understand fixed basis and neural network approaches to feature construction
-Implement TD with neural network function approximation in a continuous state environment
-Understand new difficulties in exploration when moving to function approximation
-Contrast discounted problem formulations for control versus an average reward problem formulation
-Implement expected Sarsa and Q-learning with function approximation on a continuous state control task
-Understand objectives for directly estimating policies (policy gradient objectives)
-Implement a policy gradient method (called Actor-Critic) on a discrete state environment...

AC

1 дек. 2019 г.

Well peaced and thoughtfully explained course. Highly recommended for anyone willing to set solid grounding in Reinforcement Learning. Thank you Coursera and Univ. of Alberta for the masterclass.

SJ

24 июня 2020 г.

Surely a level-up from the previous courses. This course adds to and extends what has been learned in courses 1 & 2 to a greater sphere of real-world problems. Great job Prof. Adam and Martha!

Фильтр по:

автор: George G

•28 февр. 2020 г.

Fantastic course! Despite the challenging content, this course actually is taught at least at the same level as the ones by Andrew Ng, Daphne Koller, and Geoffrey Hinton. Congratulations Martha and Adam! You are awesome and are my heroes! Thanks a lot! George

автор: Mukund C

•27 мар. 2020 г.

Excellent Course and Lectures. Loved it!! So important to read the chapters in the book ahead of time. Book is also excellent!! I liked the way the instructors explained the equations and broke them down. Nicely done!! I wish some more of the questions in the quiz reflected the data structures we use in the programming exercise, which will be super-helpful to reinforce the concepts when we do the programming exercises. In other words, an intermediate step of a worked example between the Pseudo-Code Algorithm in the Texbook/Lectures and the Programming Exercise. For example, more of the Feature_Vector -> Action_Value Calculation - even if we have to do some matrix manipulation by hand, that'd be wonderful. One of the quizzes has something like that (but more simplified) - which was perfect.

автор: Navid H

•16 окт. 2019 г.

The material is very good. But this course needs better instructors/ method of teaching. The book is also written in an unnecessarily technical way filled with jargon. explanations are not clear, simple stuff is presented in a very complicated manner for no obvious way.

автор: Maxim V

•23 янв. 2020 г.

Good content, but there was a highly unpleasant surprise in the programming assignments, namely this: "Retakes: You can attempt this assignment 5 times every 4 months." First of all, this is a highly unusual and therefore unexpected requirement on Coursera. Also, considering how buggy graders are and that some assignments require submitting results separately from the notebook, this is a really high risk of having to wait 4 months for another chance.

автор: Arthur O

•9 окт. 2020 г.

Excellent instruction that guides through the core material of part 2 of Sutton & Barto's Reinforcement Learning: an Introduction. The instructors additionally teach complementary material not found in the book. The notebooks got me "making something real" with the material in a way that deepened my understanding beyond a theoretical/pen-and-paper treatment. I appreciate the care that went into setting up the RL learning environment, creating test cases, and visualizations of performance -- it's awesome when the agents come together and you can see how well they perform!

A couple very minor notes on the lectures. The pacing of speech by Dr. Adam White often felt stiff and clipped. In future video courses he might benefit from practicing changing his tone, speed, and pauses to sound more natural. Similarly, Dr. Martha White's microphone was positioned in such a way that her breathing between sentences is captured, and sounds pretty loud. Improving these aspects of presentation in the future can make the lectures flow more naturally and reduce some friction from the distractions.

Those are nits on an otherwise excellent course. Thank you very much for putting the materials together! See you in the next one!

автор: Maximiliano B

•31 мар. 2020 г.

The third course of the specialization is excellent and it provides a solid foundation on problems with arbitrarily large state spaces that rely on approximate solution methods. The lectures are very well explained. It’s strongly recommended to read each book chapter in advance before watching the lectures to be able to better understand the concepts and be able to answer the quizzes. The content in this course is quite abstract and it is heavily dependent on statistics and calculus. It was very nice to integrate reinforcement learning with neural networks as part of one of the assignments as well as to implement the swing-up pendulum. I am looking forward to begin the capstone project.

автор: D. R

•31 дек. 2019 г.

Excellent course. The videos, quizzes, and especially the exercises add a lot of extra value to the text book (which is available for free - Sutton and Burto, 2nd edition). Of course it is not perfect - the videos are sometimes a bit dry, the NN part was brushed over too quickly for a beginner (luckily I had taken some courses about deep learning, so I was ok - but if you don't know the basics of NN, week 2 might be quite challenging for you). Other than that the biggest disadvantage is that the course forums are still quite empty - and so if you get stuck you can be on your own... But you shouldn't get stuck, and I guess this will improve over time.

автор: Mark J

•22 окт. 2019 г.

This, the third in an exceptionally well-paced series of four courses on Reinforcement Learning, extends the scope of the subject to include parameterized functions (i.e., neural networks). The section on tiling methods is especially interesting. The course is taught under the auspices of professors who, quite literally, wrote the book on reinforcement learning, and includes several video lectures by leading practitioners and theorists in the field. The final programming assignment, in particular, made me feel like I did when I wrote my first computer program that actually did what it was supposed to way back when -- delight and amazement.

автор: Ian W

•17 июня 2021 г.

Learned a lot through the course. This specialization is to teach you through the whole reinforcement learning textbook. Very informative, but the programming assignments are very difficult.

There are many tiny details to notice while programming, and the discussion forum is not very active. I suggest find some clues from the old posts when having difficulties, and ask / answer questions as much as possible to help yourself and the others.

автор: Julien T

•12 нояб. 2019 г.

Great course and specialization. The teachers are great, the material well presented and balanced. I strongly recommend this course to anyone interested in the field of Reinforcement Learning. For maximum chance of success I suggest following all 3 courses in succession and investing the necessary amount of time to read the textbook chapters as specified at the beginning of each week.

Looking forward to completing the capstone project now!

автор: Gordon L W C

•23 мар. 2020 г.

The course is very comprehensive on the content. But I think the difficulty of this course is in some sense too high for most people who don't have a background in engineering degree due to the extensive use of advanced mathematics. I think it might be a better idea if you are focusing on a few critical algorithms that trying to cover too much algorithms which is quite overwhelming

автор: Walter O A

•9 дек. 2019 г.

An almost overwhelming amount of material, however we managed to navigate through the thicket. The labs were well maintained and provided robust tests so that one could have a high degree of confidence in the solution before submitting to the grader. I really appreciate this. I would recommend this course to anybody wanting a serious introduction to reinforcement learning.

автор: Stefano P

•19 мая 2020 г.

This course is very rich of both mathematical and practical concepts, and it actually provides you with powerful tools to understand and use Reinforcement Learning. So far, it is the most interesting course in this specialization. Lectures are very clear and they often explain more deeply some concepts you find in the text book. Quizzes are challenging and well constructed.

автор: Sebastian P B

•2 дек. 2019 г.

This was a very good and though course. The content in this course is perfect to get yourself the necessary bases in order to start getting into deep RL. It doesn't really explain that far, but at the end you will have a basic idea of how deep learning can be used with RL. Enough to start reading papers about it or to watch other lectures focused on that topic.

автор: Zhang d

•6 мая 2020 г.

This course is amazing and wonderful, teaching us the knowledge to use function approximation to solve the vaule and policy estimation. Compared to the preceding tabular presentation, function approximation is appropriate for the reality problem and makes RL more powerful and interesting. I am looking foward to learning more skills in the RL area!

автор: Jesse W

•29 июля 2020 г.

This is a rigorous but clear course in using function approximation to calculate (rather than tabulate) state values and policies in reinforcement learning applications. The videos and assignments are well-constructed and instructive, and the free online textbook is great as well.

автор: Guilherme V

•20 нояб. 2020 г.

It was a great course, but I think it could be better if at least one practical example of function approximation with Neural Networks were in it, maybe integrate it with Tensorflow to make it simpler, but illustrate how it would work. Nevertheless, it was Really a nice course

автор: Tobias L

•14 окт. 2020 г.

Once again a nice course in the series of lectures. Giving a good overview of function approximation and its use in RL. Moreover, a lot of the piecies given in the previous lectures start to fit together. And I finally can connect DeepLearning with RL.

Thanks, Martha and Adam

автор: Alvaro M A

•3 апр. 2020 г.

Excellent course. You'll learn how to apply techniques like Neural Networks to Reinforcement Learning Agents. The assignments are very well designed, and they have the right difficulty so you can learn the most fundamental concepts of the course.

автор: Surya K

•30 апр. 2020 г.

Very well made course. This helped me build Agents for our mobile board game! I'd like some more practical tips as well though, like how to make algorithms converge and things to try out. Other than that, I recommend this to every RL enthusiast.

автор: César A S C

•8 авг. 2021 г.

Awesome course that complements both first courses on RL. Excellent chapter selection from the book that shows just the necessary and sufficient information to get a good idea of many of the concepts of function approximation. Very recommended.

автор: Lim G

•10 мая 2020 г.

I enjoyed the course because the content delivery was clear and concise. The hands-on assignment helped me better understand the concepts that were taught. I was able to draw connections and link between textbook and hands-on experience.

автор: Thomas G

•21 апр. 2020 г.

A very ambitious course where you have to invest a lot in reading the book but therefore you also learn a lot. I prefer more of those advanced courses here on Coursera.

The course is a very good complement to the book from Sutton.

автор: Mateusz K

•29 окт. 2019 г.

Its got a great variety of very applicable examples, use cases, and assignments. May be tough if people don't quite understand how neural networks work, so I suggest having a basic understanding of NN for parts of this course.

автор: Steven H

•9 июля 2020 г.

Excellent course! The assignment could be improved by adding input checking in methods with one-hot encoding of state as input. Which I suffered when I forgot to use the one-hot encoding and spent much time debugging.

- Аналитик данных Google
- Управление проектами от Google
- UX-дизайн от Google
- ИТ-поддержка Google
- Наука о данных IBM
- Аналитик данных от IBM
- Анализ данных с помощью Excel и R от IBM
- Аналитик по кибербезопасности от IBM
- Маркетинг в социальных сетях от Facebook
- Разработчик комплексных облачных приложений IBM
- Представитель по развитию продаж от Salesforce
- Сбытовые операции Salesforce
- Soporte de Tecnologías de la Información de Google
- Certificado profesional de Suporte em TI do Google
- ИТ-автоматизация с помощью Python от Google
- Tensorflow от DeepLearning.AI
- Популярные сертификаты по кибербезопасности
- Популярные сертификаты по SQL
- Популярные сертификаты по ИТ
- Посмотреть все сертификаты

- бесплатные курсы
- Изучите иностранный язык
- Python
- Java
- веб-дизайн
- SQL
- Cursos Gratis
- Microsoft Excel
- Управление проектами
- Безопасность в киберпространстве
- Людские ресурсы
- Бесплатные курсы в области науки о данных
- говорить на английском
- Написание контента
- Веб-разработка: полный спектр технологий
- Искусственный интеллект
- Программирование на языке C
- Навыки общения
- Блокчейн
- Просмотреть все курсы

- Навыки для команд по науке о данных
- Принятие решений на основе данных
- Навыки в области программной инженерии
- Навыки межличностного общения для проектных групп
- Управленческие навыки
- Навыки в области маркетинга
- Навыки для отделов продаж
- Навыки менеджера по продукту
- Навыки в области финансов
- Проекты по разработке для Android
- Проекты по TensorFlow и Keras
- Python для всех
- Глубокое обучение
- Навыки Excel для бизнеса
- Основы бизнеса
- Машинное обучение
- Основы AWS
- Основы инженерии данных
- Навыки для аналитика данных
- Навыки для UX-дизайнеров

- Сертификаты MasterTrack®
- Профессиональные сертификаты
- Сертификаты университетов
- MBA и другие дипломы в области бизнеса
- Степени в области науки о данных
- Степени в области компьютерных наук
- Дипломные программы по анализу данных
- Степени в области общественного здравоохранения
- Степени в области социальных наук
- Дипломные программы в области управления
- Дипломы ведущих европейских университетов
- Дипломы магистра
- Степени бакалавра
- Дипломы с карьерными путями, ориентированными на результат
- Бакалаврские курсы
- Что такое диплом бакалавра?
- Сколько времени нужно для получения диплома магистра?
- Стоит ли получать диплом MBA онлайн?
- 7 способов оплатить магистратуру
- Просмотреть все степени