Udemy

Modern Reinforcement Learning: Deep Q Agents (PyTorch & TF2)

Enroll Now
  • 6,669 Students
  • Updated 8/2023
  • Certificate Available
4.3
(1,107 Ratings)
CTgoodjobs selects quality courses to enhance professionals' competitiveness. By purchasing courses through links on our site, we may receive an affiliate commission.

Course Information

Registration period
Year-round Recruitment
Course Level
Study Mode
Duration
6 Hour(s) 44 Minute(s)
Language
English
Taught by
Phil Tabor
Certificate
  • Available
  • *The delivery and distribution of the certificate are subject to the policies and arrangements of the course provider.
Rating
4.3
(1,107 Ratings)

Course Overview

Modern Reinforcement Learning: Deep Q Agents (PyTorch & TF2)

How to Turn Deep Reinforcement Learning Research Papers Into Agents That Beat Classic Atari Games

In this complete deep reinforcement learning course you will learn a repeatable framework for reading and implementing deep reinforcement learning research papers. You will read the original papers that introduced the Deep Q learning, Double Deep Q learning, and Dueling Deep Q learning algorithms. You will then learn how to implement these in pythonic and concise PyTorch and Tensorflow 2 code, that can be extended to include any future deep Q learning algorithms. These algorithms will be used to solve a variety of environments from the Open AI gym's Atari library, including Pong, Breakout, and Bankheist.


You will learn the key to making these Deep Q Learning algorithms work, which is how to modify the Open AI Gym's Atari library to meet the specifications of the original Deep Q Learning papers. You will learn how to:

  • Repeat actions to reduce computational overhead

  • Rescale the Atari screen images to increase efficiency

  • Stack frames to give the Deep Q agent a sense of motion

  • Evaluate the Deep Q agent's performance with random no-ops to deal with model over training

  • Clip rewards to enable the Deep Q learning agent to generalize across Atari games with different score scales


If you do not have prior experience in reinforcement or deep reinforcement learning, that's no problem. Included in the course is a complete and concise course on the fundamentals of reinforcement learning. The introductory course in reinforcement learning will be taught in the context of solving the Frozen Lake environment from the Open AI Gym.

We will cover:

  • Markov decision processes

  • Temporal difference learning

  • The original Q learning algorithm

  • How to solve the Bellman equation

  • Value functions and action value functions

  • Model free vs. model based reinforcement learning

  • Solutions to the explore-exploit dilemma, including optimistic initial values and epsilon-greedy action selection

Also included is a mini course in deep learning using the PyTorch framework. This is geared for students who are familiar with the basic concepts of deep learning, but not the specifics, or those who are comfortable with deep learning in another framework, such as Tensorflow or Keras. You will learn how to code a deep neural network in Pytorch as well as how convolutional neural networks function. This will be put to use in implementing a naive Deep Q learning agent to solve the Cartpole problem from the Open AI gym. 

Course Content

  • 11 section(s)
  • 50 lecture(s)
  • Section 1 Introduction
  • Section 2 Fundamentals of Reinforcement Learning
  • Section 3 Deep Learning Crash Course
  • Section 4 Human Level Control Through Deep Reinforcement Learning: From Paper to Code
  • Section 5 Deep Reinforcement Learning with Double Q Learning
  • Section 6 Dueling Network Architectures for Deep Reinforcement Learning
  • Section 7 Improving On Our Solutions
  • Section 8 Conclusion
  • Section 9 Bonus Lecture
  • Section 10 Tensorflow 2 Implementations
  • Section 11 Appendix

What You’ll Learn

  • How to read and implement deep reinforcement learning papers
  • How to code Deep Q learning agents
  • How to Code Double Deep Q Learning Agents
  • How to Code Dueling Deep Q and Dueling Double Deep Q Learning Agents
  • How to write modular and extensible deep reinforcement learning software
  • How to automate hyperparameter tuning with command line arguments


Reviews

  • J
    Jair Silva
    1.0

    Apenas faz code, usa os ambientes prontos. Uma perda de tempo.

  • J
    James Norton
    4.0

    The content is good, but it has not been updated, so some of the packages are out of date (particularly gym, which I had to substitute with gymnasium to get things working).

  • S
    STS Courses
    1.0

    Just getting started, but too theoretical so far.

  • T
    Thomas Neeb
    4.0

    I like the course so far. It is a little bt hard for me to follow, because Python is new to me, but that was mentioned in the beginning, so I am totally fine with it, learning Python on the fly.

Start FollowingSee all

We use cookies to enhance your experience on our website. Please read and confirm your agreement to our Privacy Policy and Terms and Conditions before continue to browse our website.

Read and Agreed