Sample-based Learning Methods

Software > Computer Software > Educational Software Alberta Machine Intelligence Institute & University of Alberta

Course Overview

What You'll Learn

  • In this course, you will learn about several algorithms that can learn near optimal policies based on trial and error interaction with the environment---learning from the agent’s own experience.
  • Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior.
  • We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning.

In this course, you will learn about several algorithms that can learn near optimal policies based on trial and error interaction with the environment---learning from the agent’s own experience. Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior. We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning. We will wrap up this course investigating how we can get the best of both worlds: algorithms that can combine model-based planning (similar to dynamic programming) and temporal difference updates to radically accelerate learning. By the end of this course you will be able to: - Understand Temporal-Difference learning and Monte Carlo as two strategies for estimating value functions from sampled experience - Understand the importance of exploration, when using sampled experience rather than dynamic programming sweeps within a model - Understand the connections between Monte Carlo and Dynamic Programming and TD. - Implement and apply the TD algorithm, for estimating value functions - Implement and apply Expected Sarsa and Q-learning (two TD methods for control) - Understand the difference between on-policy and off-policy control - Understand planning with simulated experience (as opposed to classic planning strategies) - Implement a model-based approach to RL, called Dyna, which uses simulated experience - Conduct an empirical study to see the improvements in sample efficiency when using Dyna

Course FAQs

Is this an accredited online course?

Accreditation for 'Sample-based Learning Methods' is determined by the provider, Alberta Machine Intelligence Institute & University of Alberta. For online college courses or degree programs, we strongly recommend you verify the accreditation status directly on the provider's website to ensure it meets your requirements.

Can this course be used for continuing education credits?

Many of the courses listed on our platform are suitable for professional continuing education. However, acceptance for credit varies by state and licensing board. Please confirm with your board and {course.provider} that this specific course qualifies.

How do I enroll in this online school program?

To enroll, click the 'ENROLL NOW' button on this page. You will be taken to the official page for 'Sample-based Learning Methods' on the Alberta Machine Intelligence Institute & University of Alberta online class platform, where you can complete your registration.