Credits: 4 (3-0-2)

Prerequisites: Any one of ELL409/ELL784/COL774/COL341/ COL333/COL671

Overlaps with: AIL722, ELL729

Description

Introduction and Basics of RL, Markov Decision Processes (MDPs), Dynamic Programming, Monte Carlo Methods (Prediction), Temporal difference Methods (Prediction), Monte Carlo, TD Method (Control), N-step TD, Eligibility Traces, Model based RL, (Action-)Value Function Approximation, Value Function Approximation, Policy Gradient, Policy Gradient, Recent Applications, Misc./Advanced Topics.

Prerequisite Tree

flowchart TD
COL777-1318[COL777]
COL777-1318 --- Or1319[Any one of]:::empty
ELL409-1320 --> COL106-1320[COL106]
COL106-1320 --> COL100-1320[COL100]
Or1319 -.-> ELL409-1320[ELL409]
ELL409-1320 --> MTL106-1320[MTL106]
ELL784-1321 --> MTL106-1321[MTL106]
Or1319 -.-> ELL784-1321[ELL784]
Or1319 -.-> COL774-1322[COL774]
COL774-1322 --> MTL106-1322[MTL106]
COL341-1323 --> MTL106-1323[MTL106]
Or1319 -.-> COL341-1323[COL341]
COL106-1323 --> COL100-1323[COL100]
COL341-1323 --> COL106-1323[COL106]
Or1319 -.-> COL333-1324[COL333]
COL333-1324 --> COL106-1324[COL106]
COL106-1324 --> COL100-1324[COL100]
Or1319 -.-> COL671-1325[COL671]
COL671-1325 --> COL106-1325[COL106]
COL106-1325 --> COL100-1325[COL100]

classDef empty height:17px, fill:transparent, stroke:transparent;
classDef trueEmpty height:0px, width:0px;