Time and Place: Monday and Thursday, 11:45 am-1:25 pm, Richards Hall 300
Khoury College of Computer Sciences
Instructor: Chris Amato
TAs listed in general information
Unless noted otherwise, all readings are from Reinforcement Learning: An Introduction, 2nd Ed., Sutton and Barto
| Date | Topic/Notes | Reading | Assignment due | |
|---|---|---|---|---|
| 1/8 | Introduction to RL | SB 1.1--1.6 | Self Assessment (Solutions) | |
| 1/12 | Bandit Problems | SB 2.1--2.10 | ||
| 1/15 | Bandit Problems | | 0. Intro assignment DUE (Friday) | |
| 1/19 | No class (MLK Day) | |||
| 1/22 | MDPs | SB 3.1--3.8 | 1. Bandits assignment DUE (Friday) | |
| 1/26 | MDPs | |||
| 1/29 | Dynamic Programming | SB 4.1--4.8 | 2. MDP assignment DUE (Friday) | |
| 2/2 | Dynamic Programming | |||
| 2/5 | Monte Carlo Methods | SB 5.1--5.7 (you can skip Example 5.5) | 3. DP assignment DUE (Friday) | |
| 2/9 | Monte Carlo Methods | | Project description OUT | |
| 2/12 | Temporal Difference Learning | SB 6.1--6.8 | 4. MC assignment DUE (Friday) | |
| 2/16 | No class (Presidents' Day) | |||
| 2/19 | Temporal Difference Learning | | Project proposal DUE | |
| 2/23 | Linear Function Approximation | SB 9.1--9.5, 9.8, 10.1 | 5. TD Learning assignment DUE (Monday) | |
| 2/26 | Exam 1 | SB 8.1--8.6;8.9--8.12 | ||
| 3/2 | No class (Spring break) | |||
| 3/5 | No class (Spring break) | |||
| 3/9 | Deep Learning Overview | GBC 6.1--6.4, 9.1--9.3 (optional background) | | |
| 3/12 | Deep Q-learning (DQN) | Mnih, 2014 DQN | 7. Function approx. assignment DUE (Friday) | |
| 3/16 | DQN and extensions | Hasselt, 2015 (Double DQN), Schaul, 2016 (Prioritized Replay), Wang, 2015 (Dueling), Mnih, 2016 (A3C), Rainbow | | |
| 3/19 | Policy gradient and actor critic | SB 13.1--13.7 | 8. DQN assignment DUE (Friday) | |
| 3/23 | Deep policy gradient and actor critic | Silver, 2014 (DPG), Lillicrap, 2016 (DDPG), Mnih, 2016 (A3C) | | |
| 3/26 | Planning and Learning | SB 8.1--8.6;8.9--8.12 | 9. PG assignment DUE (Friday) | |
| 3/30 | Planning and Learning | |||
| 4/2 | Advanced Topics | | 6. Planning and Learning assignment DUE (Friday) | |
| 4/6 | Exam 2 | |||
| 4/9 | Advanced Topics | |||
| 4/13 | Project Presentations | |||
| 4/16 | Project Presentations | |||
| 4/20 | No class (Patriots' Day) | |||
| 4/21 | Project Reports Due | Report due at 11:59 PM -- This is a hard deadline, no extensions | | |
Important note: unless noted otherwise, all readings and assignments are due on the day that they appear in the schedule.