Topics and Objectives
- MDP and RL formulation
- Bellman equation
- Contraction Mapping Theorem
- Value Iteration
- Model-free vs. Model-based RL
- Fitted Q-Iteration
- Function approximation in RL
- Online vs. Offline RL
- Algorithms for solving MDPs
Week 12 (Nov 11 and Nov 13)
Week 13 (Nov 18 and Nov 20)
Week 14 Fall Break
Week 15 (Nov 18 and Nov 20)