Topics and Objectives (Apr 8 – Apr 12)
- Data generating process of Markov Decision Process
- Infinite horizon and discounted reward
- Bellman Equation and the optimal policy
- Contraction mapping theorem
- Bellman optimality equation
- Value iteration algorithm
Lecture Notes
Homework
- No new homework this week.
- See Homework 4 for details