Topics and Objectives (Apr 1 – Apr 5)
- Data generating process for dynamic treatment regimes
- Dynamic programming and batch Q-learning
- Generalization bound for batch Q-learning
- Adapting outcome weighted learning (OWL) to sequential decisions
- Using the DynTxRegime package to implement these methods
Lecture Notes
Homework