Course Overview

This course aims to cover selected advanced statistical modeling tools beyond those in Stat 432 and Stat 542. This is the first time that I am offering the full version of the course, and I plan to cover these main topics:

Especially for the last two topics, they will utilize the tools and methods covered in the first two topics. There is no required textbook for this course. However, we will provide lecture notes and homework assignments in R Markdown. You can use either R or Python to complete your homework assignments. Detailed tentative topics and objectives are summarized in the following.


Reproducing Kernel Hilbert Space

Topics covered in this section include:

Some references:


Random Forests

Topics covered in this section include:

Some references


Personalized Medicine and Reinforcement Learning

Topics covered in this section include:

Some references:


Prerequisites

You need to be familiar with basic statistical theory, linear algebra, linear regression and some common machine learning models. We assume that you are able to utilize any models covered in STAT 432, at least computationally, to implement our introduced methods.


Software

You need to be familiar with basic programming in R and Python. Lecture notes and homework assignments will be provided in R Markdown and/or Jupyter Notebook. You can use either R and Python to complete your homework assignments. However, you are responsible for installing the required packages and libraries.


Using AI Tools

As we explore programming and machine learning, I encourage you to use AI tools in your homework to gain insights, automate routine tasks, and enrich your learning. These tools can be powerful aids, but they have limitations. Relying on them without a solid grasp of the underlying principles can lead to errors and misconceptions. Always be mindful of biases in data and the ethical implications of AI use. You must also report any use of AI tools in each homework submission and final project. Treat these tools as extensions of your skills—never as substitutes for critical thinking and rigorous understanding. Used responsibly, they can significantly enhance your educational experience.

The above paragraph was generated with the help of GPT-5. The university also has a discussion about ChatGPT. There is also a Guidance for Students. When writing your homework and final project, I would also consider this statement by Elsevier. If you have questions or concerns, please let me know or talk to an expert.


Gradings

The current plan is to have six to eight sets of homework assignments, but may not be equally weighted. Each homework may contain 50-200 points. Some homework assignments contain a mini-project. Letter grades are based on your total points.

A+ A A- B+ B B- C+ C C- D+ D D-
TBD 93% 90% 87% 83% 80% 77% 73% 70% 67% 63% 60%

Academic Integrity

The official University of Illinois policy related to academic integrity can be found in Article 1, Part 4 of the Student Code. Section 1-402 in particular outlines behavior which is considered an infraction of academic integrity. These sections of the Student Code will be upheld in this course. Any violations will be dealt with in a swift, fair and strict manner. Homework assignments are meant to be learning experiences. You may discuss the exercises with other students, but you must write up the solutions on your own. In short, do not cheat, it is not worth the risk. You are more likely to get caught than you believe. If you think you may be operating in a gray area, you most likely are.


Changes

The instructor reserves the right to make any changes he considers academically advisable. Such changes, if any, will be announced on this website and through email. Please note that it is your responsibility to keep track of the proceedings.