After this course, students should be able to
R to perform data cleaning and model fitting,
evaluation and be able to interpret the resultsRMarkdown to organize your report with proper
visualization of the data and resultsTentative subjects include:
Supplemental: SMLR - Statistical Learning and Machine Learning with R. This is my ongoing project that will contain most of the course material.
Supplemental: ISL - An Introduction to Statistical Learning with Applications in R by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani
Supplemental: ESL - The Elements of Statistical Learning: Data Mining, Inference, and Prediction by Trevor Hastie, Robert Tibshirani and Jerome Friedman. This is a more advanced textbook.
A course that covers linear regression and uses R, such
as STAT 420/425. Good knowledge of probability and statistics
(STAT400) and preliminaries of linear algebra (MATH 415) are also
assumed.
R and RStudio are required software for
this course. R is a freely available language and
environment for statistical computing and graphics. RStudio
is a free and open-source integrated development environment for
R. Alternatively you can use
Visual Studio Code (VS Code) as your
programming environment, which has more modern programming assisting
tools. You must have access to a computer where you are able to install
the most up-to-date versions of R and RStudio,
as well as install R packages. We will have an R tutorial
during the first week.
There will usually be one homework assignment each week, with the
total around 10. The total number may vary depending on the course
progression. The lowest score can be dropped. All
homework assignments must be submitted in .pdf format to Gradescope. A PDF report can be
created using R Markdown, which is a feature in
RStudio (or equivalently in VS Code). During
the first week, we will provide a detailed guide for using
R Markdown in either environment.
Late Policy:
Gradescope
system). So do not wait until the last second.Grading Policy:
R code should be clean and easy to
follow with necessary comments. Figures/Tables should be properly
sized/simplified/colored and their labels should be easy to read. Keep
in mind that you should submit a report, but not pieces of
R code that only obtains the numerical results. I encourage
you to watch this video
from the previous semester that discusses related issue..PDF) to Gradescope. Make all
your R code chunks visible for grading.As we explore programming and machine learning, I encourage you to use AI tools in your homework to gain insights, automate routine tasks, and enrich your learning. These tools can be powerful aids, but they have limitations. Relying on them without a solid grasp of the underlying principles can lead to errors and misconceptions. Always be mindful of biases in data and the ethical implications of AI use. You must also report any use of AI tools in each homework submission and final project. Treat these tools as extensions of your skills—never as substitutes for critical thinking and rigorous understanding. Used responsibly, they can significantly enhance your educational experience.
The above paragraph was generated with the help of GPT-5. The university also has a discussion about ChatGPT. There is also a Guidance for Students. When writing your homework and final project, I would also consider this statement by Elsevier. If you have questions or concerns, please let me know or talk to an expert.
Final report is due 11:59 PM, Dec 11th. You have two options to complete the final project. You can complete the final project with a team. Each team can have up to three members. For more details, please see the project page.
| Type | Precentage | 
|---|---|
| Homework | 60% | 
| Quizzes | 10% | 
| Final Project | 30% | 
Letter grades
| A+ | A | A- | B+ | B | B- | C+ | C | C- | D+ | D | D- | 
|---|---|---|---|---|---|---|---|---|---|---|---|
| TBD | 93% | 90% | 87% | 83% | 80% | 77% | 73% | 70% | 67% | 63% | 60% | 
The official University of Illinois policy related to academic integrity can be found in Article 1, Part 4 of the Student Code. Section 1-402 in particular outlines behavior which is considered an infraction of academic integrity. These sections of the Student Code will be upheld in this course. Any violations will be dealt with in a swift, fair and strict manner. Homework assignments are meant to be learning experiences. You may discuss the exercises with other students, but you must write up the solutions on your own. In short, do not cheat, it is not worth the risk. You are more likely to get caught than you believe. If you think you may be operating in a gray area, you most likely are.
The instructor reserves the right to make any changes he considers academically advisable. Such changes, if any, will be announced on this website and through email. Please note that it is your responsibility to keep track of the proceedings.