After this course, students should be able to
R
to perform data cleaning and model fitting,
evaluation and be able to interpret the resultsRMarkdown
to organize your report with proper
visualization of the data and resultsTentative subjects include:
Supplemental: SMLR - Statistical Learning and Machine Learning with R. This is my ongoing project that will contain most of the course material.
Supplemental: ISL - An Introduction to Statistical Learning with Applications in R by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani
Supplemental: ESL - The Elements of Statistical Learning: Data Mining, Inference, and Prediction by Trevor Hastie, Robert Tibshirani and Jerome Friedman. This is a more advanced textbook.
A course that covers linear regression and uses R
, such
as STAT 420/425. Good knowledge of probability and statistics
(STAT400) and preliminaries of linear algebra (MATH 415) are also
assumed.
R
and RStudio
are required software for
this course. R
is a freely available language and
environment for statistical computing and graphics. RStudio
is a free and open-source integrated development environment for
R
. Alternatively you can use
Visual Studio Code
(VS Code
) as your
programming environment, which has more modern programming assisting
tools. You must have access to a computer where you are able to install
the most up-to-date versions of R
and RStudio
,
as well as install R
packages. We will have an R tutorial
during the first week.
There will usually be one homework assignment each week, with the
total around 10. The total number may vary depending on the course
progression. The lowest score can be dropped. All
homework assignments must be submitted in .pdf
format to Gradescope. A PDF report can be
created using R Markdown
, which is a feature in
RStudio
(or equivalently in VS Code
). During
the first week, we will provide a detailed guide for using
R Markdown
in either environment.
Late Policy:
Gradescope
system). So do not wait until the last second.Grading Policy:
R
code should be clean and easy to
follow with necessary comments. Figures/Tables should be properly
sized/simplified/colored and their labels should be easy to read. Keep
in mind that you should submit a report, but not pieces of
R
code that only obtains the numerical results. I encourage
you to watch this video
from the previous semester that discusses related issue..PDF
) to Gradescope. Make all
your R
code chunks visible for grading.As we explore the fascinating realm of programming and machine learning, I encourage you to utilize AI tools in your homework assignments. These cutting-edge technologies can provide you with insights, automate routine tasks, and enhance your learning experience. However, I must stress the importance of understanding both the capabilities and limitations of these tools. While they can be powerful aids, relying solely on them without a deep comprehension of the underlying principles can lead to misconceptions and errors. Be mindful of biases in data and the ethical implications of using AI in various contexts. Embrace these tools as an extension of your skills, but never substitute them for critical thinking and rigorous understanding. I believe that the responsible use of AI tools can greatly enrich your learning.
The above paragraph was generated by GPT-4. The university also has a discussion about ChatGPT. There is also a Guidance for Students. When writing your homework and final project, I would also consider this statement by Elsevier. If you have questions or concerns, please let me know or talk to an expert.
Final report is due 11:59 PM, Dec 12th. You have two options to complete the final project. You can complete the final project with a team. Each team can have up to three members. For more details, please see the project page.
Type | Precentage |
---|---|
Homework | 60% |
Quizzes | 10% |
Final Project | 30% |
Letter grades
A+ | A | A- | B+ | B | B- | C+ | C | C- | D+ | D | D- |
---|---|---|---|---|---|---|---|---|---|---|---|
TBD | 93% | 90% | 87% | 83% | 80% | 77% | 73% | 70% | 67% | 63% | 60% |
The official University of Illinois policy related to academic integrity can be found in Article 1, Part 4 of the Student Code. Section 1-402 in particular outlines behavior which is considered an infraction of academic integrity. These sections of the Student Code will be upheld in this course. Any violations will be dealt with in a swift, fair and strict manner. Homework assignments are meant to be learning experiences. You may discuss the exercises with other students, but you must write up the solutions on your own. In short, do not cheat, it is not worth the risk. You are more likely to get caught than you believe. If you think you may be operating in a gray area, you most likely are.
The instructor reserves the right to make any changes he considers academically advisable. Such changes, if any, will be announced on this website and through email. Please note that it is your responsibility to keep track of the proceedings.