Reading and Video Assigments
All readings are due by the START of the class for which they are assigned. Reading quizzes will be administered at the start of many classes to encourage completion of all reading assignments in a timely manner.
Required Textbooks
All required and reference textbooks are freely available online (though hard copies can also be purchased if desired). The six textbooks that we will focus on for the course include:
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An Introduction to Statistical Learning: With Applications in R (7th ed.). (pdf)
Kuhn, M. & Johnson, K. (2018). Applied Predictive Modeling. New York, NYL Springer Science. (pdf)
Grolemund, G., & Wickham, H. (2016). R for Data Science: Import, Tidy, Transform, Visualize, and Model Data (1st ed.). Sebastopol, CA: O’Reilly Media, Inc. (bookdown)
Silge, J., & Robinson, D. (2017). Text Mining with R: A Tidy Approach (1rst ed.). Beijing; Boston: O’Reilly Media. (bookdown)
Kuhn, M., & Johnson, K. (2019). Feature Engineering and Selection: A Practical Approach for Predictive Models (1 edition). Boca Raton, FL: Chapman and Hall/CRC. (bookdown)
Unit 1: Overview of machine learning concepts and uses
Unit 2: Introduction to regression models
- January 30th
- James et al. (2013) Chapter 3: Linear Regression (pp 59 - 109)
Unit 3: Introduction to classification models
Unit 4: Cross validation methods
Unit 5: Subsetting and filtering
- March 31st
- READ: James et al. (2013) Chapter 6: Linear Model Selection and Regularization (pp 203 - 214)
- WATCH: Subsetting and filtering, 1/2
- WATCH: Subsetting and filtering, 2/2
Unit 6: Regularization and penalized models
- April 6th
- READ: James et al. (2013) Chapter 6: Linear Model Selection and Regularization (pp 203 - 228)
- WATCH: Regularization, 1/3
- WATCH: Regularization, 2/3
- WATCH: Regularization, 3/3
Unit 7: Bootstrapping and permutation tests
- April 14th
- READ: James et al. (2013) Chapter 6: Resampling (pp- 187 - 190; section titled “The Bootstrap”)
- READ: Statistical significance tests for comparing machine learning models by Jason Brownlee