Machine learning is a field of inquiry devoted to understanding and building methods that ‘learn’, that is, methods that leverage data to improve performance on some set of tasks. In this course, we focus on the core concepts of machine learning that beginning ML researchers must know. We cover ‘classical machine learning’ primarily using R, and explore applications to environmental science. To understand broader concepts of artificial intelligence or deep learning, a strong fundamental knowledge of machine learning is indispensable.
Instructor: Mateo Robbins (mjrobbins@ucsb.edu)
Teaching assistant: Allie Caughman (acaughman@bren.ucsb.edu)
Lecture: TTh 9:30am - 10:45am PST (Bren 1424)
Section:
First 5 weeks: Th 2:00pm - 2:50pm, 3:00 - 3:50pm
2/15 and onward: Th 12:30 - 1:20pm, 1:30 - 2:20pm
The goal of EDS 232 is to equip students with a strong foundation in the core concepts of machine learning. By the end of the course, students should be able to:
Build machine learning models in R using popular machine learning packages
Build and train supervised machine learning models for prediction and binary classification tasks, including linear and logistic regression.
Apply best practices for machine learning development so that your models generalize to data and tasks in the real world.
Build and use decision trees and tree ensemble methods, including random forests and boosted trees.
Use unsupervised learning techniques including clustering.
R
version 4.0.2 (or higher)
RStudio version 1.4.1103 (or higher)
Hands-On Machine Learning with R, by Bradley Boehmke and Brandon Greenwell
Week # | Dates | Lecture | Lab |
1 | 1/9, 1/11 | Introduction, Linear Regression and ML Modeling Fundamentals I | 1.Regression I (Pumpkins) |
2 | 1/16, 1/18 | Regularized Regression and ML Modeling Fundamentals II | 2.Regression II (Pumpkins) |
3 | 1/23, 1/25 | Logistic Regression, Classification | 3.Regularized Regression (Abalone) |
4 | 1/30, 2/1 | K-nearest neighbors, Decision Trees | 4.Classification (Wildfire) |
5 | 2/6, 2/8 | Ethics and Bias in ML | 5.Classification (Spotify) |
6 | 2/13, 2/15 | Bagging, Random Forest | 5.Classification (Spotify) |
7 | 2/20, 2/22 | Gradient Boosting | 6.Boosting |
8 | 2/27, 3/1 | Clustering | 7.Clustering |
9 | 3/6, 3/8 | Support Vector Machines | 8.SVM |
10 | 3/13, 3/15 | Deep Learning, CalCOFI | 9.Kaggle |