DS 4400: Machine Learning and Data Mining I

Spring 2022

Class Information

Calendar

Additional Reading

Other Resources

 

Instructors:

  • Instructor: Alina Oprea (alinao)
  • TAs: Sri Harika Cherukuri, Nathaniel Hofmann, Jake Horban, Noah Lee, Xuyang Li, Talha Ongun

Class Schedule:

  • Monday and Wednesday 2:50-4:30pm EST
  • Location: Richards Hall 236

Office Hours:

  • Alina: Monday and Wednesday after class (4:45-5:45pm ET)

 

Class forum:  Piazza

Class policies:  Academic integrity policy is strictly enforced

Class description:

Machine learning is a fast-pacing and exciting field achieving human-level performance in tasks such as image classification, speech recognition, machine translation, precision medicine, and self-driving cars. Machine learning has already impacted greatly our daily lives and has the potential to transform the world even more in the near future. This course will provide a broad introduction to machine learning and cover the fundamental algorithms for supervised learning. We will cover topics related to regression, linear classification, non-linear classification, ensemble models, and deep learning. The class will also provide an introduction into ethics and fairness concerns of machine learning, as well as adversarial machine learning, an emerging area that studies the fundamental security issues of machine learning.   

 

Pre-requisites:

  • Probability
  • Calculus
  • Linear algebra

Textbook

[ISL] Gareth JamesDaniela WittenTrevor Hastie and Robert Tibshirani. An Introduction to Statistical Learning with Applications in R.

Grading

The grade will be based on:

 

-       Assignments – 25%

-       Final Project – 30%

-       Midterm Exam – 20%

-       Final Exam – 20%

-       Class Participation – 5%

 

 

 Calendar (Tentative)

 

Unit

Week

Date

Topic

Readings

1

Wed

01/19

Course outline (syllabus, grading, policies) [Slides]

[ISL] Chapters 1 and 2.1

Introduction and Review

2

Mon

01/24

Classification and regression

Bias-variance tradeoff [Slides]

[ISL] Chapters 2.2.1 and 2.2.2

Probability review from Stanford

Wed

01/26

Probability and linear algebra review [Slides]

Linear algebra review from Stanford

3

Mon

01/31

Simple linear regression

Closed from solution. Correlation [Slides]

[ISL] Chapter 3.1

Linear regression

Wed

02/02

Multiple linear regression

Closed form solution [Slides]

[ISL] Chapter 3.2

4

Mon

02/07

Gradient descent [Slides]

Lecture notes from Stanford on linear regression, part 1.1

Regularization and cross-validation

Wed

02/09

Regularization.

Lasso and ridge regression [Slides]

[ISL] Chapter 6.2

5

Mon

02/14

k-Nearest Neighbors (kNN).

Cross-validation

Linear classification. Logistic regression [Slides]

[ISL] Chapter 5.1

[ISL] Chapter 4.1, 4.2, and 4.3 (except 4.3.5)

 

 

Linear   Classification

Wed

02/16

Logistic regression

Gradient descent for logistic regression [Slides]

Lecture notes from Stanford on linear regression, part 2

 

 

6

Mon

02/21

President’s Day. University Holiday.

 

Generative Models

Wed

02/23

Evaluation of ML

ROC curves [Slides]

 

7

Mon

02/28

Generative models

LDA [Slides]

[ISL] Chapter 4.4.1 LDA

Wed

03/02

Midterm exam

 

Ethics in AI

8

Mon

03/07

Ethics in AI, Part I [Slides]

Project proposal due

Wed

03/09

Ethics in AI, Part II [Slides]

 

 

Mon

03/14

Spring break

 

 

 

Wed

03/16

Spring break

 

Tree and Ensemble Classification

 

9

Mon

03/21

Naïve Bayes

Decision trees [Slides]

Chapter 8.1.2

 

 

Wed

03/23

Decision trees

Information Gain

Ensemble learning [Slides]

 

SVM

10

Mon

03/28

Ensemble learning

Bagging

Boosting [Slides]

Chapter 8.2

 

Wed

03/30

Ensemble learning

Boosting

Deep learning introduction. [Slides]

Deep learning

 

 

10

Mon

04/04

Deep learning

Feed-Forward Networks [Slides]

Stanford notes on deep learning, parts 1 and 2

Optional: Chapter 4 from Dive into Deep Learning

 

Wed

04/06

Introduction to NLP

11

Mon

04/11

Feed-Forward Networks

Convolutional Neural Networks [Slides]

Optional: Chapter 6 from Dive into Deep Learning

Wed

04/13

Convolutional Neural Networks

Transfer Learning

Review for final exam [Slides]

 

12

Mon

04/18

Patriots day. University Holiday.

 

Wed

04/20

Final exam

 

13

Mon

04/25

Backpropagation

Regularization in Neural Networks

 

Research

Wed

04/27

Adversarial machine learning

 

 

 

Additional reading

 

Other resources

 

Books: