Supervised learning setup. LMS.
Logistic regression. Perceptron. Exponential family.
Generative learning algorithms. Gaussian discriminant analysis. Naive Bayes.
Support vector machines.
Model selection and feature selection.
Ensemble methods: Bagging, boosting.
Evaluating and debugging learning algorithms.
Learning theory. (3 classes)
Bias/variance tradeoff. Union and Chernoff/Hoeffding bounds.
VC dimension. Worst case (online) learning.
Practical advice on how to use learning algorithms.
MDPs. Bellman equations.
Value iteration and policy iteration.
Linear quadratic regulation (LQR). LQG.
Q-learning. Value function approximation.
Policy search. Reinforce. POMDPs.
Dates for assignments
Assignment 1: Out 10/03. Due 10/17.
Assignment 2: Out 10/17. Due 10/31.
Assignment 3: Out 10/31. Due 11/14.
Assignment 4: Out 11/14. Due 12/05.
Midterm: 11/07, 6-9pm.
Term project: Proposals due 10/19 (5pm). Milestone due 11/16 (5pm).
Poster presentations on 12/13 (8.30am-11.30am); final writeup due on 12/14 (11:59pm, no late days).