Schedule


Jump to: [Unit 1: Discrete] - [Unit 2: Regression] - [Unit 3: Mixtures] - [Unit 4: Time Series] - [Unit 5: MCMC]

For any class day with assigned readings, you should complete them before the start of class.

Schedule might change slightly as the semester goes on. Please check here regularly and refresh the page. Look for key announcements on Piazza.

Unit 1: Foundations for Discrete Data

Key ideas: Probability fundamentals, Maximum likelihood estimation, MAP estimation, Beta-Bernoulli and Dirichlet-Multinomial distributions, conjugacy, exponential family distributions

Models: Dirichlet-multinomial models for text

Math Practice: HW1, which covers joint, conditional, and marginal distributions; ML/MAP point estimation; Bayesian posterior estimation; Beta PDF and Gamma functions; conjugacy

Coding Practice: CP1 which covers text modeling with unigram distributions

date assignments topic required optional
Wed 01/15  
Course Overview & Probability Refresher
- Notes: day01.pdf
Sec. 1.1 and 1.2 of Bishop PRML Textbook Ch. 1
>>> Focus esp. on 1.2.1, 1.2.2, and 1.2.3
 
Mon 01/21   NO CLASS: MLK Holiday    
Wed 01/22
out:
- HW1
- CP1
Maximum Likelihood for Binary Data
- Notes: day02.pdf
>>> Contrast maximum likelihood vs Bayesian approach
>>> Focus on ML estimator Eq. 2.5-2.8
 
Mon 01/27  
Beta-Bernoulli models for Binary Data
- Notes: day03.pdf
>>> Focus on the Beta and Bernoulli distributions
 
Wed 01/29  
Dirichlet-Discrete models for Count Data
- Guest Lecturer: Prof. Anselm Blumer
- Notes: day04.pdf
- Bonus Notes: Lagrange Multipliers
>>> Focus on the Dirichlet distribution
>>> Lagrange multipliers
For more depth on Dirichlet, see Frigyik, Kapila, and Gupta 2010

Unit 2: Multivariate Gaussians and Regression

Key ideas: multivariate Gaussian distributions, model selection, Laplace approximation

Models: Bayesian linear regression, Bayesian logistic regression, generalized linear models

Algorithms: gradient descent, methods for model selection

Math Practice: HW2

Coding Practice: CP2

date assignments topic required optional
Mon 02/03  
Gaussians and Linear Algebra Review
- Notes: day05.pdf
>>> Focus on definitions of Univariate and Multivariate Gaussian, ML estimators
 
Wed 02/05
due:
- HW1
- CP1
Multivariate Gaussians and Linear Algebra
- Notes: day06.pdf
>>> Dig into linear algebra of Multivariate Gaussian
Immersive Linear Algebra: Determinants
Immersive Linear Algebra: Eigenvalues and eigenvectors
Mon 02/10  
Bayesian Linear Regression 1/2
- Notes: day07.pdf
Read Sec. 3.1 of Bishop PRML Textbook Ch. 3
>>> Focus on ML estimator for linear regression
Skim Sec. 3.2 of Bishop PRML Textbook Ch. 3
>>> Bias/Variance tradeoff
Wed 02/12
due:
- Quizlet1
out:
- HW2
Bayesian Linear Regression 2/2
- Notes: day08.pdf
Read Sec. 3.3.1 and 3.3.2 of Bishop PRML Textbook Ch. 3
>> Focus on the posterior and predictive distribution
 
Mon 02/17   NO CLASS: President's Day    
Wed 02/19  
Model Selection for Linear Regression
- Slides: day09.pdf
- Guest Lecturer: Prof. Liping Liu
Read Sec. 3.4 and 3.5 of Bishop PRML Textbook Ch. 3
>> Focus on model selection and hyperparameter estimation
 
Thu 02/20  
Bayesian Linear Models for Classification
- Notes: day10.pdf
Read Sec. 4.3 of Bishop PRML Textbook Ch. 4
>>> Focus on Binary and Multi-class classification
Read Sec. 4.4 of Bishop PRML Textbook Ch. 4
>>> Try to understand the Laplace approximation
 
Mon 02/24  
Gradient Descent for Bayesian Logistic Regression
- Notes: day11.pdf
>>> Bayesian Logistic Regression
 

Unit 3: Clustering, Mixture Models, and Expectation-Maximization

Key ideas: coordinate ascent optimization, local optima, expectations

Models: Mixture models

Algorithms: k-means, expectation maximization, variational inference

Math Practice: HW3

Coding Practice: CP3

date assignments topic required optional
Wed 02/26
due:
- HW2
out:
- CP2
K-Means Clustering
- Notes: day12.pdf
 
Mon 03/02  
Gaussian Mixture Models and ML Estimation
- Notes: day13.pdf
 
Wed 03/04
due:
- Quizlet2
- CP2
Midterm Review
   
Mon 03/09  
ML Estimation with GMMs: Expectation Maximization and Gradient Descent
- Notes: day15.pdf
Sec. 9.2.2, 9.3.1, and 9.3.2 of Bishop PRML Textbook Ch. 9
 
Wed 03/11 Take-home Midterm due 03/25 CANCELLED due to COVID-19    
Mon 03/16   NO CLASS: SPRING BREAK    
Wed 03/18   NO CLASS: SPRING BREAK    
Mon 03/23   CANCELLED due to COVID-19    
Wed 03/25
out:
- HW3
- CP3
Expectation Maximization for GMMs
- Notes: day16.pdf
- Video part 1 : Recap of GMMs
- Video part 2 : GMMs with Latent Assignments Z
- Video part 3 : Expectations of Complete Likelihood
- Video part 4 : Lower Bound of Incomplete Likelihood
- Video part 5 : EM as Coordinate Ascent on Lower Bound Objective
Sec. 9.3 and 9.4 of Bishop PRML Textbook Ch. 9
Read Sec. 10.1 of Bishop PRML Textbook Ch. 10
For background on entropy: Sec. 1.6 of Bishop PRML Textbook Ch. 1
For background on KL divergence: Sec. 1.6.1 of Bishop PRML Textbook Ch. 1
For more depth: Skim Sec. 10.2 of Bishop PRML Textbook Ch. 10

Unit 4: Hidden Markov models for Time-Series

Key ideas: dynamic programming, joint MAP vs. marginal MAP

Models: Hidden markov models

Algorithms: forward-backward algorithm, Viterbi algorithm, belief propagation algorithm

Math Practice: HW4

Coding Practice: CP4

date assignments topic required optional
Mon 03/30  
Markov Models and Hidden Markov Models
- Notes: day17.pdf
- Video day17 part1 : Unit 4 Motivation: Dependencies in Sequential Data
- Video day17 part2 : The Markov assumption
- Video day17 part3 : Markov models for discrete sequences
- Video day17 part4 : Hidden Markov models
Sec. 13.1 'Markov models' Bishop PRML Textbook Ch. 13
Sec. 13.2 'Hidden Markov models' Bishop PRML Textbook Ch. 13
>>> Only the intro before 13.2.1
>>> For more depth and coding practice
Wed 04/01
due:
- HW3
- CP3
EM for HMMs
- Notes: day18.pdf
- Video day18 part1 : EM for HMM parameter estimation
- Video day18 part2 : Expected log likelihood for HMMs
- Video day18 part3 : M-step for HMMs
- Video day18 part4 : E-step for HMMs: overview
- Video day18 part5 : E-step for HMMs: forward algorithm and backward algorithm
Sec. 13.2.1 'Maximum likelihood for the HMM' Bishop PRML Textbook Ch. 13
Sec. 13.2.2 'Forward-backward algorithm' Bishop PRML Textbook Ch. 13
Sec. 13.2.4 'Scaling factors' Bishop PRML Textbook Ch. 13
>>> Think about numerical stability of calculations
 
Mon 04/06
out:
- HW4
Viterbi for HMMs

- Notes: day19.pdf
- Video day19 part1 : Motivation for inferring hidden states
- Video day19 part2 : Most likely hidden sequence problem
- Video day19 part3 : Intuition behind recursive solution
- Video day19 part4 : Viterbi algorithm
Sec. 13.2.5 'The Viterbi Algorithm' Bishop PRML Textbook Ch. 13
Sec. 13.2.6 'Extensions to the HMM' Bishop PRML Textbook Ch. 13

Unit 5: Markov Chain Monte Carlo

Key ideas: Markov chains, revisibility, ergodicity, detailed balance, probabilistic programming

Models: mixture models, Bayesian logistic regression, factor analysis

Algorithms: Metropolis-Hastings algorithm, Gibbs sampling

Math Practice: HW5

Coding Practice: CP5

date assignments topic required optional
Wed 04/08
due:
- Quizlet3
Sampling Methods and Markov Chain Monte Carlo
- Notes: day20.pdf
- Video day20 part1 : Monte Carlo estimates of expectations
- Video day20 part2 : Directed graphical models & ancestral sampling
- Video day20 part3 : MCMC Intro, Stationary Distributions, Ergodicity
- Video day20 part4 : Sampling via Inverting the CDF
- Video day20 part5 : Transformations of Sampled Variables
Sec. 11.1 'Basic Sampling Methods' Bishop PRML Textbook Ch. 11
>>> Focus on basic methods using transformed uniform r.v. in 11.1.1
>>> Skim rejection sampling in 11.1.2
>>> Skim importance sampling in 11.1.4
Sec. 11.2 'Markov Chain Monte Carlo' Bishop PRML Textbook Ch. 11
>>> Focus on the overview section and 11.2.1 Markov chains
 
Mon 04/13
out:
- HW5
- CP4
Random Walk Proposals and Metropolis-Hastings Algorithm
- Notes: day21.pdf
- Video day21 part1 : Markov Transitions that Propose then Accept/Reject
- Video day21 part2 : Random walk proposals
- Video day21 part3 : Detailed Balance, Proof of Random Walk's Stationary Distribution
- Video day21 part4 : Metropolis and Metropolis-Hastings algorithms
Sec. 11.2.2 'Metropolis-Hastings algorithm' Bishop PRML Textbook Ch. 11
 
Wed 04/15
due:
- HW4
Gibbs Sampling
- Notes: day22.pdf
- Video day22 part1 : Overview of Sampling Vector Random Var.
- Video day22 part2 : Pro/con comparison of Gibbs vs. Random Walk
- Video day22 part3 : Gibbs Sampling Algorithm
- Video day22 part4 : Proof sketch of Gibbs Sampling correctness
- Notebook: GibbsSampling.ipynb
Sec. 11.3 'Gibbs sampling' Bishop PRML Textbook Ch. 11
 
Mon 04/20  
NO CLASS (Patriot's Day)
BONUS CONTENT: Hamiltonian Monte Carlo
This is extra material. Nothing from this will be on any exam.
Sec. 1-4 of Betancourt 2017
Wed 04/22
due:
- Quizlet4 (late this week)
- CP4
Probabilistic Programming
>>> Read thru Case Study 1
>>> Skim, focus on examples
>>> Example of Gaussian mixtures in PyMC3
Mon 04/27
due:
- HW5
- Quizlet5
- CP5 (late deadline 5/3)
Final Exam Review
   
FINALS WEEK  
FINAL EXAM
Take-home
- due 05/08