Probabilistic Graphical Models

January 2020

Probabilistic Graphical Models refers to the concise representation of probability distributions using graphs, together with efficient algorithms for sampling from distributions represented in this form. Sampling may be required from the joint distribution, the marginals, or even conditional distributions. Other algorithmic questions include computing the Maximum Likelihood Estimate (MLE), the Maximum a Posteriori (MAP) estimate, etc. The topic has deep connections and applications to various fields, including Theoretical Computer Science, Machine Learning, Statistical Physics, and Bioinformatics. We will also cover the analysis of Markov Chain Monte Carlo (MCMC) algorithms.
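
To make these queries concrete, here is a toy computation of a marginal, a conditional, and the MAP assignment from a small joint table (the numbers are illustrative choices, not from the course material):

```python
import numpy as np

# Joint distribution p(X, Y) over two binary variables
# (rows index X = 0/1, columns index Y = 0/1).
p_xy = np.array([[0.30, 0.10],
                 [0.15, 0.45]])

p_x = p_xy.sum(axis=1)                # marginal p(X)
p_y_given_x1 = p_xy[1] / p_x[1]       # conditional p(Y | X = 1)
map_xy = np.unravel_index(p_xy.argmax(), p_xy.shape)  # MAP assignment of (X, Y)

print(p_x)           # [0.4 0.6]
print(p_y_given_x1)  # [0.25 0.75]
print(map_xy)        # (1, 1)
```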

Broadly, the course will cover four modules:

  1. Representations
  2. Inference
  3. Learning
  4. Advanced Topics (More on MCMC Methods, Normalizing Flows, Learning Theory)

Draft Syllabus

Grading

Type of Eval – Weightage (%)
Quiz 1 – 10
Mid Sem – 15
Quiz 2 – 10
End Sem – 25
Assignments – 20
Project – 20

Lectures

Lec 1-2: Probability Recap

  • Read: To recall the basics of probability and graph theory, please go through [DB] Chapters 1 and 2.

  • Solve:

    1. For any two distributions $p, q$ on $\{1, \cdots, n\}$, show that $$\max_{A \subseteq \{1, \cdots, n\}} | p(A) - q(A) | = \frac{\sum_{i=1}^n | p(i) - q(i) |}{2}.$$ Note that the event $A$ chosen in the LHS is the best possible way of distinguishing $p$ from $q$ using a single sample, and the RHS is half the $\ell_1$ distance $\| p - q \|_1$. (A numerical check of this identity appears after this exercise list.)

    2. Prove that any DAG (Directed Acyclic Graph) with a finite number of vertices has at least one vertex with no incoming edges (i.e., no edges pointing towards it). Also show that there is at least one vertex with no outgoing edges.

       [Hint] There are infinite graphs for which the statement fails, so your proof must use the fact that the graph has only finitely many nodes.
    
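A minimal numerical check of the identity in exercise 1, brute-forcing the maximum over all events (the distributions below are random placeholders, not part of the exercise):

```python
import itertools
import numpy as np

rng = np.random.default_rng(0)
n = 8
p = rng.random(n); p /= p.sum()   # an arbitrary distribution on {1, ..., n}
q = rng.random(n); q /= q.sum()   # another arbitrary distribution

# LHS: brute force over all 2^n events A
lhs = max(
    abs(p[list(A)].sum() - q[list(A)].sum())
    for r in range(n + 1)
    for A in itertools.combinations(range(n), r)
)

# RHS: half the l1 distance between p and q
rhs = 0.5 * np.abs(p - q).sum()

print(lhs, rhs)  # the two values agree up to floating-point error
```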

Lec 3: Belief Networks I

Free parameters in distributions | Conditional Independence reduces parameters | Graph Representation | d-Connectivity and Independence

  • Read: [DB] Sections 3.1 - 3.3
  • Solve: [DB] Section 3.8, Exercises 24, 27, 35.
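
As a quick illustration of how conditional independence reduces free parameters, here is a small sketch comparing a full joint over $n$ binary variables with a Markov-chain factorization; the chain structure is an example of my own, not taken from [DB]:

```python
# Counting free parameters for n binary variables.

def full_joint_params(n):
    # A full joint table has 2^n entries, minus one normalization constraint.
    return 2 ** n - 1

def bn_params(parent_counts):
    # A binary node with k binary parents needs one free number
    # P(X = 1 | parents) per parent configuration: 2^k in total.
    return sum(2 ** k for k in parent_counts)

n = 10
chain = [0] + [1] * (n - 1)   # X1 -> X2 -> ... -> Xn: root has 0 parents
print(full_joint_params(n))   # 1023
print(bn_params(chain))       # 1 + 2 * 9 = 19
```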

Lec 4: Belief Networks II

d-Connectivity | I-maps | Minimal and Perfect I-maps

  • Explore: [KF] Chapter 3, Section 3.3
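
To see d-connectivity numerically, the sketch below checks the classic collider $A \to C \leftarrow B$: $A$ and $B$ are marginally independent but become dependent once we condition on $C$. The network and probability tables are toy choices of my own:

```python
import numpy as np

rng = np.random.default_rng(1)
pA = np.array([0.6, 0.4])                    # P(A)
pB = np.array([0.3, 0.7])                    # P(B)
pC_AB = rng.random((2, 2, 2))
pC_AB /= pC_AB.sum(axis=-1, keepdims=True)   # P(C | A, B)

# Joint P(A, B, C) = P(A) P(B) P(C | A, B)
joint = pA[:, None, None] * pB[None, :, None] * pC_AB

# Marginally, A and B are independent (C is summed out).
pAB = joint.sum(axis=2)
print(np.allclose(pAB, np.outer(pA, pB)))    # True

# Conditioned on C = 0, they are (generically) dependent.
pAB_c0 = joint[:, :, 0] / joint[:, :, 0].sum()
product = np.outer(pAB_c0.sum(axis=1), pAB_c0.sum(axis=0))
print(np.allclose(pAB_c0, product))          # False
```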

Lec 5: Markov Networks I

Lec 6: Markov Networks II

Lec 7: Inference I: Variable Elimination and Message Passing
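
As a preview of variable elimination, the following minimal sketch computes the marginal of the last variable in a chain of pairwise factors by summing out one variable at a time; the chain length, state count, and random factors are illustrative assumptions, not from the lecture:

```python
import numpy as np

rng = np.random.default_rng(2)
# p(x1, ..., x4) proportional to f1(x1, x2) f2(x2, x3) f3(x3, x4),
# with three states per variable.
factors = [rng.random((3, 3)) for _ in range(3)]

message = np.ones(3)                 # trivial initial message into x1
for f in factors:
    # Multiply the incoming message onto the factor, then sum out the
    # earlier variable -- one elimination step.
    message = (message[:, None] * f).sum(axis=0)

marginal = message / message.sum()   # p(x4)
print(marginal)
```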

Lec 8-9: Markov Chain Monte Carlo Sampling

Lec 10: Tutorial - I

Lec 11: Probability Basics of Sampling: Tail Bounds

Lec 12: Amplification and Ising Model

  • Read:

    • MCMC book by Art Owen: pages 6–8 for the Ising model definition; pages 16–18 for the general MCMC formulation (also known as Metropolis–Hastings); pages 31–32 for the MCMC formulation of the Ising model. A small sampler sketch follows this list.
  • Mid Sem Exam
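
A small Metropolis–Hastings sketch for the 2D Ising model, in the spirit of the general formulation in the Owen notes; the lattice size, temperature, and step count are illustrative choices of mine, not taken from the text:

```python
import numpy as np

rng = np.random.default_rng(3)
L, beta = 16, 0.4                       # lattice side, inverse temperature
spins = rng.choice([-1, 1], size=(L, L))

def neighbor_sum(s, i, j):
    # Sum of the four neighboring spins, with periodic boundaries.
    return (s[(i + 1) % L, j] + s[(i - 1) % L, j]
            + s[i, (j + 1) % L] + s[i, (j - 1) % L])

for _ in range(100_000):
    i, j = rng.integers(L, size=2)      # propose flipping one uniform spin
    # Energy change of the flip; the proposal is symmetric, so the
    # acceptance probability is min(1, exp(-beta * dE)).
    dE = 2 * spins[i, j] * neighbor_sum(spins, i, j)
    if dE <= 0 or rng.random() < np.exp(-beta * dE):
        spins[i, j] *= -1

print(spins.mean())                     # magnetization of the final sample
```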

Lec 13: Intro to Learning Theory

Lec 14: Agnostic PAC Learning

Lec 15: VC Dimension I

Lec 16: VC Dimension II - Sauer–Shelah–Perles Lemma

Lec 17: Recap & Learning Conjunctions

Lec 18: No Free Lunch Theorem

Lec 19: Learning Graphical Models

Lec 20: Group Testing

Textbook and References

  • [DB] David Barber, Bayesian Reasoning and Machine Learning, Cambridge University Press, 2012.
  • [KF] Daphne Koller and Nir Friedman, Probabilistic Graphical Models: Principles and Techniques, MIT Press, 2009.
  • Art Owen, Monte Carlo theory, methods and examples (online book; the MCMC chapter is referenced in Lec 12).