Review: probability
- Monty Hall, weighted dice
- Frequentist v. Bayesian
- Independence
- Expectations, conditional expectations
- Exp. & independence; linearity of exp.
- Estimator (RV computed from sample)
- law of large #s, bias, variance, tradeoff
Review: probability Monty Hall, weighted dice Frequentist v. - - PowerPoint PPT Presentation
Review: probability Monty Hall, weighted dice Frequentist v. Bayesian Independence Expectations, conditional expectations Exp. & independence; linearity of exp. Estimator (RV computed from sample) law of large #s,
Review: probability
Covariance
measure of (in)dependence
Covariance
Correlation
individual r.v.s
Correlation & independence
independent?
uncorrelated? X Y
!! " ! !# !! !$ " $ ! #
5Correlation & independence
RVs are uncorrelated?
RVs are independent?
6Proofs and counterexamples
counterexample ? ?
7Counterexamples
not B
but are correlated A ⇒ B X, Y uncorrelated ⇒ X, Y independent ? ?
8Correlation & independence
independent?
uncorrelated?
!! " ! !# !! !$ " $ ! #
X Y
9Bayes Rule
1702–1761
10Exercise
emacsitis—prevalence 3 in 100,000
sensitive and 99% specific
Revisit: weighted dice
Learning from data
this class
sample
13Bayesian model learning
Prior: uniform
all H all T
0.2 0.4 0.6 0.8 1 0.05 0.1 0.15 0.2 0.25
15Posterior: after 5H, 8T
0.2 0.4 0.6 0.8 1 0.05 0.1 0.15 0.2 0.25
all H all T
16Posterior:11H, 20T
0.2 0.4 0.6 0.8 1 0.05 0.1 0.15 0.2 0.25
all H all T
17Why do we need graphical models?
distribution is as a big table
compactly using diagrams & numbers
19Example ML problem
30 days after inspection? (15 types)
Big graphical models
to express various ML algorithms
for now so we can fit them on the slides and do the math in our heads…
21Bayes nets
Rusty robot: the DAG
23Rusty robot: the CPTs
specifying P(X | pa(X))
24Interpreting it
25Benefits
marginals, conditionals ⇒ posteriors
26Inference example
P(M) P(Ra) P(O) P(W|Ra,O) P(Ru|M,W)
Independence
Conditional independence
P(M) P(Ra) P(O) P(W|Ra,O) P(Ru|M,W)
Conditional independence
break independences
derived from graph structure alone
interesting
30Graphical tests for independence
looking for factorizations
Bayes nets
Blocking
Explaining away
Son of explaining away
34d-separation
paths between X and Y
Longer paths
and inactive o/w
Another example
37Markov blanket
C = minimal set
to render C independent of rest of graph
38Learning Bayes nets
M Ra O W Ru T F T T F T T T T T F T T F F T F F F T F F T F T
P(Ra) = P(M) = P(O) = P(W | Ra, O) = P(Ru | M, W) =
39Laplace smoothing
M Ra O W Ru T F T T F T T T T T F T T F F T F F F T F F T F T
P(Ra) = P(M) = P(O) = P(W | Ra, O) = P(Ru | M, W) =
40Advantages of Laplace
Limitations of counting and Laplace smoothing
in all examples
more complicated algorithm—we’ll cover a related method later in course
Factor graphs
instead of DAG
43Rusty robot: factor graph
P(M) P(Ra) P(O) P(W|Ra,O) P(Ru|M,W)
44Convention
Non-CPT factors
factor graph
nonnegative #s allowed
Ex: image segmentation
47Factor graph → Bayes net
distribution
Independence
for independence and conditional independence
Independence example
50Modeling independence
independences
(conditional) independences