Learning From Data Lecture 5 Training Versus Testing
The Two Questions of Learning Theory of Generalization (Ein ≈ Eout) An Effective Number of Hypotheses A Combinatorial Puzzle
- M. Magdon-Ismail
CSCI 4100/6100
Learning From Data Lecture 5 Training Versus Testing The Two - - PowerPoint PPT Presentation
Learning From Data Lecture 5 Training Versus Testing The Two Questions of Learning Theory of Generalization ( E in E out ) An Effective Number of Hypotheses A Combinatorial Puzzle M. Magdon-Ismail CSCI 4100/6100 recap: The Two Questions
The Two Questions of Learning Theory of Generalization (Ein ≈ Eout) An Effective Number of Hypotheses A Combinatorial Puzzle
CSCI 4100/6100
recap: The Two Questions of Learning
The Hoeffding generalization bound:
in-sample error model complexity
|H| Error |H|∗
There is a tradeoff when picking |H|.
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 2 /18
Goal of generalization theory − →
in-sample error model complexity
|H| Error |H|∗
in-sample error model complexity
model complexity Error
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 3 /18
|H| is overkill − →
We do not know which g, so use a worst case union bound.
P[Bg] ≤ P[any Bm] ≤
|H|
P[Bm].
B3 B1 B2
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 4 /18
Measuring diversity on N points − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 5 /18
Example: large H − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 6 /18
. . . through the eyes of D − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 7 /18
Just one dichotomy − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 8 /18
An effective number of hypotheses − →
(h(x1), . . . , h(xN)). A dichotomy of the inputs.
If H is diverse, we get many different dichotomies. If H contains similar functions, we only get a few dichotomies.
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 9 /18
Growth function − →
(set of dichotomies induced by H)
x1,...,xN |H(x1, . . . , xN)|.
2N ln 2|H| δ
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 10 /18
Example: 2-d perceptron − →
Cannot implement Can implement all 8 Can implement at most 14
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 11 /18
Example: 1-d positive ray − →
w0 · · · x1 x2 + · · · xN
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 12 /18
Example: 2-d positive rectangle − →
N = 4 N = 5
x1 x2 x3 x4 x1 x2 x3 x4 x4
H implements all dichotomies some point will be inside a rectangle defined by others
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 13 /18
The growth functions summarized − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 14 /18
Combinatorial puzzle: dichotomys on 3 points − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 15 /18
Two points shattered − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 16 /18
Another set of dichotomys − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 17 /18
What about N = 4? − →
c A M L Creator: Malik Magdon-Ismail
Training Versus Testing: 18 /18