SLIDE 1 CSC421 Intro to Artificial Intelligence
UNIT 28: Learning from observations (+ leftovers from Probabilistic Reasoning)
SLIDE 2
Filtering Example
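The filtering example can be sketched as a single matrix-form update on a tiny two-state model. The transition and sensor numbers below are illustrative assumptions (the familiar umbrella-world values), not part of the slides:

```python
# One HMM filtering step, f_{1:t} = alpha * O_t * T^T * f_{1:t-1},
# written element-wise for a two-state model (illustrative numbers).
T = [[0.7, 0.3],
     [0.3, 0.7]]                 # T[i][j] = P(X_t = j | X_{t-1} = i)

def O(e):                        # diagonal of the sensor matrix for evidence e
    return [0.9, 0.2] if e else [0.1, 0.8]

def filter_step(f, e):
    # Predict with the transition model, weight by the sensor model, normalize
    g = [O(e)[j] * sum(T[i][j] * f[i] for i in range(2)) for j in range(2)]
    s = sum(g)
    return [x / s for x in g]

print(filter_step([0.5, 0.5], True))   # -> roughly [0.818, 0.182]
```

Each step costs only a matrix-vector product, which is why filtering runs online in constant space per step.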
SLIDE 3 Smoothing
- Divide the evidence e1:t into e1:k, ek+1:t
- P(Xk | e1:t) = P(Xk | e1:k, ek+1:t)
  = α P(Xk | e1:k) P(ek+1:t | Xk, e1:k)
  = α P(Xk | e1:k) P(ek+1:t | Xk)
  = α f1:k × bk+1:t
- The backward message bk+1:t is computed by a recursion that runs backwards from t
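The forward-backward smoothing derivation above can be sketched end to end on a tiny two-state model. The transition and sensor numbers are illustrative assumptions (umbrella-world values), not from the slides:

```python
# Forward-backward smoothing, P(Xk | e1:t) = alpha * f_{1:k} * b_{k+1:t},
# on a two-state model with assumed illustrative parameters.
T = [[0.7, 0.3],          # T[i][j] = P(X_t = j | X_{t-1} = i)
     [0.3, 0.7]]

def O(e):                 # sensor model P(e | X) as a diagonal
    return [0.9, 0.2] if e else [0.1, 0.8]

def normalize(v):
    s = sum(v)
    return [x / s for x in v]

def forward(f, e):
    # f_{1:t} = alpha * O_t * T^T * f_{1:t-1}
    pred = [sum(T[i][j] * f[i] for i in range(2)) for j in range(2)]
    return normalize([O(e)[j] * pred[j] for j in range(2)])

def backward(b, e):
    # b_{k:t}(i) = sum_j P(e_k | x_j) * b_{k+1:t}(j) * P(x_j | x_i)
    return [sum(T[i][j] * O(e)[j] * b[j] for j in range(2))
            for i in range(2)]

def smooth(prior, evidence):
    # Forward pass: store the filtered messages f_{1:k}
    fs, f = [], prior
    for e in evidence:
        f = forward(f, e)
        fs.append(f)
    # Backward pass: combine each f_{1:k} with the backward message b_{k+1:t}
    b, sm = [1.0, 1.0], [None] * len(evidence)
    for k in range(len(evidence) - 1, -1, -1):
        sm[k] = normalize([fs[k][i] * b[i] for i in range(2)])
        b = backward(b, evidence[k])
    return sm

print(smooth([0.5, 0.5], [True, True]))
```

Note how the smoothed estimate for step 1 differs from the filtered one: evidence from step 2 flows back through bk+1:t.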
SLIDE 4 Hidden Markov Models
- Dominant method for automatic speech recognition
- Temporal probabilistic model in which the state of the process is described by a single discrete random variable
- Simple and elegant matrix implementation of all the basic algorithms
SLIDE 5 HMM segmentation
[Figure: HMM segmentation of an observed signal into hidden states over time t (Aucouturier & Sandler, AES 01)]
SLIDE 6 Kalman Filter
Modelling systems described by a set of continuous variables
Tracking a bird flying: Xt = (X, Y, Z, dX, dY, dZ); airplanes, robots, ecosystems...
Gaussian prior, linear Gaussian transition model and sensor model
i.e. the next state is a linear function of the current state plus some Gaussian noise
Key property: the linear Gaussian family remains closed under the standard Bayesian network operations
SLIDE 7
Kalman Filter
Position, velocity, and a position measurement
Basically we propagate forward messages (mean vector + covariance matrix) to produce a new message (mean vector + covariance matrix)
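The message-passing view above reduces, in one dimension, to a pair of scalar updates on the mean and variance. A minimal sketch, assuming a random-walk transition model and made-up noise variances:

```python
# 1-D Kalman filter sketch: track a scalar position under Gaussian noise.
# The process/measurement variances q and r below are illustrative assumptions.
def kalman_update(mu, var, z, q=1.0, r=1.0):
    """One predict + correct step for a 1-D random-walk model.

    mu, var : current Gaussian belief N(mu, var)
    z       : new noisy measurement
    q, r    : process and measurement noise variances (assumed)
    """
    # Predict: the state is expected to stay put; uncertainty grows by q
    mu_pred, var_pred = mu, var + q
    # Correct: blend prediction and measurement using the Kalman gain
    k = var_pred / (var_pred + r)
    mu_new = mu_pred + k * (z - mu_pred)
    var_new = (1 - k) * var_pred
    return mu_new, var_new

mu, var = 0.0, 1000.0            # vague Gaussian prior
for z in [1.2, 0.9, 1.1, 1.0]:   # noisy position readings
    mu, var = kalman_update(mu, var, z)
print(mu, var)                   # mean settles near 1.0, variance shrinks
```

The full filter generalizes these two lines to vectors and matrices, but the message passed between steps is still just a mean and a covariance.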
SLIDE 8
Kalman filtering
SLIDE 9
Kalman Smoothing
SLIDE 10
Outline
Learning agents
Inductive learning
Decision tree learning
Measuring learning performance
SLIDE 11
Learning
Learning is essential for unknown environments
The designer lacks omniscience
Learning is useful as a system construction method
Expose the agent to reality rather than trying to write it all down
Learning modifies the agent's decision-making mechanisms to improve performance
SLIDE 12
Learning Agents
SLIDE 13
Learning Element
Design is dictated by:
Type of performance element
Functional component to be learned
How the functional component is represented
Type of feedback
SLIDE 14
Example scenarios
Supervised learning: correct answers for each training instance
Unsupervised learning: no feedback
Reinforcement learning: occasional rewards
Other terms: classification, regression, clustering, online, offline
SLIDE 15
Inductive Learning
Simplest form: “learn” a function from examples (a training set)
An example is a pair (x, f(x))
Find a hypothesis h such that h(x) ≈ f(x), given a training set of examples
Highly simplified. Why?
SLIDE 16
Inductive Learning
Learning a function is simplified:
Ignores prior knowledge
Assumes a deterministic, observable environment
Assumes examples are given
Assumes that the agent wants to learn f – why?
SLIDE 17
Inductive Learning Method
Construct/adjust h to agree with f on the training set, e.g. curve fitting
SLIDE 18
Inductive Learning Method
Construct/adjust h to agree with f on the training set, e.g. curve fitting
SLIDE 19
Inductive Learning Method
Construct/adjust h to agree with f on the training set, e.g. curve fitting
CONSISTENT HYPOTHESIS
SLIDE 20
Inductive Learning Method
Construct/adjust h to agree with f on the training set, e.g. curve fitting
CONSISTENT HYPOTHESIS
SLIDE 21 Inductive Learning Method
Construct/adjust h to agree with f on the training set, e.g. curve fitting
CONSISTENT HYPOTHESIS
OCCAM'S RAZOR: maximize a combination of consistency and simplicity
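The curve-fitting idea on these slides can be sketched with the simplest hypothesis space, a straight line h(x) = a*x + b fit by least squares. The data points below are made-up illustrative examples:

```python
# Least-squares line fit: find h(x) = a*x + b agreeing with the
# examples (x, f(x)) as closely as possible (illustrative data).
def fit_line(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    # Closed-form least-squares slope and intercept
    a = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
    return a, my - a * mx        # slope, intercept

xs = [0, 1, 2, 3, 4]
ys = [0.1, 1.9, 4.1, 5.9, 8.1]   # noisy samples of roughly f(x) = 2x
a, b = fit_line(xs, ys)
h = lambda x: a * x + b          # the learned hypothesis
print(a, b, h(5))
```

A degree-4 polynomial would pass through all five points exactly (a consistent hypothesis), but Occam's razor favors the nearly-consistent line: it is far simpler and more likely to generalize to new x.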