Larry Holder, School of EECS, Washington State University: Artificial Intelligence (PowerPoint Presentation)
SLIDE 1

Larry Holder School of EECS Washington State University

Artificial Intelligence 1

SLIDE 2

} What is an agent?
} Rational agent
} Types of environments
} Types of agents


SLIDE 3

} Agent perceives its environment through sensors and acts on its environment through actuators
} Percepts: perceptual inputs to the agent
} Percept sequence: complete history of the agent’s percepts


SLIDE 4

} Input: Percept = [location, state]

  • Location ∈ {A, B}
  • State ∈ {Clean, Dirty}

} Return: Action ∈ {Left, Right, Suction}


Action VacuumAgent (Percept p) {
  if (p = [?, Dirty]) then return Suction
  if (p = [A, Clean]) then return Right
  if (p = [B, Clean]) then return Left
}

[Figures: Vacuum World, Vacuum Agent]
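The vacuum-agent pseudocode can be run directly as a small Python sketch (function name and tuple encoding are illustrative; the actions follow the slide):

```python
def vacuum_agent(percept):
    """Simple reflex vacuum agent: percept is a (location, state) pair."""
    location, state = percept
    if state == "Dirty":
        return "Suction"   # clean the current square regardless of location
    if location == "A":
        return "Right"     # A is clean, so move to B
    return "Left"          # B is clean, so move to A
```

For example, `vacuum_agent(("A", "Dirty"))` returns `"Suction"`.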

SLIDE 5

} Rational Agent takes actions that maximize the performance measure given the percept sequence and any prior knowledge

  • Performance measures?
  • Prior knowledge?

} Is VacuumAgent rational?

SLIDE 6


CES 2019: www.youtube.com/watch?v=gfWjsKsEry0

SLIDE 7

} Should a rational agent:

  • Know everything?
  • Explore?
  • Learn?


SLIDE 8

} PEAS

  • Performance
  • Environment
  • Actuators
  • Sensors


Agent Type | Performance | Environment | Actuators | Sensors
Self-driving car | | | |
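The table row for the self-driving car is left blank on the slide as an exercise; a PEAS description can be recorded as a simple data structure. The field values below are typical textbook answers, not taken from the slides:

```python
from dataclasses import dataclass

@dataclass
class PEAS:
    """PEAS task description: Performance, Environment, Actuators, Sensors."""
    agent_type: str
    performance: list
    environment: list
    actuators: list
    sensors: list

# Illustrative entries for the self-driving car row (assumed, not from the slides)
car = PEAS(
    agent_type="Self-driving car",
    performance=["safety", "speed", "legality", "comfort"],
    environment=["roads", "traffic", "pedestrians", "weather"],
    actuators=["steering", "accelerator", "brake", "turn signals"],
    sensors=["cameras", "lidar", "GPS", "speedometer"],
)
```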

SLIDE 9

Agent Type | Performance | Environment | Actuators | Sensors
Puzzle solver | | | |
Part picker | | | |

SLIDE 10

} Fully observable vs. partially observable
  • Do sensors give the complete state of the environment?
} Single agent vs. multiagent
  • Are there other agents in the environment whose performance is affected by this agent?
} Puzzle solver?
} Part picker?


SLIDE 11

} Deterministic vs. stochastic
  • Is the next state of the environment completely determined by the current state and the agent’s action?
} Episodic vs. sequential
  • Future percepts and actions do not depend on past percepts and actions
} Puzzle solver?
} Part picker?


SLIDE 12

} Static vs. dynamic
  • Can the environment change while the agent is deliberating?
} Discrete vs. continuous
  • Is there a fixed number of environment states?
} Known vs. unknown
  • Are the effects of actions known?
} Puzzle solver?
} Part picker?
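The slides pose the puzzle solver and part picker as exercises. One plausible classification along all seven dimensions can be written as a simple record; the specific answers below are an assumed reading in the style of standard AI textbooks, not given on the slides:

```python
# Assumed classifications for the two example tasks (illustrative only):
# a puzzle solver sees the whole board, while a robotic part picker
# works from noisy sensors in a changing, continuous workspace.
environments = {
    "puzzle solver": {
        "observable": "fully", "agents": "single", "dynamics": "deterministic",
        "episodes": "sequential", "change": "static", "states": "discrete",
        "knowledge": "known",
    },
    "part picker": {
        "observable": "partially", "agents": "single", "dynamics": "stochastic",
        "episodes": "episodic", "change": "dynamic", "states": "continuous",
        "knowledge": "known",
    },
}
```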


SLIDE 13

} Hunt the Wumpus game

  • Written in BASIC, 1972
  • First available on the TI-99/4A
SLIDE 14

} Performance measure
  • +1000 for leaving cave with gold
  • -1000 for falling in pit or being eaten by wumpus
  • -1 for each action taken
  • -10 for using the arrow
  • Game ends when agent dies or leaves cave
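The performance measure above can be sketched as a scoring function (names are illustrative; it assumes the -10 arrow penalty is in addition to the arrow shot's -1 action cost):

```python
def wumpus_score(num_actions, used_arrow, outcome):
    """Score one episode under the slide's performance measure.
    outcome is one of 'left_with_gold', 'died', 'left_without_gold'
    (labels are illustrative)."""
    score = -num_actions        # -1 for each action taken
    if used_arrow:
        score -= 10             # -10 for using the arrow (assumed additional)
    if outcome == "left_with_gold":
        score += 1000           # +1000 for leaving the cave with the gold
    elif outcome == "died":
        score -= 1000           # -1000 for a pit or the wumpus
    return score
```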


SLIDE 15

} Environment
  • 4x4 grid of rooms
  • Agent starts in square [1,1] facing right with 1 arrow
  • Locations of wumpus and gold chosen at random, other than [1,1]
  • Each square other than [1,1] has a 0.2 probability of containing a pit
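These environment rules translate directly into a small world generator (a sketch; the dictionary layout and function name are illustrative):

```python
import random

def make_wumpus_world(pit_prob=0.2, seed=None):
    """Randomly generate a 4x4 wumpus world per the slide's rules:
    wumpus, gold, and pits are never placed in the start square [1,1]."""
    rng = random.Random(seed)
    squares = [(x, y) for x in range(1, 5) for y in range(1, 5)]
    candidates = [s for s in squares if s != (1, 1)]
    return {
        "wumpus": rng.choice(candidates),
        "gold": rng.choice(candidates),
        "pits": {s for s in candidates if rng.random() < pit_prob},
    }
```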

SLIDE 16

} Actuators
  • Forward
  • TurnLeft by 90°
  • TurnRight by 90°
  • Grab picks up gold if agent in gold location
  • Shoot shoots arrow in direction agent is facing
    – Arrow continues until it hits wumpus or wall
  • Climb leaves cave if agent in [1,1]
SLIDE 17

} Sensors (Boolean)
  • Stench if wumpus in directly (not diagonally) adjacent square
  • Breeze if pit in directly adjacent square
  • Glitter if gold in agent’s current square
  • Bump if agent walks into a wall
  • Scream if wumpus is killed
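The five Boolean sensors can be computed from a world description like the one on the Environment slide (a sketch; the world dictionary keys and the `bumped`/`wumpus_killed` flags are assumed to be tracked by the simulator):

```python
def adjacent(a, b):
    """Directly (not diagonally) adjacent grid squares."""
    (ax, ay), (bx, by) = a, b
    return abs(ax - bx) + abs(ay - by) == 1

def percepts(world, agent_square, bumped=False, wumpus_killed=False):
    """Boolean percept vector for the agent's current square."""
    return {
        "Stench": adjacent(world["wumpus"], agent_square),
        "Breeze": any(adjacent(p, agent_square) for p in world["pits"]),
        "Glitter": world["gold"] == agent_square,
        "Bump": bumped,
        "Scream": wumpus_killed,
    }
```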


SLIDE 18

} Fully or partially observable?
} Discrete or continuous?
} Static or dynamic?
} Deterministic or stochastic?
} Single or multi-agent?
} Episodic or sequential?
} Known or unknown?


SLIDE 19

} Details of design based on task (PEAS) and properties of environment


Action Agent (Percept percept) {
  Process percept
  Choose action
  return action
}

SLIDE 20

} Table: Percepts → Actions
} Where does table come from?
} How large is table?


Action TableDrivenAgent (Percept percept) {
  PerceptSequence percepts
  Table T
  Append percept to end of percepts
  action = Lookup (percepts, T)
  return action
}
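This pseudocode can be sketched in Python with a closure holding the growing percept sequence (names are illustrative):

```python
def make_table_driven_agent(table):
    """Table-driven agent: the table maps entire percept *sequences*,
    not single percepts, to actions."""
    percept_sequence = []
    def agent(percept):
        percept_sequence.append(percept)          # append to the history
        return table[tuple(percept_sequence)]     # look up the whole sequence
    return agent
```

The lookup key is the whole history, which is why the table explodes: with P possible percepts, handling sequences up to length T needs on the order of P^T entries.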

SLIDE 21

} Where do rules come from?
} Random component to avoid repetitive behavior


Action SimpleReflexAgent (Percept percept) {
  RuleSet rules
  state = InterpretInput (percept)
  rule = RuleMatch (state, rules)
  action = rule.action
  return action
}
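A Python sketch of the same idea, including the random component mentioned above (the rule representation is illustrative: each interpreted state maps to a list of candidate actions):

```python
import random

def make_simple_reflex_agent(rules, interpret_input):
    """Simple reflex agent: condition-action rules applied to the current
    percept only; a random choice among matching actions avoids
    repetitive behavior."""
    def agent(percept):
        state = interpret_input(percept)
        return random.choice(rules[state])
    return agent
```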

SLIDE 22

} Model describes how world evolves and effects of actions
} Where do model and rules come from?
} How to represent state and model?


Action ModelBasedReflexAgent (Percept percept) {
  RuleSet rules
  Model model
  state = UpdateState (state, action, percept, model)
  rule = RuleMatch (state, rules)
  action = rule.action
  return action
}
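In the pseudocode, `action` inside UpdateState refers to the *previous* action. A Python sketch makes that state-keeping explicit (names are illustrative; the model is folded into the `update_state` function):

```python
def make_model_based_agent(rules, update_state, initial_state):
    """Model-based reflex agent: maintains internal state, updated from the
    previous state, the last action, and the new percept."""
    state = initial_state
    last_action = None
    def agent(percept):
        nonlocal state, last_action
        state = update_state(state, last_action, percept)  # apply the model
        last_action = rules[state]                         # rule match
        return last_action
    return agent
```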

SLIDE 23

} Search for sequence of actions to achieve goals
} Model, state, goals
  • Source?
  • Representation?
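The core of a goal-based agent is the search for an action sequence. A minimal sketch using breadth-first search (one of several applicable search strategies; function names are illustrative, and `successors(state)` is assumed to yield `(action, next_state)` pairs):

```python
from collections import deque

def plan_to_goal(start, goal, successors):
    """Breadth-first search for a shortest action sequence from start
    to goal; returns None if the goal is unreachable."""
    frontier = deque([(start, [])])
    visited = {start}
    while frontier:
        state, actions = frontier.popleft()
        if state == goal:
            return actions
        for action, nxt in successors(state):
            if nxt not in visited:
                visited.add(nxt)
                frontier.append((nxt, actions + [action]))
    return None
```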


SLIDE 24

} Search for sequence of actions to reach a high-utility state
} Maximize expected utility
} Model, state, utility
  • Source?
  • Representation?
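The "maximize expected utility" step can be sketched for a single decision (names are illustrative; `transition_model(state, action)` is assumed to return `(probability, next_state)` pairs, so stochastic outcomes are handled):

```python
def choose_action(state, actions, transition_model, utility):
    """Utility-based choice: pick the action with maximum expected utility,
    averaging the utility of each possible next state by its probability."""
    def expected_utility(action):
        return sum(p * utility(s) for p, s in transition_model(state, action))
    return max(actions, key=expected_utility)
```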


SLIDE 25

} Learning element changes agent to improve performance
  • Models, rules, goals
} Performance element
  • One of previous agents
} Critic provides feedback on agent’s performance
} Problem generator drives agent to explore


SLIDE 26

} Rational agent seeks to maximize performance
} Agent’s task defined in terms of Performance, Environment, Actuators and Sensors
} Agent’s environment defined in terms of multiple dimensions (observability, …)
} Agent’s type defined in terms of reflexes, rules, models, goals and/or utilities
} All agents can benefit from learning
