Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline - - PowerPoint PPT Presentation

intelligent agents
SMART_READER_LITE
LIVE PREVIEW

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline - - PowerPoint PPT Presentation

Intelligent Agents Chapter 2 Intelligent Agents p.1/25 Outline Agents and environments Rationality PEAS (Performance measure, Environment, Actuators, Sensors) Environment types Agent types Intelligent Agents p.2/25


slide-1
SLIDE 1

Intelligent Agents

Chapter 2

Intelligent Agents – p.1/25

slide-2
SLIDE 2

Outline

Agents and environments Rationality PEAS (Performance measure, Environment, Actuators, Sensors) Environment types Agent types

Intelligent Agents – p.2/25

slide-3
SLIDE 3

Agents and environments

Agent

Sensors Actuators

Environment

Percepts Actions

? Agents include humans, robots, softbots, thermostats, etc. The agent function maps from percept histories to actions:

✂✁ ✄ ☎ ✆

The agent program runs on the physical architecture to produce

  • Intelligent Agents – p.3/25
slide-4
SLIDE 4

Vacuum-cleaner world

A B Percepts: location and contents, e.g.,

✂ ✄ ☎✝✆ ✞✠✟ ✡

Actions:

☛✌☞

,

✍ ☎✝✎ ✏ ✞

,

✑✓✒ ✔ ✕

,

✖✌✗ ✘✓✙

Intelligent Agents – p.4/25

slide-5
SLIDE 5

A vacuum-cleaner agent

Percept sequence Action

☞ ✂✄ ✡ ✍ ☎✝✎ ✏ ✞
✂ ✄ ☎✝✆ ✞✠✟ ✡ ✑✓✒ ✔ ✕
☞ ✂✄ ✡ ☛ ☞
✂ ✄ ☎ ✆ ✞ ✟ ✡ ✑✓✒ ✔ ✕
☞ ✂✄ ✡

,

☞ ✂✄ ✡ ✍ ☎✝✎ ✏ ✞
☞ ✂✄ ✡

,

✂ ✄ ☎✝✆ ✞✠✟ ✡ ✑✓✒ ✔ ✕

. . . . . . What is the right function? Can it be implemented in a small agent program?

Intelligent Agents – p.5/25

slide-6
SLIDE 6

A vacuum-cleaner agent

function REFLEX-VACUUM-AGENT [location,status] returns an action if status = Dirty then return Suck else if location = A then return Right else if location = B then return Left

What is the right function? Can it be implemented in a small agent program?

Intelligent Agents – p.6/25

slide-7
SLIDE 7

Rationality

Fixed performance measure evaluates the environment sequence

  • ne point per square cleaned up in time
  • ?
  • ne point per clean square per time step, minus
  • ne per move?

penalize for

✁ ✕

dirty squares? rational agent chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date Rational

✂☎✄
  • mniscient

Rational

✂☎✄

clairvoyant Rational

✂☎✄

successful Rational

✄ ✆

exploration, learning, autonomy

Intelligent Agents – p.7/25

slide-8
SLIDE 8

PEAS

To design a rational agent, we must specify the task environment Consider, e.g., the task of designing an automated taxi: Performance measure: Environment: Actuators: Sensors:

Intelligent Agents – p.8/25

slide-9
SLIDE 9

PEAS

To design a rational agent, we must specify the task environment Consider, e.g., the task of designing an automated taxi: Performance measure: safety, destination, profits, legality, comfort,

  • Environment: US streets/freeways, traffic,

pedestrians, weather,

  • Actuators: steering, accelerator, brake, horn,

speaker/display,

  • Sensors: video, accelerometers, gauges, engine

sensors, keyboard, GPS,

  • Intelligent Agents – p.9/25
slide-10
SLIDE 10

Internet shopping agent

Performance measure: Environment: Actuators: Sensors:

Intelligent Agents – p.10/25

slide-11
SLIDE 11

Environment types

Internet Solitaire Backgammon shopping Taxi

Observable?? Deterministic?? Episodic?? Static?? Discrete?? Single-agent?? Fully Observable: access to the complete (relevant) state of the world Partially Observable: missing information

Intelligent Agents – p.11/25

slide-12
SLIDE 12

Environment types

Internet Solitaire Backgammon shopping Taxi

Observable?? yes (?) yes no no Deterministic?? Episodic?? Static?? Discrete?? Single-agent?? Deterministic: the next state is completely determined by the current state and the action Stochastic: Changes not known Strategic: Deterministic except for the actions of the other agents

Intelligent Agents – p.12/25

slide-13
SLIDE 13

Environment types

Internet Solitaire Backgammon shopping Taxi

Observable?? yes (?) yes no no Deterministic?? yes no partly no Episodic?? Static?? Discrete?? Single-agent?? Episodic: task divided into atomic episodes Sequential: Current decision may affect all future decisions

Intelligent Agents – p.13/25

slide-14
SLIDE 14

Environment types

Internet Solitaire Backgammon shopping Taxi

Observable?? yes (?) yes no no Deterministic?? yes no partly no Episodic?? no no no no Static?? Discrete?? Single-agent?? Static: the world does not change while the agent is thinking Dynamic: changes Semidynamic: does not change but the performance is affected as time passes

Intelligent Agents – p.14/25

slide-15
SLIDE 15

Environment types

Internet Solitaire Backgammon shopping Taxi

Observable?? yes (?) yes no no Deterministic?? yes no partly no Episodic?? no no no no Static?? yes semi no no Discrete?? Single-agent?? Discrete: time, percepts, and actions are discrete Continuous: time, percepts, and actions are continuous over time

Intelligent Agents – p.15/25

slide-16
SLIDE 16

Environment types

Internet Solitaire Backgammon shopping Taxi

Observable?? yes (?) yes no no Deterministic?? yes no partly no Episodic?? no no no no Static?? yes semi no no Discrete?? yes yes yes no Single-agent?? Single-agent: one agent Multi-agent: competitive or cooperating agents

Intelligent Agents – p.16/25

slide-17
SLIDE 17

Environment types

Internet Solitaire Backgammon shopping Taxi

Observable?? yes (?) yes no no Deterministic?? yes no partly no Episodic?? no no no no Static?? yes semi no no Discrete?? yes yes yes no Single-agent?? yes no yes (?) no The environment type largely determines the agent design The real world is (of course) partially observable, stochastic, sequential, dynamic, continuous, multi-agent

Intelligent Agents – p.17/25

slide-18
SLIDE 18

Agent types

Four basic types in order of increasing generality: simple reflex agents reflex agents with state goal-based agents utility-based agents All these can be turned into learning agents

Intelligent Agents – p.18/25

slide-19
SLIDE 19

Simple reflex agents

Agent Environment

Sensors What action I should do now Condition-action rules Actuators What the world is like now

Intelligent Agents – p.19/25

slide-20
SLIDE 20

Reflex agents with state

Agent Environment

Sensors State How the world evolves What my actions do Condition-action rules Actuators What the world is like now What action I should do now

Intelligent Agents – p.20/25

slide-21
SLIDE 21

Goal-based agents

Agent Environment

Sensors What action I should do now State How the world evolves What my actions do Actuators What the world is like now What it will be like if I do action A Goals

Intelligent Agents – p.21/25

slide-22
SLIDE 22

Utility-based agents

Agent Environment

Sensors How happy I will be in such a state State How the world evolves What my actions do Utility Actuators

  • What action I

should do now What it will be like if I do action A What the world is like now

Intelligent Agents – p.22/25

slide-23
SLIDE 23

Learning agents

Performance standard

Agent Environment

Sensors Performance element changes knowledge learning goals Problem generator feedback Learning element Critic Actuators

  • Intelligent Agents – p.23/25
slide-24
SLIDE 24

AIMA code

The code for each topic is divided into four directories: agents: code defining agent types and programs algorithms: code for the methods used by the agent programs environments: code defining environment types, simulations domains: problem types and instances for input to algorithms Often run algorithms on domains rather than agents in environments.

Intelligent Agents – p.24/25

slide-25
SLIDE 25

AIMA code

located in:/classes/cs5811/common/aima-code/ (setq joe (make-agent :name ’joe :body (make-agent-body) :program (make-dumb-agent-program))) (defun make-dumb-agent-program () (let ((memory nil)) #’(lambda (percept) (push percept memory) ’no-op)))

Intelligent Agents – p.25/25