1
20070308 chap2 1
Chapter2
Intelligent Agents
20070308 chap2
Chapter2 Intelligent Agents 20070308 chap2 1 What Is An Agent - - PDF document
Chapter2 Intelligent Agents 20070308 chap2 1 What Is An Agent ? 2 20070308 chap2 1 What Is An Agent ? (cont.-1) An agent interacts with its environments through sensors and actuators. Perceiving through sensors - human agent:
20070308 chap2 1
20070308 chap2
20070308 chap2
network packets, etc.
sending network packets, etc.
20070308 chap2
20070308 chap2
20070308 chap2
What is the right function? Can it be implemented in a small agent program?
20070308 chap2
the cleanness of the floor the amount of dirt cleaned up the amount of electricity consumed the amount of noise generated total time and effort spent performance over a short/long time
20070308 chap2
20070308 chap2
e.g. crossing the road and be flattened
Rationality maximizes expected performance, while perfection maximizes actual performance.
e.g. lowly dung beetles, female sphex wasp Information gathering/exploration -- To maximize future rewards Learn from percepts -- To extend prior knowledge Agent autonomy -- To compensate for incorrect/partial prior knowledge
20070308 chap2
20070308 chap2
Performance measure?? Healthy patients, minimize costs Environment?? Patient, hospital, staff Actuators?? Display questions, tests, treatments, diagnoses, referrals Sensors?? Keyboard entry of symptoms, findings, patient’s answer
20070308 chap2
Single-agent?? Discrete?? Static?? Episodic?? Deterministic?? Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
20070308 chap2
Single-agent?? Discrete?? Static?? Episodic?? Deterministic?? Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Fully vs. partially observable: an environment is full observable when the
sensors can detect all aspects that are relevant to the choice of action.
20070308 chap2
Single-agent?? Discrete?? Static?? Episodic?? Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Fully vs. partially observable: an environment is full observable when the
sensors can detect all aspects that are relevant to the choice of action.
20070308 chap2
Single-agent?? Discrete?? Static?? Episodic?? Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Deterministic vs. stochastic: if the next environment state is completely
determined by the current state the executed action then the environment is
environment is strategic.
20070308 chap2
Single-agent?? Discrete?? Static?? Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Deterministic vs. stochastic: if the next environment state is completely
determined by the current state the executed action then the environment is
environment is strategic.
20070308 chap2
Single-agent?? Discrete?? Static?? Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Episodic vs. sequential: In an episodic environment the agent’s experience
can be divided into atomic steps where the agents perceives and then performs a single action. The choice of action depends only on the episode itself .
20070308 chap2
Single-agent?? Discrete?? Static?? NO NO NO NO Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Episodic vs. sequential: In an episodic environment the agent’s experience
can be divided into atomic steps where the agents perceives and then performs a single action. The choice of action depends only on the episode itself .
20070308 chap2
Single-agent?? Discrete?? Static?? NO NO NO NO Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Static vs. dynamic: If the environment can change while the agent is choosing
an action, the environment is dynamic. Semi-dynamic if the agent’s performance changes even when the environment remains the same.
20070308 chap2
Single-agent?? Discrete?? NO SEMI YES YES Static?? NO NO NO NO Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Static vs. dynamic: If the environment can change while the agent is choosing
an action, the environment is dynamic. Semi-dynamic if the agent’s performance changes even when the environment remains the same.
20070308 chap2
Single-agent?? Discrete?? NO SEMI YES YES Static?? NO NO NO NO Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Discrete vs. continuous: This distinction can be applied to the state of the environment, the way time is handled and to the percepts/actions of the agent.
20070308 chap2
Single-agent?? NO YES YES YES Discrete?? NO SEMI YES YES Static?? NO NO NO NO Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Discrete vs. continuous: This distinction can be applied to the state of the environment, the way time is handled and to the percepts/actions of the agent.
20070308 chap2
Single-agent?? NO YES YES YES Discrete?? NO SEMI YES YES Static?? NO NO NO NO Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Single vs. multi-agent: Does the environment contain other agents who
are also maximizing some performance measure that depends on the current agent’s actions?
20070308 chap2
NO NO NO YES Single-agent?? NO YES YES YES Discrete?? NO SEMI YES YES Static?? NO NO NO NO Episodic?? NO YES NO YES Deterministic?? PARTIAL PARTIAL FULL FULL Observable?? Taxi Internet shopping Backgammom
(西洋雙陸棋戲)
Solitaire
(接龍)
Single vs. multi-agent: Does the environment contain other agents who
are also maximizing some performance measure that depends on the current agent’s actions?
20070308 chap2
20070308 chap2
implements the agent function mapping from percepts to actions input from the sensors: current percepts return: an action to the actuators
runs the program on some sort of computing device with physical sensors and actuators
20070308 chap2
20070308 chap2
20070308 chap2
20070308 chap2
Reduction from 4T to 4 entries
function SIMPLE-REFLEX-AGENT(percept) returns an action static: rules, a set of condition-action rules state ← INTERPRET-INPUT(percept) rule ← RULE-MATCH(state, rule) action ← RULE-ACTION[rule] return action
20070308 chap2
20070308 chap2
function REFLEX-AGENT-WITH-STATE(percept) returns an action static: rules, a set of condition-action rules state, a description of the current world state action, the most recent action. state ← UPDATE-STATE(state, action, percept) rule ← RULE-MATCH(state, rule) action ← RULE-ACTION[rule] return action
20070308 chap2
and can be manipulated.
20070308 chap2
20070308 chap2
20070308 chap2
20070308 chap2
performance standard.