Preferences

Given states (prizes) $A$ and $B$ , an Agent can express preferences of the form

For deterministic environments we know all possible states so we can always state preferences in the following way.

$A$ is preffered over $B$ $A ≻ B$ $A$ and $B$ are the same $A \sim B$ $B$ is not preferred over $A$ but it would not matter when $A$ is preffered $A ⪰ B$

In a Stochastic Environment however we do not have full information about the states. So we use the states Priors in a lottery. $[p_{1}, A_{1}; \dots; p_{n}, A_{n}] .$

A set of preferences is called rational iff the following constraints hold:

Orderability

$A ≻ B \lor B ≻ A \lor A \sim B$

Transitivity

$A ≻ B \land B ≻ C \Rightarrow A ≻ C$ also see Transitivity

Continuity

$A ≻ B ≻ C \Rightarrow (\exists p \cdot [p, A; 1 - p, C] \sim B)$

Substitutability

$A \sim B \Rightarrow [p, A; 1 - p, C] \sim [p, B; 1 - p, C]$

Monotonicity

$A ≻ B \Rightarrow (p > q) \Leftrightarrow [p, A; 1 - p, B] ≻ [q, A; 1 - q, B]$

Decomposability

Lotteries in a lottery can be decomposed into a big lottery via their probabilities.

$[p, A; 1 - p, [q, B; 1 - q, C]] \sim [p, A; ((1 - p) q), B; ((1 - p) (1 - q)), C]$

For all constraints we could construct an example where an agent would act irrational if the constraint would not hold.

Also see Preferences on Reward Sequences for MDP.

Marcs Notes

Explorer

Preferences

Preferences

Graphansicht

Backlinks