Preferences

Given states (prizes) and , an Agent can express preferences of the form

For deterministic environments we know all possible states so we can always state preferences in the following way.

is preffered over and are the same is not preferred over but it would not matter when is preffered

In a Stochastic Environment however we do not have full information about the states. So we use the states Priors in a lottery.

A set of preferences is called rational iff the following constraints hold:

Orderability

Transitivity

also see Transitivity

Continuity

Substitutability

Monotonicity

Decomposability

Lotteries in a lottery can be decomposed into a big lottery via their probabilities.

For all constraints we could construct an example where an agent would act irrational if the constraint would not hold.

Also see Preferences on Reward Sequences for MDP.