Preferences
Given states (prizes) and , an Agent can express preferences of the form
For deterministic environments we know all possible states so we can always state preferences in the following way.
is preffered over and are the same is not preferred over but it would not matter when is preffered
In a Stochastic Environment however we do not have full information about the states. So we use the states Priors in a lottery.
A set of preferences is called rational iff the following constraints hold:
Orderability
Transitivity
also see Transitivity
Continuity
Substitutability
Monotonicity
Decomposability
Lotteries in a lottery can be decomposed into a big lottery via their probabilities.
For all constraints we could construct an example where an agent would act irrational if the constraint would not hold.
Also see Preferences on Reward Sequences for MDP.