Bellman Equation

In other words:

The utility in state is the current reward in state plus times the expected reward of the best possible action.