Bellman Equation

$U(s) = R(s) + \gamma \max_{a \in A(s)} \sum_{s'} P(s' \mid s, a) \, U(s')$

In other words:

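The utility of state $s$ is the immediate reward $R(s)$ plus $\gamma$ times the expected utility of the next state $s'$, assuming the agent takes the action $a$ that maximizes that expectation. Here $\gamma$ is the discount factor and $P(s' \mid s, a)$ is the probability of ending up in $s'$ after taking action $a$ in state $s$.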
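
To make the equation concrete, here is a minimal Python sketch that applies its right-hand side repeatedly until the utilities stop changing (value iteration). The three-state MDP, the names `bellman_backup` and `value_iteration`, and the convention that a state with no actions has utility $R(s)$ are all assumptions made for illustration; none of them come from the text above.

```python
from typing import Dict, List, Tuple

# Hypothetical toy MDP, invented for illustration only.
# P[s][a] is a list of (next_state, probability) pairs.
R: Dict[str, float] = {"a": -1.0, "b": -1.0, "goal": 10.0}
P: Dict[str, Dict[str, List[Tuple[str, float]]]] = {
    "a": {"go": [("b", 1.0)], "stay": [("a", 1.0)]},
    "b": {"go": [("goal", 0.8), ("a", 0.2)]},
    "goal": {},  # no actions available: treated as terminal
}
gamma = 0.9  # discount factor


def bellman_backup(s, U, R, P, gamma):
    """Evaluate the right-hand side of the Bellman equation for state s."""
    if not P[s]:
        # Assumption: a state with no actions has utility R(s).
        return R[s]
    # Expected utility of the successor state, maximized over actions.
    best = max(
        sum(prob * U[s_next] for s_next, prob in outcomes)
        for outcomes in P[s].values()
    )
    return R[s] + gamma * best


def value_iteration(R, P, gamma, tol=1e-9, max_iters=10_000):
    """Repeat the backup for every state until the largest change is below tol."""
    U = {s: 0.0 for s in R}
    for _ in range(max_iters):
        U_new = {s: bellman_backup(s, U, R, P, gamma) for s in R}
        if max(abs(U_new[s] - U[s]) for s in R) < tol:
            return U_new
        U = U_new
    return U


U = value_iteration(R, P, gamma)
for state, utility in U.items():
    print(f"U({state}) = {utility:.3f}")
```

Once the loop has converged, every `U[s]` equals `bellman_backup(s, U, R, P, gamma)` up to the tolerance, which is exactly what the Bellman equation states.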