Marcs Notes

❯

❯

❯

Rational Agents

❯

Optimal Policy

09. Juni 20261 Min. Lesezeit

Optimal Policy

A policy is a mapping from states to actions. It will tell the Agent what Action to do in each state that it can be in.

An optimal policy maximizes the expected sum of rewards.

Graphansicht

Backlinks

Policy Iteration Algorithm
Value Iteration Algorithm
Markov Decision Process
Partially Observable MDP

Erstellt mit Quartz v4.5.2 © 2026

GitHub