Perplexity
An Intrinsic Evaluation method that answers:
How uncertain is a model about the predictions it makes? Models with low uncertainty (high confidence) tend to be more accurate.
Surprisal → Entropy → Perplexity
The model is “as confused” as if it had to choose uniformly at random between that many units (characters or words). This also means that the worst-case scenario is bounded by the Branching Factor (the vocabulary size).
Resources
Artificial Intelligence 2
The perplexity of a sequence W = w_1 w_2 … w_N is

PP(W) = P(w_1 w_2 … w_N)^(−1/N)
Intuition The reciprocal of the sequence probability, normalized by sequence length (a geometric mean of the per-token reciprocal probabilities).
For a language with n characters or words and a language model that predicts that all are equally likely, the perplexity of any sequence is n. If some characters or words are more likely than others, and the model reflects that, then the perplexity of correct sequences will be less than n.
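The uniform-model claim above can be checked numerically. The sketch below (an illustration, not from the course materials) computes perplexity from the per-token probabilities a model assigns to a sequence, working in log space for numerical stability:

```python
import math

def perplexity(token_probs):
    """Perplexity of a sequence, given the model's probability for each
    actual token: the geometric mean of the reciprocal probabilities,
    i.e. (prod p_i)^(-1/N), computed in log space."""
    n = len(token_probs)
    return math.exp(-sum(math.log(p) for p in token_probs) / n)

# A uniform model over a 10-symbol vocabulary assigns p = 0.1 to every
# token, so the perplexity of ANY sequence equals the branching factor:
print(perplexity([0.1] * 5))  # → 10.0 (up to floating-point rounding)

# A model that assigns higher probability to the tokens that actually
# occur is less "confused": its perplexity drops below the branching factor.
print(perplexity([0.5, 0.4, 0.3, 0.5, 0.2]))
```

Note that perplexity only rewards a non-uniform model when its preferences match the data: assigning low probability to the tokens that actually occur drives perplexity above the branching factor.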