Typical Sampling Filter

Implement the locally typical filter from Meister et al. 2022.

Signature: def typical_sample_filter(probs: np.ndarray, mass: float) -> np.ndarray

Compute the entropy H = -sum(p * log(p)) (use a small epsilon to avoid log(0))
For each token, compute the typicality score |−log(p_i) − H| — how far its surprisal is from the average surprisal
Sort tokens by typicality score ascending (most typical first)
Walk that order, accumulating probability mass; always include the most-typical token, then continue including tokens until the cumulative mass first reaches or exceeds mass
Zero out the rest and renormalize

Return the new probability vector.

Math

score_{i} = - lo g p_{i} - H (p), H (p) = - j \sum p_{j} lo g p_{j}

Asked at

Implement the locally typical filter from Meister et al. 2022.

Signature: def typical_sample_filter(probs: np.ndarray, mass: float) -> np.ndarray

Compute the entropy H = -sum(p * log(p)) (use a small epsilon to avoid log(0))
For each token, compute the typicality score |−log(p_i) − H| — how far its surprisal is from the average surprisal
Sort tokens by typicality score ascending (most typical first)
Walk that order, accumulating probability mass; always include the most-typical token, then continue including tokens until the cumulative mass first reaches or exceeds mass
Zero out the rest and renormalize

Return the new probability vector.

Math

score_{i} = - lo g p_{i} - H (p), H (p) = - j \sum p_{j} lo g p_{j}

Asked at

181. Typical Sampling Filter