What is M‑Stack?

A four‑layer reliability architecture — Monitor → Evaluate → Control → Learn — that supervises generation, scores quality and grounding, chooses actions (verify, revise, abstain, emit), and adapts its thresholds over time.

Monitor

Entropy, disagreement, and contradiction checks identify risky outputs before they escape.

Evaluate

Judge/PRM scoring plus citation alignment to detect unsupported claims in RAG.

Control

CMDP policy enforces risk caps and budgets: verify, abstain, or emit with a rationale.

Foundation Model (Black Box) M1: Monitor Uncertainty • Contradiction M2: Evaluate Factuality • Quality M3: Control Verify • Revise • Abstain • Emit M4: Learn Calibrate • Adapt