evals.report

Vectara Hallucination Leaderboard

Measures how often LLMs introduce hallucinations when summarizing short documents. Summaries are scored by Vectara's HHEM-2.3 factual-consistency model, and results are reported as a hallucination rate.

Other | Hallucination Rate | Lower is better
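
As a rough illustration of how a hallucination rate can be derived from per-summary factual-consistency scores, here is a minimal sketch. It assumes each summary has already been scored in the range 0 to 1 (higher meaning more consistent with the source) and that a summary counts as hallucinated when its score falls below a cutoff. The 0.5 cutoff, the function name `hallucination_rate`, and the example scores are assumptions made for illustration, not values taken from this page.

```python
from typing import Sequence


def hallucination_rate(scores: Sequence[float], threshold: float = 0.5) -> float:
    """Return the percentage of summaries whose consistency score is below threshold.

    scores:    factual-consistency scores in [0, 1], one per summary (assumed input).
    threshold: cutoff below which a summary is treated as hallucinated (assumed 0.5).
    """
    if not scores:
        raise ValueError("need at least one score")
    flagged = sum(1 for s in scores if s < threshold)
    return 100.0 * flagged / len(scores)


# Made-up scores for five summaries; two fall below the 0.5 cutoff.
example_scores = [0.92, 0.31, 0.78, 0.49, 0.85]
print(f"{hallucination_rate(example_scores):.1f}%")  # prints "40.0%"
```

Because the metric counts flagged summaries, a lower percentage indicates fewer hallucinated summaries.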

No run guide for this benchmark yet.