Question 1

What is Vectara Hallucination Leaderboard?

Accepted Answer

Measures how often LLMs introduce hallucinations when summarizing short documents, scored by Vectara's HHEM-2.3 factual-consistency model, reported as a hallucination rate. It is a other benchmark measured by Hallucination Rate.

Question 2

What does Hallucination Rate mean on Vectara Hallucination Leaderboard?

Accepted Answer

Vectara Hallucination Leaderboard reports Hallucination Rate (%); lower is better. Scores are shown only within Vectara Hallucination Leaderboard and are never averaged with other benchmarks.

Question 3

What is the top reported Vectara Hallucination Leaderboard score?

Accepted Answer

Mistral Large has the top reported score on Vectara Hallucination Leaderboard: 4.5% (Hallucination Rate).

Question 4

Why do Vectara Hallucination Leaderboard scores differ across runs?

Accepted Answer

Harness, scaffold, reasoning effort, and prompt setup change results, so two runs of the same model can differ. evals.report keeps each score with its run context so the differences stay visible.

Question 5

Does evals.report rank models across benchmarks?

Accepted Answer

No. Vectara Hallucination Leaderboard scores are shown within their own metric; evals.report never combines benchmarks into a composite ranking or a single "best model".