evals.report
BenchmarksLabsCompareRun guides
BenchmarksReasoning

AA-Omniscience: Knowledge and Hallucination Benchmark

A factuality and knowledge benchmark of 6,000 questions across 42 economically relevant topics in six domains, scoring models on the AA-Omniscience Index (-100 to 100) that rewards correct answers, penalizes hallucinations, and applies no penalty for abstaining.

ReasoningAA-Omniscience IndexHigher is better

What this benchmark measures

A factuality and knowledge benchmark of 6,000 questions across 42 economically relevant topics in six domains, scoring models on the AA-Omniscience Index (-100 to 100) that rewards correct answers, penalizes hallucinations, and applies no penalty for abstaining.

Rows on this page are sourced from public benchmark artifacts, leaderboard exports, or source-linked model reports. Each row keeps benchmark version, source model name, and available run details attached to the score.

The metric shown here is AA-Omniscience Index. It should be interpreted within AA-Omniscience: Knowledge and Hallucination Benchmark, not compared as part of a site-wide ranking.

No composite ranking
evals.report never combines benchmarks. AA-Omniscience Index on AA-Omniscience: Knowledge and Hallucination Benchmark is its own number — don’t average it with other metrics.