AA-Omniscience: Knowledge and Hallucination Benchmark
A factuality and knowledge benchmark of 6,000 questions across 42 economically relevant topics in six domains. Models are scored on the AA-Omniscience Index (-100 to 100), which rewards correct answers, penalizes hallucinations, and applies no penalty for abstaining.
Reasoning | AA-Omniscience Index | Higher is better
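To make the scoring rule concrete, here is a minimal sketch of an index with the properties described above: correct answers add to the score, incorrect answers (hallucinations) subtract from it, and abstentions count as zero. The formula, the function name `omniscience_index`, and its parameters are assumptions chosen to match that description and the -100 to 100 range; the official grading may differ, for example in how partially correct answers are handled.

```python
def omniscience_index(correct: int, incorrect: int, abstained: int) -> float:
    """Hypothetical index in [-100, 100]: +1 per correct answer,
    -1 per incorrect answer, 0 per abstention, scaled by question count.

    This is an assumed formula consistent with the benchmark description,
    not a confirmed copy of the official scoring code.
    """
    total = correct + incorrect + abstained
    if total == 0:
        raise ValueError("at least one graded question is required")
    return 100.0 * (correct - incorrect) / total


if __name__ == "__main__":
    # A model that answers half correctly, a quarter incorrectly,
    # and abstains on the rest of 6,000 questions.
    print(omniscience_index(correct=3000, incorrect=1500, abstained=1500))
```

Under this assumed rule, a model that abstains on every question scores 0, while a model that guesses and is wrong more often than right scores below 0, so declining to answer is better than hallucinating.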
No run guide for this benchmark yet.