Question 1

What is AILuminate AI Safety Benchmark?

Accepted Answer

MLCommons' standardized AI safety benchmark that grades how often general-purpose chat models produce policy-violating responses across 12 hazard categories (e.g. violent crimes, CSAM, hate, self-harm, specialized advice), assigning an ordinal safety grade from Poor to Excellent relative to a sub-15B open-weight reference system. It is a other benchmark measured by Safety grade.

Question 2

What does Safety grade mean on AILuminate AI Safety Benchmark?

Accepted Answer

AILuminate AI Safety Benchmark reports Safety grade; higher is better. Scores are shown only within AILuminate AI Safety Benchmark and are never averaged with other benchmarks.

Question 3

What is the top reported AILuminate AI Safety Benchmark score?

Accepted Answer

Claude 3.5 Sonnet has the top reported score on AILuminate AI Safety Benchmark: Very Good (Safety grade).

Question 4

Why do AILuminate AI Safety Benchmark scores differ across runs?

Accepted Answer

Harness, scaffold, reasoning effort, and prompt setup change results, so two runs of the same model can differ. evals.report keeps each score with its run context so the differences stay visible.

Question 5

Does evals.report rank models across benchmarks?

Accepted Answer

No. AILuminate AI Safety Benchmark scores are shown within their own metric; evals.report never combines benchmarks into a composite ranking or a single "best model".

AILuminate AI Safety Benchmark

What this benchmark measures

Frequently asked