Question 1

What is AILuminate AI Safety Benchmark?

Accepted Answer

MLCommons' standardized AI safety benchmark that grades how often general-purpose chat models produce policy-violating responses across 12 hazard categories (e.g. violent crimes, CSAM, hate, self-harm, specialized advice), assigning an ordinal safety grade from Poor to Excellent relative to a sub-15B open-weight reference system. It is listed under the "other" benchmark category and is measured by Safety grade.

Question 2

What does Safety grade mean on AILuminate AI Safety Benchmark?

Accepted Answer

AILuminate AI Safety Benchmark reports a Safety grade, an ordinal rating that runs from Poor to Excellent; higher is better. Grades are shown only within AILuminate AI Safety Benchmark and are never averaged with other benchmarks.
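Because the grade is ordinal, it can be ordered and compared but not meaningfully averaged. The following is a minimal Python sketch of that idea, not how evals.report or MLCommons implement it. The names `SafetyGrade` and `parse_grade` are hypothetical, and the intermediate step Fair is an assumption, since this page only shows Poor, Good, Very Good, and Excellent.

```python
from enum import IntEnum


class SafetyGrade(IntEnum):
    """Ordinal safety grades, lowest to highest (hypothetical type).

    The integer values only encode order; they are not scores and
    should not be averaged. FAIR is assumed, not shown on this page.
    """
    POOR = 1
    FAIR = 2
    GOOD = 3
    VERY_GOOD = 4
    EXCELLENT = 5


def parse_grade(label: str) -> SafetyGrade:
    """Map a label such as 'Very Good' to its SafetyGrade member."""
    return SafetyGrade[label.strip().upper().replace(" ", "_")]


# Three grades copied from the results table on this page.
reported = {
    "Claude 3.5 Sonnet": "Very Good",
    "GPT-4o": "Good",
    "Mistral Large": "Good",
}

grades = {model: parse_grade(label) for model, label in reported.items()}

# Ordering is meaningful: find the highest grade and every model that holds it.
best = max(grades.values())
top_models = sorted(model for model, grade in grades.items() if grade == best)
```

Here `top_models` collects every model tied at the highest grade, which matters on an ordinal scale where several systems often share the same grade.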

Question 3

What is the top reported AILuminate AI Safety Benchmark score?

Accepted Answer

Claude 3.5 Sonnet has the top reported score on AILuminate AI Safety Benchmark: Very Good (Safety grade).

Question 4

Why do AILuminate AI Safety Benchmark scores differ across runs?

Accepted Answer

Harness, scaffold, reasoning effort, and prompt setup change results, so two runs of the same model can differ. evals.report keeps each score with its run context so the differences stay visible.
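One way to picture "a score kept with its run context" is a record that stores the grade next to the four factors named above. This is only an illustrative sketch: `RunContext`, `ScoreRecord`, `differing_fields`, and every field value are hypothetical and do not describe the actual evals.report data model.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class RunContext:
    """The run settings the answer above names (hypothetical schema)."""
    harness: str
    scaffold: str
    reasoning_effort: str
    prompt_setup: str


@dataclass(frozen=True)
class ScoreRecord:
    """A reported grade stored together with the context that produced it."""
    model: str
    benchmark: str
    grade: str
    context: RunContext


def differing_fields(a: RunContext, b: RunContext) -> list[str]:
    """Return the names of context fields whose values differ between two runs."""
    return [f.name for f in fields(a) if getattr(a, f.name) != getattr(b, f.name)]


# Two made-up runs of the same model that differ only in reasoning effort.
run_a = ScoreRecord(
    model="example-model",
    benchmark="AILuminate AI Safety Benchmark",
    grade="Good",
    context=RunContext("harness-a", "none", "default", "zero-shot"),
)
run_b = ScoreRecord(
    model="example-model",
    benchmark="AILuminate AI Safety Benchmark",
    grade="Very Good",
    context=RunContext("harness-a", "none", "high", "zero-shot"),
)

changed = differing_fields(run_a.context, run_b.context)
```

With records shaped like this, two differing grades for the same model can be traced to the specific setting that changed rather than being merged into one number.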

Question 5

Does evals.report rank models across benchmarks?

Accepted Answer

No. AILuminate AI Safety Benchmark scores are shown within their own metric; evals.report never combines benchmarks into a composite ranking or a single "best model".

Model	Lab	Score	Source model	Status	Date
Claude 3.5 Sonnet	Anthropic	Very Good	—	Verified	Jun 20, 2024
GPT-4o	OpenAI	Good	—	Verified	May 13, 2024
Gemini 1.5 Pro	Google DeepMind	Good	—	Verified	Feb 15, 2024
Gemini 2.0 Flash	Google DeepMind	Good	—	Verified	Dec 11, 2024
Llama 3.1 405B (Open)	Meta	Good	—	Verified	Jul 23, 2024
Mistral Large	Mistral AI	Good	—	Verified	Feb 26, 2024