evals.report

AILuminate AI Safety Benchmark

MLCommons' standardized AI safety benchmark. It grades how often general-purpose chat models produce policy-violating responses across 12 hazard categories (e.g. violent crimes, CSAM, hate, self-harm, specialized advice) and assigns an ordinal safety grade from Poor to Excellent, measured relative to a reference system built from open-weight models with fewer than 15B parameters.
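To make the idea of a grade "relative to a reference system" concrete, here is a minimal sketch in Python. It maps a model's violation rate to an ordinal grade by comparing it with the reference system's rate. The function name `relative_grade`, the intermediate grade labels, and every cutoff value are illustrative assumptions, not MLCommons' published thresholds or code.

```python
from enum import IntEnum


class Grade(IntEnum):
    """Ordinal safety grades; higher is better.

    Only POOR and EXCELLENT are named in the description above.
    The intermediate labels are assumptions for illustration.
    """
    POOR = 1
    FAIR = 2
    GOOD = 3
    VERY_GOOD = 4
    EXCELLENT = 5


def relative_grade(
    violation_rate: float,
    reference_rate: float,
    excellent_abs: float = 0.001,   # hypothetical absolute cutoff
    very_good_ratio: float = 0.5,   # hypothetical ratio cutoffs
    good_ratio: float = 1.5,
    fair_ratio: float = 3.0,
) -> Grade:
    """Grade a model by its violation rate relative to a reference rate.

    Both rates are fractions of responses judged policy-violating (0..1).
    All cutoffs are placeholder values chosen for this sketch.
    """
    if not 0.0 <= violation_rate <= 1.0:
        raise ValueError("violation_rate must be in [0, 1]")
    if not 0.0 < reference_rate <= 1.0:
        raise ValueError("reference_rate must be in (0, 1]")

    # A very low absolute rate earns the top grade regardless of the reference.
    if violation_rate < excellent_abs:
        return Grade.EXCELLENT

    # Otherwise the grade depends on the ratio to the reference system.
    ratio = violation_rate / reference_rate
    if ratio < very_good_ratio:
        return Grade.VERY_GOOD
    if ratio <= good_ratio:
        return Grade.GOOD
    if ratio <= fair_ratio:
        return Grade.FAIR
    return Grade.POOR


if __name__ == "__main__":
    # Made-up rates, used only to exercise the function.
    print(relative_grade(0.02, 0.02).name)
```

Because the grade is ordinal, `IntEnum` lets grades be compared directly (for example `Grade.GOOD > Grade.FAIR`), which matches the "higher is better" direction listed for this benchmark.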

Other | Safety grade | Higher is better

No run guide for this benchmark yet.