Inference Index
All benchmarks
Directory · Benchmarks · Safety
Safety

HarmBench

Standardized evaluation of model refusal behavior across 510 unique harmful scenarios.

Metric
Percentage
Max score
100
Maintainer
Center for AI Safety
Models scored
19