Inference Index
Instruction

IFEval

Google’s test of whether a model follows verifiable instructions, meaning constraints that a program can check automatically, e.g., "write exactly 3 paragraphs, each starting with the letter B."
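To make "verifiable" concrete, here is a minimal sketch of how the example instruction above could be checked in code. It is not the benchmark's own implementation: the function names `follows_instruction` and `percentage_followed` are hypothetical, the blank-line paragraph split is an assumption, and the averaging shown is only one plausible way to arrive at a percentage score.

```python
import re


def follows_instruction(response: str, n_paragraphs: int = 3, letter: str = "B") -> bool:
    """Hypothetical check: exactly n_paragraphs paragraphs, each starting with letter."""
    # Assumption: paragraphs are separated by one or more blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", response.strip()) if p.strip()]
    return len(paragraphs) == n_paragraphs and all(p.startswith(letter) for p in paragraphs)


def percentage_followed(responses: list[str]) -> float:
    """Share of responses that pass the check, on a 0 to 100 scale (assumed aggregation)."""
    if not responses:
        return 0.0
    passed = sum(follows_instruction(r) for r in responses)
    return 100.0 * passed / len(responses)


good = "Bees buzz.\n\nBirds sing.\n\nBears sleep."
bad = "Bees buzz.\n\nCats nap.\n\nBears sleep."
print(follows_instruction(good), follows_instruction(bad))
print(percentage_followed([good, bad]))
```

Because the check is a plain function of the response text, no human or model judge is needed to decide whether the instruction was followed.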

Metric: Percentage
Max score: 100
Maintainer: Google DeepMind
Models scored: 19