Inference Index
All benchmarks
Directory · Benchmarks · Safety
Safety

TruthfulQA

817 questions designed to trap models into repeating common misconceptions.

Metric
Percentage
Max score
100
Maintainer
Oxford
Models scored
19