Inference Index

←All benchmarks

Directory · Benchmarks · Reasoning

Reasoning

LiveBench

Contamination-free benchmark that refreshes its questions monthly.

Metric

Percentage

Max score

100

Maintainer

Abacus.AI

Models scored

22

Why it matters

Questions change constantly, so training on the test set becomes meaningless. One of the most trustworthy rankings today.

Model rankings

Full leaderboard →