Compare
Gemini 2.5 Flash vs GPT-5 Mini
Google DeepMind vs OpenAI. Specs, benchmarks, and real per-task cost — all in one page.
Verdict
Gemini 2.5 Flash leads on LMSYS ELO (1312 vs 1288). Gemini 2.5 Flash is ~1.7× cheaper on a 3:1 input:output blend.
Google DeepMind
Gemini 2.5 FlashGoogle’s price/performance darling. 1M context at $0.30/M input — nobody comes close on throughput-per-dollar.
Absurd cost-to-context ratioFast multimodalFree tier via AI Studio
OpenAI
GPT-5 MiniGPT-5’s smaller, cheaper sibling. Optimized for chat and lightweight agent tasks.
Very low latencyCheap enough for mass consumer apps
Pricing
Input / 1M$0.30$0.50
Output / 1M$1.20$2.00
Context1.0M256K
Max output32K16K
LicenseProprietaryProprietary
Released2026-02-052026-01-21
Benchmarks
LMSYS ELO1312.01288.0
MMLU Pro82.178.5
HumanEval85.885.5
MATH84.279.8
MMMU74.6—
IFEval86.185.4
Per-task cost
Summarize a 1-hour meeting transcript$0.0051$0.0085
Review a 500-line pull request$0.0048$0.0080
Answer a customer support ticket$0.0019$0.0031
Extract structured data from a resume$0.0027$0.0045
Debug a stack trace with context$0.0042$0.0070
Per-call cost using published token counts for each task. Real-world prompts vary.