Inference Index
All intelligence
ReleaseInvalid Date · NaNy ago

Google ships Gemini 2.5 Flash with 1M context at $0.30/M

Google’s latest cheap-tier model offers a 1M context window and full multimodal inputs at aggressive pricing.

Google’s Gemini 2.5 Flash is live on AI Studio and Vertex, with 1M-token context, native video and audio input, and a $0.30/M input price that’s only matched by open-weight models on aggressive aggregators.

For use cases that involve parsing long documents, transcribing audio, or doing lightweight analysis over video, Flash is the most obvious default in the market. Coding quality trails Sonnet by a wide margin, so this is a tool for extraction and summarization, not agentic work.

Byline

Inference Index

More release stories