⬡Inference Index/ AI model market data

Venice

AI inference provider · 70 models listed · wins best price on 0.

Models listed

70

across our index

Best price wins

0

0% of models

Avg input

$2.05

USD / 1M tokens

Avg output

$10.65

USD / 1M tokens

All models at Venice

Model

IQ

In/1M

Out/1M

Status

OpenAI GPT OSS 120B

NVIDIA Nemotron 3 Nano 30B

Mistral Small 3.2 24B Instruct

Google Gemma 3 27B Instruct

GLM 4.7 Flash Heretic

Nvidia Nemotron Cascade 2 30b A3b

Qwen 3 235B A22B Instruct 2507

Google Gemma 4 26b A4b It

Google Gemma 4 31b It

Mistral Small 2603

Venice Uncensored 1.1

Venice Uncensored 1 2

Venice Uncensored 1.1

Qwen3 VL 30B A3B

Qwen 3.5 35B A3B

Arcee Trinity Large Thinking

Qwen 3 Next 80b

Qwen 3 Coder 480B Turbo

Qwen 3 235B A22B Thinking 2507

Venice Role Play Uncensored

Qwen3.5 122B A10B

Gemini 3 Flash Preview

Qwen3 5 397b A17b

Qwen 3 Coder 480b

Kimi K2 Thinking

Openai Gpt 54 Mini

Aion Labs Aion 2 0

Hermes 3 Llama 3.1 405b

Z Ai Glm 5 Turbo

Z Ai Glm 5v Turbo

Zai Org Glm 5 1

Grok 4 20 Multi Agent

Gemini 3.1 Pro Preview

Claude Sonnet 4.6

Claude Sonnet 4.5

Claude Opus 4 7

Claude Opus 4.6

Claude Opus 4.5

Claude Opus 4.6 Fast

Cite: Venice pricing data

Citation

Venice — AI inference pricing (70 models). Inference Index, 2026-04-16. https://inferenceindex.ai/providers/venice

API

curl https://inferenceindex.ai/api/prices