List prices per 1M tokens · July 2026

Claude Opus 4.8$5 in / $25 out

Claude Sonnet 5$2 in / $10 out

Claude Fable 5$10 in / $50 out

Claude Haiku 4.5$1 in / $5 out

GPT-5.5$5 in / $30 out

GPT-5.5 Pro$30 in / $180 out

GPT-5.4$2.5 in / $15 out

GPT-5.4 Mini$0.75 in / $4.5 out

GPT-5.4 Nano$0.2 in / $1.25 out

Gemini 3.1 Pro$2 in / $12 out

Gemini 3.5 Flash$1.5 in / $9 out

Gemini 3.1 Flash-Lite$0.25 in / $1.5 out

Grok 4.3$1.25 in / $2.5 out

Grok 4.1 Fast$0.2 in / $0.5 out

DeepSeek V4 Pro$0.435 in / $0.87 out

DeepSeek V4 Flash$0.14 in / $0.28 out

Mistral Large$2 in / $6 out

Mistral Small$0.1 in / $0.3 out

List prices per 1M tokens · July 2026

Claude Opus 4.8$5 in / $25 out

Claude Sonnet 5$2 in / $10 out

Claude Fable 5$10 in / $50 out

Claude Haiku 4.5$1 in / $5 out

GPT-5.5$5 in / $30 out

GPT-5.5 Pro$30 in / $180 out

GPT-5.4$2.5 in / $15 out

GPT-5.4 Mini$0.75 in / $4.5 out

GPT-5.4 Nano$0.2 in / $1.25 out

Gemini 3.1 Pro$2 in / $12 out

Gemini 3.5 Flash$1.5 in / $9 out

Gemini 3.1 Flash-Lite$0.25 in / $1.5 out

Grok 4.3$1.25 in / $2.5 out

Grok 4.1 Fast$0.2 in / $0.5 out

DeepSeek V4 Pro$0.435 in / $0.87 out

DeepSeek V4 Flash$0.14 in / $0.28 out

Mistral Large$2 in / $6 out

Mistral Small$0.1 in / $0.3 out

Model economics · free tool

The cheapest model isn’t the cheapest model.

Cost per token is a distraction. What matters is cost per completed task, at your volume, with oversight priced in, against a person doing the same work by hand.

Try

1 · The task

System prompttokInput / tasktokOutput / tasktokFrequencytasks / mo

2 · The human side

Time / taskminLoaded cost$/hrAI reviewmin

$4.67 / task × 10,000 / mo = $46,667 / month

3 · Model cost per completed taskmodel call ÷ success rate

01

$5 in / $25 out per 1M

%success%out ±

$0.0133$133.15/mo

★

$2 in / $10 out per 1M

%success%out ±

$0.0056$55.68/mo

03

$10 in / $50 out per 1M

%success%out ±

$0.0255$255.21/mo

—

Done by hand

8 min @ $35/hr × 10,000/mo

100%

success

$4.674.1× vs ai all in

The verdict

Switch to Claude Sonnet 5.

$77.47 / month

saved on model cost vs Opus 4.8 (−58%) · 4.1× cheaper than by hand, all in · 10,000 tasks/mo

Book a meetingWe build it on whichever model wins

Opus 4.8 $133.15Sonnet 5 $55.68Fable 5 $255.21Review $5,833Failures $5,600By hand $46,667

Method · the formula behind the verdict+

C_month = F ·T_sys · κ · r_in + T_in · r_in + (1 + δ) · T_out · r_outp+m_rev · w60

Ftasks per monthT_sys, T_in, T_outtokens per callr_in, r_outlist price per tokenκcache: 0.1 inside the 5 min window, else 1δoutput ±psuccess ratem_revminutes of review per answerwhourly cost

Failed calls are retried until success, so the model cost divides by p. The by hand baseline is F · minutes per task · w / 60. Success rate, output adjustment and review time are your estimates, not published benchmarks. 1 token ≈ 4 characters of English text.

Full price sheet · July 2026 snapshot+

Provider	Model	Input $/1M	Cached input $/1M	Output $/1M
Anthropic	Claude Opus 4.8	$5	—	$25
Anthropic	Claude Sonnet 5Intro price through Aug 31, 2026 (standard $3 / $15)	$2	—	$10
Anthropic	Claude Fable 5	$10	—	$50
Anthropic	Claude Haiku 4.5	$1	—	$5
OpenAI	GPT-5.5	$5	$0.5	$30
OpenAI	GPT-5.5 Pro	$30	—	$180
OpenAI	GPT-5.4	$2.5	$0.25	$15
OpenAI	GPT-5.4 Mini	$0.75	$0.075	$4.5
OpenAI	GPT-5.4 Nano	$0.2	$0.02	$1.25
Google	Gemini 3.1 ProTier up to 200K context ($4 / $18 above 200K)	$2	—	$12
Google	Gemini 3.5 Flash	$1.5	—	$9
Google	Gemini 3.1 Flash-Lite	$0.25	—	$1.5
xAI	Grok 4.3	$1.25	$0.2	$2.5
xAI	Grok 4.1 Fast	$0.2	—	$0.5
DeepSeek	DeepSeek V4 ProPromotional price (75% off); standard $1.74 / $3.48	$0.435	$0.004	$0.87
DeepSeek	DeepSeek V4 Flash	$0.14	$0.003	$0.28
Mistral	Mistral Large	$2	—	$6
Mistral	Mistral Small	$0.1	—	$0.3

Sources: Anthropic, OpenAI, Google Gemini, xAI, DeepSeek and Mistral published API pricing pages. List prices change without notice. Verify before you commit.