Model & Pricing Atlas

Up-to-date model cards + pricing, normalized and source-linked.

How to read this page

API pricing is pay-as-you-go: you pay only for what you use. Prices are per 1M tokens (roughly 750,000 words, depending on language and formatting). Input is the text you send to the model; output is the text it generates. Some providers also offer a discounted cached-input rate, which can reduce costs when requests reuse the same text. Want a walkthrough? Follow our tutorial.
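To make the per-1M-token arithmetic concrete, here is a minimal sketch of a cost estimate for a single request. The `Price` class and `request_cost` function are hypothetical names introduced for illustration, and the sketch assumes cached tokens are a subset of the input tokens billed at the cached rate, with the rest billed at the normal input rate. Any provider-specific charges for writing to the cache are not modeled.

```python
from dataclasses import dataclass
from typing import Optional

TOKENS_PER_UNIT = 1_000_000  # prices on this page are quoted per 1M tokens


@dataclass(frozen=True)
class Price:
    """USD per 1M tokens. `cached_input` is None when no cached rate is listed."""
    input: float
    output: float
    cached_input: Optional[float] = None


def request_cost(price: Price, input_tokens: int, output_tokens: int,
                 cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: `cached_tokens` is the part of `input_tokens` billed at the
    cached rate; the remainder is billed at the normal input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    # Fall back to the normal input rate when no cached rate is listed.
    cached_rate = price.cached_input if price.cached_input is not None else price.input
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * price.input
             + cached_tokens * cached_rate
             + output_tokens * price.output)
    return total / TOKENS_PER_UNIT


# Claude Sonnet 4.5 rates as listed on this page.
sonnet = Price(input=3.00, output=15.00, cached_input=0.30)

# 20,000 input tokens (15,000 of them cached) and 1,000 output tokens.
cost = request_cost(sonnet, input_tokens=20_000, output_tokens=1_000,
                    cached_tokens=15_000)
print(f"${cost:.4f}")
```

In this example the 5,000 uncached input tokens and the 1,000 output tokens each contribute $0.0150, and the 15,000 cached tokens add $0.0045, which shows why output-heavy workloads are usually dominated by the output rate.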

Featured pricing

USD · per 1M tokens · most expensive first

Updated: 2026-01-25


Anthropic • Claude Opus 4.1 · $15.00 in
Anthropic • Claude Opus 4.5 · $5.00 in
Anthropic • Claude Sonnet 4.5 · $3.00 in
xAI • Grok 4 · $3.00 in
OpenAI • GPT-4o · $2.50 in
Google • Gemini 3 Pro Preview · $2.00 in
OpenAI • o3 · $2.00 in
OpenAI • GPT-5.2 · $1.75 in
Google • Gemini 2.5 Pro · $1.25 in
Anthropic • Claude Haiku 4.5 · $1.00 in
Google • Gemini 3 Flash Preview · $0.60 in
Google • Gemini 2.5 Flash · $0.30 in
OpenAI • GPT-5 Mini · $0.25 in
xAI • Grok 4 Fast Reasoning · $0.20 in
xAI • Grok 4.1 Fast Reasoning · $0.20 in
xAI • Grok Code Fast 1 · $0.20 in

Bars are log-scaled within this view so very cheap models remain visible.
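The page does not say how the bar widths are computed, so the following is only a sketch of one common way to log-scale values for display. The function name `log_scaled_widths` and the `min_width` floor are assumptions made for illustration, not the chart's actual formula.

```python
import math


def log_scaled_widths(prices, min_width=0.05):
    """Map positive prices to bar widths in [min_width, 1.0] on a log scale.

    `min_width` is a hypothetical floor so the cheapest bar is still drawn.
    """
    if not prices or any(p <= 0 for p in prices):
        raise ValueError("prices must be a non-empty sequence of positive numbers")
    logs = [math.log10(p) for p in prices]
    lo, hi = min(logs), max(logs)
    if hi == lo:  # all prices equal: draw every bar at full width
        return [1.0 for _ in prices]
    span = hi - lo
    return [min_width + (1.0 - min_width) * (v - lo) / span for v in logs]


# Input prices from the featured list (USD per 1M tokens).
featured = {"Claude Opus 4.1": 15.00, "GPT-4o": 2.50, "Grok 4 Fast Reasoning": 0.20}
widths = log_scaled_widths(list(featured.values()))
for name, width in zip(featured, widths):
    print(f"{name:24s} {'#' * round(width * 40)}")
```

On a linear scale a $0.20 bar would be about 1% as wide as a $15.00 bar. Taking logarithms compresses that 75x gap so the cheapest entries stay legible, at the cost that bar lengths are no longer proportional to price.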

What is the best model?

See the live community leaderboard to compare performance across top models.

Open LM Arena →

Anthropic

Flagship models.

Claude Opus 4.5 · Active
Highest-capability Claude model for deep reasoning.
Input $5.00 · Output $25.00 · Cached input $0.50 · per 1M tokens

Claude Sonnet 4.5 · Active
Balanced model for coding, analysis, and writing.
Input $3.00 · Output $15.00 · Cached input $0.30 · per 1M tokens

Claude Haiku 4.5 · Active
Fast, affordable model for everyday tasks.
Input $1.00 · Output $5.00 · Cached input $0.10 · per 1M tokens

Claude Opus 4.1 · Active
High-capability model for complex reasoning.
Input $15.00 · Output $75.00 · Cached input $1.50 · per 1M tokens

Google

Flagship models.

Gemini 3 Pro Preview · Preview
Preview Gemini 3 model for advanced multimodal tasks.
Input $2.00 · Output $12.00 · Cached input $0.40 · per 1M tokens

Gemini 3 Flash Preview · Preview
Fast Gemini 3 preview optimized for low latency.
Input $0.60 · Output $3.00 · Cached input $0.06 · per 1M tokens

Gemini 2.5 Pro · Active
High-accuracy Gemini model for complex tasks.
Input $1.25 · Output $10.00 · Cached input $0.13 · per 1M tokens

Gemini 2.5 Flash · Active
Fast Gemini 2.5 variant optimized for cost.
Input $0.30 · Output $2.50 · Cached input $0.03 · per 1M tokens

OpenAI

Flagship models.

GPT-5.2 · Preview
Flagship reasoning model for complex tasks and long context.
Input $1.75 · Output $14.00 · Cached input $0.18 · per 1M tokens

GPT-5 Mini · Active
Fast, cost-efficient model for high-volume chat workloads.
Input $0.25 · Output $2.00 · Cached input $0.03 · per 1M tokens

GPT-4o · Active
Flagship multimodal model for chat, vision, and tools.
Input $2.50 · Output $10.00 · Cached input $1.25 · per 1M tokens

o3 · Preview
Reasoning model optimized for reliability and planning.
Input $2.00 · Output $8.00 · Cached input $0.50 · per 1M tokens

xAI

Flagship models.

Grok 4 · Preview
Frontier Grok model for reasoning and chat.
Input $3.00 · Output $15.00 · Cached input $0.75 · per 1M tokens

Grok 4 Fast Reasoning · Preview
Fast Grok reasoning model for latency-sensitive tasks.
Input $0.20 · Output $0.50 · per 1M tokens

Grok 4.1 Fast Reasoning · Preview
Updated fast reasoning variant with improved quality.
Input $0.20 · Output $0.50 · Cached input $0.05 · per 1M tokens

Grok Code Fast 1 · Preview
Code-focused model for fast iteration and debugging.
Input $0.20 · Output $1.50 · per 1M tokens