AI Model Atlas

Model & Pricing Atlas

Up-to-date model cards + pricing, normalized and source-linked.

How to read this page
API pricing is pay-as-you-go: you pay only for what you use. Prices are quoted per 1M tokens (roughly 750,000 words, depending on language and formatting). Input is the text you send to the model; output is the text it generates. Some providers also bill cached input at a lower rate, which cuts costs when you resend the same text. Want a walkthrough? Follow our tutorial.
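To make the per-1M-token arithmetic concrete, here is a minimal sketch of a cost estimate. The function name and parameters are hypothetical, and it assumes cached tokens are billed at the cached rate instead of the regular input rate. Cache-write fees, minimum cacheable lengths, and tiered pricing vary by provider and are not modeled.

```python
from typing import Optional

TOKENS_PER_PRICE_UNIT = 1_000_000  # prices on this page are per 1M tokens


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price: float,
    output_price: float,
    cached_tokens: int = 0,
    cached_price: Optional[float] = None,
) -> float:
    """Estimate the USD cost of one request from per-1M-token prices.

    cached_tokens is the part of input_tokens served from cache
    (assumption: billed at cached_price instead of input_price).
    """
    if cached_tokens < 0 or cached_tokens > input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    if cached_price is None:
        # No cached rate listed: bill every input token at the normal rate.
        cached_price = input_price

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * input_price
        + cached_tokens * cached_price
        + output_tokens * output_price
    )
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Claude Sonnet 4.5 prices as listed on this page: $3.00 in, $15.00 out, $0.30 cached.
    no_cache = estimate_cost_usd(200_000, 20_000, 3.00, 15.00)
    with_cache = estimate_cost_usd(200_000, 20_000, 3.00, 15.00, 150_000, 0.30)
    print(f"without cache: ${no_cache:.3f}")
    print(f"with cache:    ${with_cache:.3f}")
```

With 200,000 input tokens (150,000 of them cached) and 20,000 output tokens, the estimate drops from about $0.90 to about $0.495 under these assumptions.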

Featured pricing

USD · input price per 1M tokens · most expensive first
Updated: 2026-01-25

Anthropic Claude Opus 4.1 · $15.00 in
Anthropic Claude Opus 4.5 · $5.00 in
Anthropic Claude Sonnet 4.5 · $3.00 in
xAI Grok 4 · $3.00 in
OpenAI GPT-4o · $2.50 in
OpenAI o3 · $2.00 in
OpenAI GPT-5.2 · $1.75 in
Google Gemini 2.5 Pro · $1.25 in
Anthropic Claude Haiku 4.5 · $1.00 in
Google Gemini 2.5 Flash · $0.30 in
OpenAI GPT-5 Mini · $0.25 in
Bars are log-scaled within this view so very cheap models remain visible.
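The page does not state the exact scaling formula, so the following is only a sketch of one common way to log-scale bar widths. The function name and the `min_width` floor are assumptions. Each price is mapped to its position between the cheapest and the most expensive model on a log10 axis, then stretched into the range from `min_width` to 1.0 so the cheapest bar never collapses to zero.

```python
import math


def log_bar_widths(prices: dict, min_width: float = 0.1) -> dict:
    """Map positive prices to relative bar widths in [min_width, 1.0] on a log scale."""
    if not prices:
        return {}
    if any(p <= 0 for p in prices.values()):
        raise ValueError("log scaling requires strictly positive prices")

    logs = {name: math.log10(p) for name, p in prices.items()}
    lo, hi = min(logs.values()), max(logs.values())
    span = hi - lo

    widths = {}
    for name, value in logs.items():
        # If every price is identical, draw all bars at full width.
        frac = 1.0 if span == 0 else (value - lo) / span
        widths[name] = min_width + (1.0 - min_width) * frac
    return widths


if __name__ == "__main__":
    # Input prices (USD per 1M tokens) taken from the featured list on this page.
    featured = {
        "Claude Opus 4.1": 15.00,
        "Claude Sonnet 4.5": 3.00,
        "Gemini 2.5 Pro": 1.25,
        "GPT-5 Mini": 0.25,
    }
    for name, width in log_bar_widths(featured).items():
        print(f"{name:<20} {'#' * round(width * 40)}")
```

On a linear scale the $0.25 bar would be under 2% of the $15.00 bar and effectively invisible; the log mapping keeps the ordering while leaving every bar readable.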

What is the best model?

See the live community leaderboard to compare performance across top models.

Open LM Arena →

Anthropic

Flagship models.

Claude Opus 4.5 · Anthropic · Active
Highest-capability Claude model for deep reasoning.
Input $5.00 · Output $25.00 · Cached input $0.50 · per 1M tokens

Claude Sonnet 4.5 · Anthropic · Active
Balanced model for coding, analysis, and writing.
Input $3.00 · Output $15.00 · Cached input $0.30 · per 1M tokens

Claude Haiku 4.5 · Anthropic · Active
Fast, affordable model for everyday tasks.
Input $1.00 · Output $5.00 · Cached input $0.10 · per 1M tokens

Claude Opus 4.1 · Anthropic · Active
High-capability model for complex reasoning.
Input $15.00 · Output $75.00 · Cached input $1.50 · per 1M tokens

Google

Flagship models.

Gemini 3 Pro Preview · Google · Preview
Preview Gemini 3 model for advanced multimodal tasks.
Input $2.00 · Output $12.00 · Cached input $0.40 · per 1M tokens

Gemini 3 Flash Preview · Google · Preview
Fast Gemini 3 preview optimized for low latency.
Input $0.60 · Output $3.00 · Cached input $0.06 · per 1M tokens

Gemini 2.5 Pro · Google · Active
High-accuracy Gemini model for complex tasks.
Input $1.25 · Output $10.00 · Cached input $0.13 · per 1M tokens

Gemini 2.5 Flash · Google · Active
Fast Gemini 2.5 variant optimized for cost.
Input $0.30 · Output $2.50 · Cached input $0.03 · per 1M tokens

OpenAI

Flagship models.

GPT-5.2 · OpenAI · Preview
Flagship reasoning model for complex tasks and long context.
Input $1.75 · Output $14.00 · Cached input $0.18 · per 1M tokens

GPT-5 Mini · OpenAI · Active
Fast, cost-efficient model for high-volume chat workloads.
Input $0.25 · Output $2.00 · Cached input $0.03 · per 1M tokens

GPT-4o · OpenAI · Active
Flagship multimodal model for chat, vision, and tools.
Input $2.50 · Output $10.00 · Cached input $1.25 · per 1M tokens

o3 · OpenAI · Preview
Reasoning model optimized for reliability and planning.
Input $2.00 · Output $8.00 · Cached input $0.50 · per 1M tokens

xAI

Flagship models.

Grok 4 · xAI · Preview
Frontier Grok model for reasoning and chat.
Input $3.00 · Output $15.00 · Cached input $0.75 · per 1M tokens

Grok 4 Fast Reasoning · xAI · Preview
Fast Grok reasoning model for latency-sensitive tasks.
Input $0.20 · Output $0.50 · per 1M tokens

Grok 4.1 Fast Reasoning · xAI · Preview
Updated fast reasoning variant with improved quality.
Input $0.20 · Output $0.50 · Cached input $0.05 · per 1M tokens

Grok Code Fast 1 · xAI · Preview
Code-focused model for fast iteration and debugging.
Input $0.20 · Output $1.50 · per 1M tokens