LLM API Pricing Table

Input / output / cache-hit prices (per 1M tokens) across models, with currency and tier tags — searchable and sortable by price.

Pricing data updated: 2026-06-14FX data updated: 2026-06-14

Search models

⚠ Prices were checked against official pages on the date below. Tiered models show all tiers in the table; provider pages are authoritative.

Model	Input /1M	Output /1M	Cache /1M	Source
GPT-5.5 OpenAI · USD	$5	$30	$0.5	official
GPT-5.5 Batch OpenAI · USD	$2.5	$15	$0.25	official
GPT-5.4 OpenAI · USD	$2.5	$15	$0.25	official
GPT-5.4 mini OpenAI · USD	$0.75	$4.5	$0.075	official
Claude Opus 4.8 Anthropic · USD	$5	$25	$0.5 write $6.25	official
Claude Sonnet 4.6 Anthropic · USD	$3	$15	$0.3 write $3.75	official
Claude Sonnet 4.6 Batch Anthropic · USD	$1.5	$7.5	$0.15	official
Claude Haiku 4.5 Anthropic · USD	$1	$5	$0.1 write $1.25	official
Claude Sonnet 4.5 Anthropic · USD	$3	$15	$0.3 write $3.75	official
Gemini 3.5 Flash Google · USD	$1.5	$9	$0.15	official
Gemini 3.5 Flash Batch Google · USD	$0.75	$4.5	$0.075	official
Gemini 3.1 Pro Preview tiered Google · USD	<= 200,000 input$2> previous input$4	<= 200,000 input$12> previous input$18	<= 200,000 input$0.2> previous input$0.4	official
Gemini 3.1 Pro Preview Batch tiered Google · USD	<= 200,000 input$1> previous input$2	<= 200,000 input$6> previous input$9	<= 200,000 input$0.2> previous input$0.4	official
Gemini 3.1 Flash-Lite Google · USD	$0.25	$1.5	$0.025	official
DeepSeek V4 Pro 官方人民币价 DeepSeek · CNY 中文官方价格页，按充值余额人民币计价。	¥3	¥6	¥0.025	official
DeepSeek V4 Pro Official USD price DeepSeek · USD English official price page.	$0.435	$0.87	$0.0036	official
DeepSeek V4 Flash 官方人民币价 DeepSeek · CNY 中文官方价格页，按充值余额人民币计价。	¥1	¥2	¥0.02	official
DeepSeek V4 Flash Official USD price DeepSeek · USD English official price page.	$0.14	$0.28	$0.0028	official
MiniMax-M3 Standard tiered MiniMax · USD Permanent 50% off pricing shown on official page; >512k availability may be limited.	<= 512,000 input$0.3> previous input$0.6	<= 512,000 input$1.2> previous input$2.4	<= 512,000 input$0.06> previous input$0.12	official
MiniMax-M3 Priority tiered MiniMax · USD Priority tier: set service_tier=priority; official page says 1.5x standard.	<= 512,000 input$0.45> previous input$0.9	<= 512,000 input$1.8> previous input$3.6	<= 512,000 input$0.09> previous input$0.18	official
MiniMax-M2.7 MiniMax · USD	$0.3	$1.2	$0.06 write $0.375	official

FX: 1 USD = 7.2 CNY · Prices are per 1M tokens in each model’s billing currency.

Pending official price checks

These models have official pages, but stable public input/output token prices were not visible in the fetched page content, so they are not used by the calculators yet.

Kimi / Moonshot AI kimi-k2.7-code, kimi-k2.6 官方文档已确认模型页、计费单位和 Batch 60% 规则，但当前公开 HTML 未暴露具体输入/输出数字；暂不加入计算器，等能核验官方数字后再加。 official
Alibaba Cloud Model Studio / Qwen qwen3.7-max, qwen3.7-plus, qwen3.6-flash 官方帮助页已确认模型存在，但价格明细在控制台动态页加载；暂不加入计算器，等能核验公开官方数字后再加。 official
Xiaomi MiMo MiMo-V2.5-Pro, MiMo-V2.5, MiMo Code 小米 MiMo 官网已确认模型和 API 接入入口，但公开页面未展示 token 输入/输出价格；暂不加入计算器，等官方价格数字可核验后再加。 official
Zhipu / GLM GLM-5.1, GLM-5, GLM-4.7 智谱开放平台价格页为动态页面，当前抓取结果未暴露具体 token 单价；暂不加入计算器，等官方价格数字可核验后再加。 official

What it is

A side-by-side LLM API pricing table and AI model price comparison: compare OpenAI, Claude, Gemini, DeepSeek and other model input, output, cached-input and tiered prices per 1M tokens, with billing currency, checked date and official source links. Pair it with the two cost calculators to compute actual spend.

Related search intents: LLM API pricing table · AI model pricing comparison · GPT pricing table · Claude API pricing · Gemini API pricing · DeepSeek API pricing · input output token price · cached input pricing · LLM token price per 1M · AI API price comparison · model pricing table · API token pricing · tiered model pricing · LLM cost comparison · OpenAI Anthropic Gemini pricing

FAQ

What unit are prices in?

All prices are per 1M tokens in the model’s billing currency — USD for USD models, CNY for CNY models. Use a calculator’s rate field to convert.

What does “tiered” mean?

The model switches price tier by per-call input or output length. The table shows all tiers; sorting uses the first tier, and the calculator selects the tier from your usage.

Why are some cache prices “—”?

It means the model has no public or modeled cache-hit price; it is billed at the input price and the hit rate has no effect.

How often are prices updated?

The table shows a checked date under the prices. This is not a provider billing system; if a provider just changed prices, its official pricing page wins.

Related tools

Pricing data updated: 2026-06-14 · Data: this pricing table is compiled from official model pricing pages, with source links and checked dates; provider pricing pages are authoritative.