🍃YeziBuilds

LLM API Pricing Table

Input / output / cache-hit prices (per 1M tokens) across models, with currency and tier tags — searchable and sortable by price.

Pricing data updated: 2026-06-14 · FX data updated: 2026-06-14
⚠ Prices were checked against official pages on the date below. Tiered models show all tiers in the table; provider pages are authoritative.
| Model | Provider · Currency | Input /1M | Output /1M | Cache /1M | Source | Notes |
|---|---|---|---|---|---|---|
| GPT-5.5 | OpenAI · USD | $5 | $30 | $0.5 | official | |
| GPT-5.5 Batch | OpenAI · USD | $2.5 | $15 | $0.25 | official | |
| GPT-5.4 | OpenAI · USD | $2.5 | $15 | $0.25 | official | |
| GPT-5.4 mini | OpenAI · USD | $0.75 | $4.5 | $0.075 | official | |
| Claude Opus 4.8 | Anthropic · USD | $5 | $25 | $0.5 (write $6.25) | official | |
| Claude Sonnet 4.6 | Anthropic · USD | $3 | $15 | $0.3 (write $3.75) | official | |
| Claude Sonnet 4.6 Batch | Anthropic · USD | $1.5 | $7.5 | $0.15 | official | |
| Claude Haiku 4.5 | Anthropic · USD | $1 | $5 | $0.1 (write $1.25) | official | |
| Claude Sonnet 4.5 | Anthropic · USD | $3 | $15 | $0.3 (write $3.75) | official | |
| Gemini 3.5 Flash | Google · USD | $1.5 | $9 | $0.15 | official | |
| Gemini 3.5 Flash Batch | Google · USD | $0.75 | $4.5 | $0.075 | official | |
| Gemini 3.1 Pro Preview (tiered) | Google · USD | ≤200,000 input: $2; >200,000 input: $4 | ≤200,000 input: $12; >200,000 input: $18 | ≤200,000 input: $0.2; >200,000 input: $0.4 | official | |
| Gemini 3.1 Pro Preview Batch (tiered) | Google · USD | ≤200,000 input: $1; >200,000 input: $2 | ≤200,000 input: $6; >200,000 input: $9 | ≤200,000 input: $0.2; >200,000 input: $0.4 | official | |
| Gemini 3.1 Flash-Lite | Google · USD | $0.25 | $1.5 | $0.025 | official | |
| DeepSeek V4 Pro (official CNY price) | DeepSeek · CNY | ¥3 | ¥6 | ¥0.025 | official | Chinese official pricing page; billed in CNY against the top-up balance. |
| DeepSeek V4 Pro (official USD price) | DeepSeek · USD | $0.435 | $0.87 | $0.0036 | official | English official price page. |
| DeepSeek V4 Flash (official CNY price) | DeepSeek · CNY | ¥1 | ¥2 | ¥0.02 | official | Chinese official pricing page; billed in CNY against the top-up balance. |
| DeepSeek V4 Flash (official USD price) | DeepSeek · USD | $0.14 | $0.28 | $0.0028 | official | English official price page. |
| MiniMax-M3 Standard (tiered) | MiniMax · USD | ≤512,000 input: $0.3; >512,000 input: $0.6 | ≤512,000 input: $1.2; >512,000 input: $2.4 | ≤512,000 input: $0.06; >512,000 input: $0.12 | official | Permanent 50% off pricing shown on official page; >512k availability may be limited. |
| MiniMax-M3 Priority (tiered) | MiniMax · USD | ≤512,000 input: $0.45; >512,000 input: $0.9 | ≤512,000 input: $1.8; >512,000 input: $3.6 | ≤512,000 input: $0.09; >512,000 input: $0.18 | official | Priority tier: set service_tier=priority; official page says 1.5x standard. |
| MiniMax-M2.7 | MiniMax · USD | $0.3 | $1.2 | $0.06 (write $0.375) | official | |

FX: 1 USD = 7.2 CNY · Prices are per 1M tokens in each model’s billing currency.

Pending official price checks

These models have official pages, but stable public input/output token prices were not visible in the fetched page content, so they are not used by the calculators yet.

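  • Kimi / Moonshot AI (kimi-k2.7-code, kimi-k2.6): The official docs confirm the model pages, the billing units and the Batch 60% rule, but the current public HTML does not expose the actual input/output figures. Not added to the calculators until the official numbers can be verified. Source: official
  • Alibaba Cloud Model Studio / Qwen (qwen3.7-max, qwen3.7-plus, qwen3.6-flash): The official help pages confirm that the models exist, but the price details load on a dynamic console page. Not added to the calculators until public official numbers can be verified. Source: official
  • Xiaomi MiMo (MiMo-V2.5-Pro, MiMo-V2.5, MiMo Code): The Xiaomi MiMo official site confirms the models and the API access entry point, but the public pages do not show token input/output prices. Not added to the calculators until official price figures can be verified. Source: official
  • Zhipu / GLM (GLM-5.1, GLM-5, GLM-4.7): The Zhipu open platform pricing page is dynamic, and the current fetch did not expose specific per-token prices. Not added to the calculators until official price figures can be verified. Source: official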
What it is

A side-by-side LLM API pricing table and AI model price comparison: compare OpenAI, Claude, Gemini, DeepSeek and other model input, output, cached-input and tiered prices per 1M tokens, with billing currency, checked date and official source links. Pair it with the two cost calculators to compute actual spend.

FAQ

What unit are prices in?

All prices are per 1M tokens in the model’s billing currency — USD for USD models, CNY for CNY models. Use a calculator’s rate field to convert.
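As a rough illustration of the unit, the sketch below turns per-1M prices into the cost of one request and converts a CNY amount to USD. The function names `request_cost` and `cny_to_usd` are hypothetical and are not part of the site's calculators; the token counts are made up, the prices are the GPT-5.4 mini and DeepSeek V4 Flash (CNY) rows from the table, and the rate is the page's stated 1 USD = 7.2 CNY.

```python
CNY_PER_USD = 7.2  # FX rate stated on this page; a calculator's rate field would replace it


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost of one request in the model's billing currency.

    Prices are per 1M tokens, so the token-weighted sum is divided by 1,000,000.
    """
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cny_to_usd(amount_cny: float, cny_per_usd: float = CNY_PER_USD) -> float:
    """Convert a CNY amount to USD at the given rate (hypothetical helper)."""
    return amount_cny / cny_per_usd


# Illustrative request: 200,000 input tokens and 50,000 output tokens.
usd_cost = request_cost(200_000, 50_000, 0.75, 4.5)  # GPT-5.4 mini, billed in USD
cny_cost = request_cost(200_000, 50_000, 1.0, 2.0)   # DeepSeek V4 Flash, billed in CNY

print(f"GPT-5.4 mini: ${usd_cost:.3f}")
print(f"DeepSeek V4 Flash: ¥{cny_cost:.3f} (about ${cny_to_usd(cny_cost):.4f})")
```

Converting the CNY list price this way need not match the provider's own USD list price. The table lists the DeepSeek USD and CNY prices as separate official rows.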

What does “tiered” mean?

A tiered model charges a different per-token price depending on the per-call input or output length. The table shows all tiers; sorting uses the first tier, and the calculator selects the tier from your usage.
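The tier lookup can be sketched as follows. This is a minimal illustration, not the calculator's actual code: the names `pick_price`, `sort_key` and `call_cost` are hypothetical, tiers are assumed to be keyed on per-call input length (as in the table rows), and the whole call is assumed to be billed at the selected tier's price. The figures are the Gemini 3.1 Pro Preview row.

```python
from typing import List, Optional, Tuple

# Each tier is (max_input_tokens, price_per_1M); None means "no upper bound".
Tier = Tuple[Optional[int], float]

GEMINI_31_PRO_INPUT: List[Tier] = [(200_000, 2.0), (None, 4.0)]
GEMINI_31_PRO_OUTPUT: List[Tier] = [(200_000, 12.0), (None, 18.0)]


def pick_price(tiers: List[Tier], input_tokens: int) -> float:
    """Return the price of the first tier whose limit covers the input length."""
    for limit, price in tiers:
        if limit is None or input_tokens <= limit:
            return price
    raise ValueError("no tier covers this input length")


def sort_key(tiers: List[Tier]) -> float:
    """Sorting uses the first tier's price, as the FAQ describes."""
    return tiers[0][1]


def call_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one call, assuming the whole call is billed at one tier."""
    in_price = pick_price(GEMINI_31_PRO_INPUT, input_tokens)
    out_price = pick_price(GEMINI_31_PRO_OUTPUT, input_tokens)
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


print(round(call_cost(150_000, 10_000), 4))  # first tier: $2 input, $12 output
print(round(call_cost(250_000, 10_000), 4))  # second tier: $4 input, $18 output
```

Whether a provider bills the entire call at the higher tier or only the tokens above the threshold is a provider-specific rule; check the official pricing page before relying on either reading.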

Why are some cache prices “—”?

A dash in the Cache column means the model has no public or modeled cache-hit price. Cached input is then billed at the regular input price, so the hit rate has no effect on cost.
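One way to picture the effect of the hit rate is a blended input price. The sketch below is an assumption about how such a blend could be computed, not the calculator's confirmed formula: `effective_input_price` is a hypothetical name, a missing cache price is represented as `None`, and cache-write surcharges (the "write" prices in the table) are ignored.

```python
from typing import Optional


def effective_input_price(input_price: float,
                          cache_price: Optional[float],
                          hit_rate: float) -> float:
    """Blended input price per 1M tokens for a given cache hit rate.

    hit_rate is the fraction of input tokens served from cache (0.0 to 1.0).
    With no cache-hit price (None), everything is billed at the input price.
    """
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be between 0 and 1")
    if cache_price is None:
        return input_price  # hit rate has no effect
    return hit_rate * cache_price + (1.0 - hit_rate) * input_price


# Claude Sonnet 4.6 row: $3 input, $0.3 cache hit, with an assumed 80% hit rate.
print(round(effective_input_price(3.0, 0.3, 0.8), 4))   # about 0.84
# A model with no cache-hit price: the hit rate is ignored.
print(effective_input_price(3.0, None, 0.8))            # 3.0
```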

How often are prices updated?

The table shows a checked date under the prices. This is not a provider billing system; if a provider just changed prices, its official pricing page wins.

Pricing data updated: 2026-06-14 · Data: this pricing table is compiled from official model pricing pages, with source links and checked dates; provider pricing pages are authoritative.