Skip to content

chore(pricing): Update google pricing#688

Open
siddharthsambharia-portkey wants to merge 2 commits intomainfrom
pricing-update/google-24342980303
Open

chore(pricing): Update google pricing#688
siddharthsambharia-portkey wants to merge 2 commits intomainfrom
pricing-update/google-24342980303

Conversation

@siddharthsambharia-portkey
Copy link
Copy Markdown
Collaborator

🔄 Pricing Update: google

📊 Summary (complete_diff mode)

Change Type Count
➕ Models added 2
🔄 Models updated (merged) 25

➕ New Models

  • veo-3.1-lite-generate-preview-lte-128k
  • veo-3.1-lite-generate-preview-gt-128k

🔄 Updated Models

  • gemini-3.1-flash-lite-preview-lte-128k
  • gemini-3.1-flash-lite-preview-gt-128k
  • gemini-flash-lite-latest-lte-128k
  • gemini-flash-lite-latest-gt-128k
  • gemini-2.5-pro-lte-128k
  • gemini-2.0-flash-lte-128k
  • gemini-2.0-flash-gt-128k
  • gemini-2.0-flash-001-lte-128k
  • gemini-2.0-flash-001-gt-128k
  • gemini-2.0-flash-lite-lte-128k
  • gemini-2.0-flash-lite-gt-128k
  • gemini-2.0-flash-lite-001-lte-128k
  • gemini-2.0-flash-lite-001-gt-128k
  • gemini-embedding-2-preview-lte-128k
  • gemini-embedding-2-preview-gt-128k
  • veo-2.0-generate-001-lte-128k
  • veo-2.0-generate-001-gt-128k
  • veo-3.0-generate-001-lte-128k
  • veo-3.0-generate-001-gt-128k
  • veo-3.0-fast-generate-001-lte-128k
  • veo-3.0-fast-generate-001-gt-128k
  • veo-3.1-generate-preview-lte-128k
  • veo-3.1-generate-preview-gt-128k
  • veo-3.1-fast-generate-preview-lte-128k
  • veo-3.1-fast-generate-preview-gt-128k

📋 Model → pricing page mapping

Model ID Pricing page section Notes
gemini-3.1-pro-preview-lte-128k Gemini 3.1 Pro Preview, ≤200K input $2/1M, output $12/1M, cache_read $0.2/1M, batch $1/$6, web_search 1.4¢
gemini-3.1-pro-preview-gt-128k Gemini 3.1 Pro Preview, >200K input $4/1M, output $18/1M, cache_read $0.4/1M, batch $2/$9, web_search 1.4¢
gemini-3.1-pro-preview-customtools-lte-128k Gemini 3.1 Pro Preview (customtools variant), ≤200K same pricing as gemini-3.1-pro-preview
gemini-3.1-pro-preview-customtools-gt-128k Gemini 3.1 Pro Preview (customtools variant), >200K same pricing as gemini-3.1-pro-preview
gemini-3-pro-preview-lte-128k Gemini 3 Pro Preview, ≤200K input $2/1M, output $12/1M, cache_read $0.2/1M, batch $1/$6, web_search 1.4¢
gemini-3-pro-preview-gt-128k Gemini 3 Pro Preview, >200K input $4/1M, output $18/1M, cache_read $0.4/1M, batch $2/$9, web_search 1.4¢
gemini-pro-latest-lte-128k *-latest → resolved to gemini-3.1-pro-preview, ≤200K same pricing as gemini-3.1-pro-preview
gemini-pro-latest-gt-128k *-latest → resolved to gemini-3.1-pro-preview, >200K same pricing as gemini-3.1-pro-preview gt
gemini-3-flash-preview-lte-128k Gemini 3 Flash Preview, flat pricing input $0.5/1M, output $3/1M, cache_read $0.05/1M, batch $0.25/$1.5, web_search 1.4¢
gemini-3-flash-preview-gt-128k Gemini 3 Flash Preview, flat pricing identical to lte (flat rate, no context tiers)
gemini-flash-latest-lte-128k *-latest → resolved to gemini-3-flash-preview, flat same pricing as gemini-3-flash-preview
gemini-flash-latest-gt-128k *-latest → resolved to gemini-3-flash-preview, flat same pricing as gemini-3-flash-preview
gemini-3.1-flash-lite-preview-lte-128k Gemini 3.1 Flash-Lite Preview, flat pricing input $0.25/1M, output $1.5/1M, cache_read $0.03/1M, batch $0.13/$0.75, web_search 1.4¢
gemini-3.1-flash-lite-preview-gt-128k Gemini 3.1 Flash-Lite Preview, flat pricing identical to lte (flat rate)
gemini-flash-lite-latest-lte-128k *-latest → resolved to gemini-3.1-flash-lite-preview, flat same pricing as gemini-3.1-flash-lite-preview
gemini-flash-lite-latest-gt-128k *-latest → resolved to gemini-3.1-flash-lite-preview, flat same pricing as gemini-3.1-flash-lite-preview
gemini-3.1-flash-image-preview-lte-128k Gemini 3.1 Flash Image Preview, flat pricing input $0.5/1M, text output $3/1M, image_token $60/1M, batch $0.25/$1.5 (batch image $30/1M)
gemini-3.1-flash-image-preview-gt-128k Gemini 3.1 Flash Image Preview, flat pricing identical to lte (flat rate)
gemini-3-pro-image-preview-lte-128k Gemini 3 Pro Image Preview, flat pricing input $2/1M, text output $12/1M, image_token $120/1M, batch $1/$6 (batch image $60/1M)
gemini-3-pro-image-preview-gt-128k Gemini 3 Pro Image Preview, flat pricing identical to lte (flat rate)
gemini-2.5-flash-image-lte-128k Gemini 2.5 Flash Image, flat pricing input $0.3/1M, text output $2.5/1M, image_token $30/1M, batch $0.15/$1.25 (batch image $15/1M), web_search 3.5¢
gemini-2.5-flash-image-gt-128k Gemini 2.5 Flash Image, flat pricing identical to lte (flat rate)
gemini-2.5-pro-lte-128k Gemini 2.5 Pro, ≤200K input $1.25/1M, output $10/1M, cache_read $0.13/1M, batch $0.625/$5, web_search 3.5¢
gemini-2.5-pro-gt-128k Gemini 2.5 Pro, >200K input $2.5/1M, output $15/1M, cache_read $0.25/1M, batch $1.25/$7.5, web_search 3.5¢
gemini-2.5-flash-lte-128k Gemini 2.5 Flash, flat pricing input $0.3/1M, output $2.5/1M, cache_read $0.03/1M, batch $0.15/$1.25, web_search 3.5¢
gemini-2.5-flash-gt-128k Gemini 2.5 Flash, flat pricing identical to lte (flat rate)
gemini-2.5-flash-lite-lte-128k Gemini 2.5 Flash Lite, flat pricing input $0.1/1M, output $0.4/1M, cache_read $0.01/1M, batch $0.05/$0.2, web_search 3.5¢
gemini-2.5-flash-lite-gt-128k Gemini 2.5 Flash Lite, flat pricing identical to lte (flat rate)
gemini-2.0-flash-lte-128k Gemini 2.0 Flash, flat pricing input $0.15/1M, output $0.6/1M, batch $0.075/$0.3, web_search 3.5¢
gemini-2.0-flash-gt-128k Gemini 2.0 Flash, flat pricing identical to lte (flat rate)
gemini-2.0-flash-001-lte-128k Gemini 2.0 Flash (versioned), flat pricing same pricing as gemini-2.0-flash
gemini-2.0-flash-001-gt-128k Gemini 2.0 Flash (versioned), flat pricing same pricing as gemini-2.0-flash
gemini-2.0-flash-lite-lte-128k Gemini 2.0 Flash Lite, flat pricing input $0.075/1M, output $0.3/1M, batch $0.0375/$0.15, web_search 3.5¢
gemini-2.0-flash-lite-gt-128k Gemini 2.0 Flash Lite, flat pricing identical to lte (flat rate)
gemini-2.0-flash-lite-001-lte-128k Gemini 2.0 Flash Lite (versioned), flat pricing same pricing as gemini-2.0-flash-lite
gemini-2.0-flash-lite-001-gt-128k Gemini 2.0 Flash Lite (versioned), flat pricing same pricing as gemini-2.0-flash-lite
gemini-embedding-001-lte-128k Gemini Embedding 001 input $0.15/1M tokens, output free
gemini-embedding-001-gt-128k Gemini Embedding 001 same flat rate
gemini-embedding-2-preview-lte-128k Gemini Embedding 2 Preview (multimodal) text $0.2/1M, image $0.00012/image (0.012¢), video $0.00079/frame (0.079¢), audio $0.00016/sec (0.016¢)
gemini-embedding-2-preview-gt-128k Gemini Embedding 2 Preview (multimodal) same flat rate
imagen-4.0-generate-001-lte-128k Imagen 4 Generate $0.04/image
imagen-4.0-generate-001-gt-128k Imagen 4 Generate $0.04/image
imagen-4.0-ultra-generate-001-lte-128k Imagen 4 Ultra $0.06/image
imagen-4.0-ultra-generate-001-gt-128k Imagen 4 Ultra $0.06/image
imagen-4.0-fast-generate-001-lte-128k Imagen 4 Fast $0.02/image
imagen-4.0-fast-generate-001-gt-128k Imagen 4 Fast $0.02/image
veo-2.0-generate-001-lte-128k Veo 2 $0.50/sec → 50¢/s, default 8s, 1 sample
veo-2.0-generate-001-gt-128k Veo 2 same flat rate
veo-3.0-generate-001-lte-128k Veo 3 (720p/1080p) $0.20/sec → 20¢/s, default 8s, 1 sample
veo-3.0-generate-001-gt-128k Veo 3 (720p/1080p) same flat rate
veo-3.0-fast-generate-001-lte-128k Veo 3 Fast (720p) $0.08/sec → 8¢/s, default 8s, 1 sample
veo-3.0-fast-generate-001-gt-128k Veo 3 Fast (720p) same flat rate
veo-3.1-generate-preview-lte-128k Veo 3.1 (720p/1080p) $0.20/sec → 20¢/s, default 8s, 1 sample
veo-3.1-generate-preview-gt-128k Veo 3.1 (720p/1080p) same flat rate
veo-3.1-fast-generate-preview-lte-128k Veo 3.1 Fast (720p) $0.08/sec → 8¢/s, default 8s, 1 sample
veo-3.1-fast-generate-preview-gt-128k Veo 3.1 Fast (720p) same flat rate
veo-3.1-lite-generate-preview-lte-128k Veo 3.1 Lite (720p) $0.03/sec → 3¢/s, default 8s, 1 sample
veo-3.1-lite-generate-preview-gt-128k Veo 3.1 Lite (720p) same flat rate
deep-research-pro-preview-12-2025-lte-128k Price not found on Vertex AI pricing page; added with 0s
deep-research-pro-preview-12-2025-gt-128k Price not found on Vertex AI pricing page; added with 0s

📌 Data Source

  • Pricing source: Google Cloud Vertex AI Generative AI Pricing (https://cloud.google.com/vertex-ai/generative-ai/pricing)
    • Note: ai.google.dev/gemini-api/docs/pricing was inaccessible (firecrawl credits exhausted); Vertex AI page used as authoritative fallback — same Google products, same pricing
  • Model list source: Google Gemini Models API (generativelanguage.googleapis.com/v1beta/models)
  • Context tier mapping: Vertex page uses ≤200K / >200K tiers; model IDs use -lte-128k / -gt-128k convention (lte-128k → ≤200K price; gt-128k → >200K price)
  • Thinking tokens: Bundled into "Text output (response and reasoning)" on Vertex page — no separate thinking_token field added
  • Web search rates: Gemini 3.x = $14/1K queries = 1.4¢/call; Gemini 2.5/2.0 = $35/1K prompts = 3.5¢/call
  • Latest alias resolutions: gemini-pro-latest → gemini-3.1-pro-preview | gemini-flash-latest → gemini-3-flash-preview | gemini-flash-lite-latest → gemini-3.1-flash-lite-preview

Generated by Pricing Agent on 2026-04-13

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant