GCP Gemini API Pricing (2026)
AI API services provide access to foundation models and generative AI capabilities through simple API calls. Build intelligent applications with large language models, image generation, and multimodal AI. Gemini API provides Google's multimodal Gemini models with generous free tier for experimentation. Starting from $0.0000/hr ($0.00/mo) for GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens.
Key Features
- ✓Access to multiple foundation models through a single API
- ✓Pay-per-token pricing with no upfront commitment
- ✓Fine-tuning and customization options
- ✓Content filtering and safety guardrails
- ✓Low-latency inference endpoints
Common Use Cases
Chatbots & Assistants
Build conversational AI applications powered by large language models.
Content Generation
Generate text, code, and images for marketing, documentation, and creative workflows.
RAG Applications
Combine retrieval-augmented generation with your data for accurate, grounded AI responses.
On-Demand Pricing
Pay-as-you-go pricing with no upfront commitment. You are billed per hour of usage and can start or stop at any time. Hourly rates start at $0.0000/hr ($0.00/mo) for GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens.
| Instance | Price/hr | Price/mo |
|---|---|---|
| Generate_content_output_token_count_gemini_robotics_ER_1.5_short_output_text_non_thinking | $0.0000 | $0.00 |
| Generate_content_output_token_count_gemini_robotics_ER_1.5_long_output_text_batch | $0.0000 | $0.00 |
| BatchGenerateContent video input token count for Gemini 2.0 Flash | $0.0000 | $0.00 |
| BatchGenerate_content audio input token count for gemini 3 pro long | $0.0000 | $0.00 |
| Generate content output token count gemini 3 pro short text flex | $0.0000 | $0.00 |
| Imagen 4 Generation (output) | $0.0400 | $29.20 |
| Generate content input token count gemini 2.5 flash long text priority | $0.0000 | $0.00 |
| Generate content output token count gemini 3.1 flash lite preview text flex | $0.0000 | $0.00 |
| Generate_content cached text input token count for gemini 3 flash | $0.0000 | $0.00 |
| GenerateContent output token count for Gemini 1.5 Pro when input is up to 128k tokens | $0.0000 | $0.00 |
| Generate_content image output token count for Gemini 3 Pro Image | $0.0001 | $0.09 |
| Veo 3 Audio Generation (output) | $0.4000 | $292.00 |
| GenerateContent input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens | $0.0000 | $0.00 |
| GenerateContent output image token count for Gemini 2.0 Flash MMGen | $0.0000 | $0.02 |
| Gemini 3.1 Flash Image Text Output - Batch Predictions | $0.0000 | $0.00 |
| Generate content cached input token count gemini 3.1 flash lite preview image | $0.0000 | $0.00 |
| Generate_content video input token count for gemini 3 pro short | $0.0000 | $0.00 |
| BatchGenerate_content text output token count for gemini 3 pro long | $0.0000 | $0.01 |
| Generate content output token count gemini 2.5 pro short text priority | $0.0000 | $0.01 |
| Gemini 3.1 Flash Image Text Input - Batch Predictionss | $0.0000 | $0.00 |
| Generate content output token count Gemini 2.5 Pro short output text | $0.0000 | $0.01 |
| GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is over 128k tokens | $0.0000 | $0.00 |
| Veo Lite Generation 1080p with Audio | $0.0800 | $58.40 |
| Generate content input token count gemini 2.5 pro short text priority | $0.0000 | $0.00 |
| Generate_content text output token count for gemini 3 pro long | $0.0000 | $0.01 |
| Generate content output token count gemini 3.1 flash lite preview text priority | $0.0000 | $0.00 |
| GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens | $0.0000 | $0.00 |
| EmbedContent input token count for gemini-embedding-2 text | $0.0000 | $0.00 |
| BatchGenerate_content text input token count for gemini 3 pro short | $0.0000 | $0.00 |
| BatchGenerate content output token count gemini 2.5 flash short output text non-thinking | $0.0000 | $0.00 |
| BatchGenerate content input token count gemini 2.5 flash long input text | $0.0000 | $0.00 |
| BidiGenerateContent audio output token count for Gemini 2.5 Flash Native Audio Thinking | $0.0000 | $0.01 |
| Generate content input token count gemini 2.5 pro video flex | $0.0000 | $0.00 |
| Generate_content image input token count for gemini 3 flash | $0.0000 | $0.00 |
| Generate content output token count gemini 2.5 flash native image generation flex | $0.0000 | $0.01 |
| veo3_upsampler_video_generation | $0.2000 | $146.00 |
| GenerateContent output token count for Gemini 1.5 Pro when input is longer than 128k tokens | $0.0000 | $0.01 |
| Generate_content image cached input token count for Gemini 2.5 Flash Lite | $0.0000 | $0.00 |
| Gemini 3.1 Flash Image Image Output - Predictions | $0.0001 | $0.04 |
| Generate content cached input token count gemini 2.5 flash input image | $0.0000 | $0.00 |
| Veo Fast Generation 4k with Audio | $0.3000 | $219.00 |
| Generate content output token count gemini 2.5 flash lite long text flex | $0.0000 | $0.00 |
| GenerateContent audio input token count for Gemini 2.0 Flash Lite | $0.0000 | $0.00 |
| Generate content input token count gemini 2.5 flash lite image priority | $0.0000 | $0.00 |
| Number of audio tokens of cached content for Gemini 2.0 Flash over a period of time expressed | $0.0000 | $0.00 |
| Generate content input token count Gemini 2.5 Pro input image | $0.0000 | $0.00 |
| Bidi generate content input token count gemini 3 flash live video | $0.0000 | $0.00 |
| Veo Generation 1080p with Audio | $0.4000 | $292.00 |
| Generate content input token count gemini 2.5 flash video flex | $0.0000 | $0.00 |
| Generate content input token count gemini 3.1 flash lite preview audio batch | $0.0000 | $0.00 |
Reserved Instance & Savings Plans Pricing
Commit to 1 or 3 years for lower hourly rates.
| Instance | Price/hr | Price/mo | 1yr RI/hr | 3yr RI/hr |
|---|---|---|---|---|
| Generate_content_output_token_count_gemini_robotics_ER_1.5_short_output_text_non_thinking | $0.0000 | $0.00 | - | - |
| Generate_content_output_token_count_gemini_robotics_ER_1.5_long_output_text_batch | $0.0000 | $0.00 | - | - |
| BatchGenerateContent video input token count for Gemini 2.0 Flash | $0.0000 | $0.00 | - | - |
| BatchGenerate_content audio input token count for gemini 3 pro long | $0.0000 | $0.00 | - | - |
| Generate content output token count gemini 3 pro short text flex | $0.0000 | $0.00 | - | - |
| Imagen 4 Generation (output) | $0.0400 | $29.20 | - | - |
| Generate content input token count gemini 2.5 flash long text priority | $0.0000 | $0.00 | - | - |
| Generate content output token count gemini 3.1 flash lite preview text flex | $0.0000 | $0.00 | - | - |
| Generate_content cached text input token count for gemini 3 flash | $0.0000 | $0.00 | - | - |
| GenerateContent output token count for Gemini 1.5 Pro when input is up to 128k tokens | $0.0000 | $0.00 | - | - |
| Generate_content image output token count for Gemini 3 Pro Image | $0.0001 | $0.09 | - | - |
| Veo 3 Audio Generation (output) | $0.4000 | $292.00 | - | - |
| GenerateContent input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens | $0.0000 | $0.00 | - | - |
| GenerateContent output image token count for Gemini 2.0 Flash MMGen | $0.0000 | $0.02 | - | - |
| Gemini 3.1 Flash Image Text Output - Batch Predictions | $0.0000 | $0.00 | - | - |
| Generate content cached input token count gemini 3.1 flash lite preview image | $0.0000 | $0.00 | - | - |
| Generate_content video input token count for gemini 3 pro short | $0.0000 | $0.00 | - | - |
| BatchGenerate_content text output token count for gemini 3 pro long | $0.0000 | $0.01 | - | - |
| Generate content output token count gemini 2.5 pro short text priority | $0.0000 | $0.01 | - | - |
| Gemini 3.1 Flash Image Text Input - Batch Predictionss | $0.0000 | $0.00 | - | - |
| Generate content output token count Gemini 2.5 Pro short output text | $0.0000 | $0.01 | - | - |
| GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is over 128k tokens | $0.0000 | $0.00 | - | - |
| Veo Lite Generation 1080p with Audio | $0.0800 | $58.40 | - | - |
| Generate content input token count gemini 2.5 pro short text priority | $0.0000 | $0.00 | - | - |
| Generate_content text output token count for gemini 3 pro long | $0.0000 | $0.01 | - | - |
| Generate content output token count gemini 3.1 flash lite preview text priority | $0.0000 | $0.00 | - | - |
| GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens | $0.0000 | $0.00 | - | - |
| EmbedContent input token count for gemini-embedding-2 text | $0.0000 | $0.00 | - | - |
| BatchGenerate_content text input token count for gemini 3 pro short | $0.0000 | $0.00 | - | - |
| BatchGenerate content output token count gemini 2.5 flash short output text non-thinking | $0.0000 | $0.00 | - | - |
| BatchGenerate content input token count gemini 2.5 flash long input text | $0.0000 | $0.00 | - | - |
| BidiGenerateContent audio output token count for Gemini 2.5 Flash Native Audio Thinking | $0.0000 | $0.01 | - | - |
| Generate content input token count gemini 2.5 pro video flex | $0.0000 | $0.00 | - | - |
| Generate_content image input token count for gemini 3 flash | $0.0000 | $0.00 | - | - |
| Generate content output token count gemini 2.5 flash native image generation flex | $0.0000 | $0.01 | - | - |
| veo3_upsampler_video_generation | $0.2000 | $146.00 | - | - |
| GenerateContent output token count for Gemini 1.5 Pro when input is longer than 128k tokens | $0.0000 | $0.01 | - | - |
| Generate_content image cached input token count for Gemini 2.5 Flash Lite | $0.0000 | $0.00 | - | - |
| Gemini 3.1 Flash Image Image Output - Predictions | $0.0001 | $0.04 | - | - |
| Generate content cached input token count gemini 2.5 flash input image | $0.0000 | $0.00 | - | - |
| Veo Fast Generation 4k with Audio | $0.3000 | $219.00 | - | - |
| Generate content output token count gemini 2.5 flash lite long text flex | $0.0000 | $0.00 | - | - |
| GenerateContent audio input token count for Gemini 2.0 Flash Lite | $0.0000 | $0.00 | - | - |
| Generate content input token count gemini 2.5 flash lite image priority | $0.0000 | $0.00 | - | - |
| Number of audio tokens of cached content for Gemini 2.0 Flash over a period of time expressed | $0.0000 | $0.00 | - | - |
| Generate content input token count Gemini 2.5 Pro input image | $0.0000 | $0.00 | - | - |
| Bidi generate content input token count gemini 3 flash live video | $0.0000 | $0.00 | - | - |
| Veo Generation 1080p with Audio | $0.4000 | $292.00 | - | - |
| Generate content input token count gemini 2.5 flash video flex | $0.0000 | $0.00 | - | - |
| Generate content input token count gemini 3.1 flash lite preview audio batch | $0.0000 | $0.00 | - | - |
How Gemini API Pricing Works
On-Demand
Pay per hour with no long-term commitment. Ideal for variable workloads and development environments.
Reserved / Committed Use
Commit to 1 or 3 years for significant discounts.
Spot / Preemptible
Use spare capacity at steep discounts. Best for fault-tolerant, batch, and stateless workloads.
Monthly Cost Examples
GCP Gemini API Free Tier
Free limit: 15 RPM / 1M TPM (Gemini Flash). This free tier never expires and renews monthly.
Frequently Asked Questions
What is GCP Gemini API?
GCP Gemini API is a cloud service offered by Google Cloud Platform. It provides various configurations (200 pricing tiers available) with pay-as-you-go and committed-use pricing options.
How much does GCP Gemini API cost per month?
Prices range from $0.00/month for GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens to $438.00/month for Veo Generation 4k with Audio on On-Demand pricing in us-east1.
Does GCP Gemini API have a free tier?
GCP offers various free tier options. Check the official GCP pricing page for the most current free tier details for Gemini API.
How many GCP Gemini API pricing tiers are available?
There are 200 pricing tiers available for GCP Gemini API. These range from entry-level configurations to high-performance options for enterprise workloads.
What pricing models does GCP Gemini API offer?
GCP Gemini API offers On-Demand (pay-per-hour, no commitment), Reserved/Committed Use (1-3 year commitments for significant discounts), and in some cases Spot/Preemptible pricing for interruptible workloads at the lowest cost.
How is GPU instance pricing structured?
GPU instances are priced per hour and vary significantly by GPU model (T4, A10G, A100, H100). On-demand rates are highest; spot/preemptible instances can cut costs 60-90% for fault-tolerant training jobs. Some providers also offer per-second billing and committed-use discounts for GPUs.