GCP Gemini API Pricing (2026)

Updated Jul 5, 2026GCPAI / ML

AI API services provide access to foundation models and generative AI capabilities through simple API calls. Build intelligent applications with large language models, image generation, and multimodal AI. Gemini API provides Google's multimodal Gemini models with generous free tier for experimentation. Starting from $0.0000/hr ($0.00/mo) for GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens.

Key Features

✓Access to multiple foundation models through a single API
✓Pay-per-token pricing with no upfront commitment
✓Fine-tuning and customization options
✓Content filtering and safety guardrails
✓Low-latency inference endpoints

Common Use Cases

Chatbots & Assistants

Build conversational AI applications powered by large language models.

Content Generation

Generate text, code, and images for marketing, documentation, and creative workflows.

RAG Applications

Combine retrieval-augmented generation with your data for accurate, grounded AI responses.

On-Demand Pricing

Pay-as-you-go pricing with no upfront commitment. You are billed per hour of usage and can start or stop at any time. Hourly rates start at $0.0000/hr ($0.00/mo) for GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens.

Region

Instance	Price/hr	Price/mo
Generate_content text input token count for Gemini 3.5 Flash Flex Caching	$0.0000	$0.00
Generate_content_output_token_count_gemini_robotics_ER_1.5_short_output_text_non_thinking	$0.0000	$0.00
Generate_content_output_token_count_gemini_robotics_ER_1.5_long_output_text_batch	$0.0000	$0.00
BatchGenerateContent video input token count for Gemini 2.0 Flash	$0.0000	$0.00
BatchGenerate_content audio input token count for gemini 3 pro long	$0.0000	$0.00
Gemini ER1.6 Image Caching Input - Online Predictions	$0.0000	$0.00
Generate content output token count gemini 3 pro short text flex	$0.0000	$0.00
Imagen 4 Generation (output)	$0.0400	$29.20
Generate content input token count gemini 2.5 flash long text priority	$0.0000	$0.00
Generate content output token count gemini 3.1 flash lite preview text flex	$0.0000	$0.00
Generate_content cached text input token count for gemini 3 flash	$0.0000	$0.00
GenerateContent output token count for Gemini 1.5 Pro when input is up to 128k tokens	$0.0000	$0.00
Generate content input token count gemini 3.5 flash image batch	$0.0000	$0.00
Generate_content image output token count for Gemini 3 Pro Image	$0.0001	$0.09
Veo 3 Audio Generation (output)	$0.4000	$292.00
GenerateContent input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens	$0.0000	$0.00
Gemini ER1.6 Video Caching Input - Online Predictions	$0.0000	$0.00
GenerateContent output image token count for Gemini 2.0 Flash MMGen	$0.0000	$0.02
Gemini 3.1 Flash Image Text Output - Batch Predictions	$0.0000	$0.00
Generate content cached input token count gemini 3.1 flash lite preview image	$0.0000	$0.00
Generate_content text cached input token count for Gemini 3.1 Flash Lite	$0.0000	$0.00
Generate_content video input token count for gemini 3 pro short	$0.0000	$0.00
BatchGenerate_content text output token count for gemini 3 pro long	$0.0000	$0.01
Generate_content storage token count for Gemini 3.5 Flash Flex Caching	$0.0000	$0.00
Generate content output token count gemini 2.5 pro short text priority	$0.0000	$0.01
Gemini 3.1 Flash Image Text Input - Batch Predictionss	$0.0000	$0.00
Generate content output token count Gemini 2.5 Pro short output text	$0.0000	$0.01
GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is over 128k tokens	$0.0000	$0.00
Veo Lite Generation 1080p with Audio	$0.0800	$58.40
Generate content input token count gemini 2.5 pro short text priority	$0.0000	$0.00
Generate_content text output token count for gemini 3 pro long	$0.0000	$0.01
Generate content output token count gemini 3.1 flash lite preview text priority	$0.0000	$0.00
GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens	$0.0000	$0.00
EmbedContent input token count for gemini-embedding-2 text	$0.0000	$0.00
BatchGenerate_content text input token count for gemini 3 pro short	$0.0000	$0.00
Generate content search query gemini 2.5 paid one	$0.0350	$25.55
BatchGenerate content output token count gemini 2.5 flash short output text non-thinking	$0.0000	$0.00
BatchGenerate content input token count gemini 2.5 flash long input text	$0.0000	$0.00
BidiGenerateContent audio output token count for Gemini 2.5 Flash Native Audio Thinking	$0.0000	$0.01
Generate content input token count gemini 2.5 pro video flex	$0.0000	$0.00
Generate_content image input token count for gemini 3 flash	$0.0000	$0.00
Generate content output token count gemini 2.5 flash native image generation flex	$0.0000	$0.01
veo3_upsampler_video_generation	$0.2000	$146.00
GenerateContent output token count for Gemini 1.5 Pro when input is longer than 128k tokens	$0.0000	$0.01
Generate_content image cached input token count for Gemini 2.5 Flash Lite	$0.0000	$0.00
Generate_content image batched input token count for Gemini 3.1 Flash Lite	$0.0000	$0.00
Gemini 3.1 Flash Image Image Output - Predictions	$0.0001	$0.04
Generate_content_cached_input_token_count	$0.0000	$0.00
Generate content cached input token count gemini 2.5 flash input image	$0.0000	$0.00
Generate_content text input token count for Gemini 3.5 Flash Priority Caching	$0.0000	$0.00

Showing 50 of 200 rows

Reserved Instance & Savings Plans Pricing

Commit to 1 or 3 years for lower hourly rates.

Region

Instance	Price/hr	Price/mo	1yr RI/hr	3yr RI/hr
Generate_content text input token count for Gemini 3.5 Flash Flex Caching	$0.0000	$0.00	-	-
Generate_content_output_token_count_gemini_robotics_ER_1.5_short_output_text_non_thinking	$0.0000	$0.00	-	-
Generate_content_output_token_count_gemini_robotics_ER_1.5_long_output_text_batch	$0.0000	$0.00	-	-
BatchGenerateContent video input token count for Gemini 2.0 Flash	$0.0000	$0.00	-	-
BatchGenerate_content audio input token count for gemini 3 pro long	$0.0000	$0.00	-	-
Gemini ER1.6 Image Caching Input - Online Predictions	$0.0000	$0.00	-	-
Generate content output token count gemini 3 pro short text flex	$0.0000	$0.00	-	-
Imagen 4 Generation (output)	$0.0400	$29.20	-	-
Generate content input token count gemini 2.5 flash long text priority	$0.0000	$0.00	-	-
Generate content output token count gemini 3.1 flash lite preview text flex	$0.0000	$0.00	-	-
Generate_content cached text input token count for gemini 3 flash	$0.0000	$0.00	-	-
GenerateContent output token count for Gemini 1.5 Pro when input is up to 128k tokens	$0.0000	$0.00	-	-
Generate content input token count gemini 3.5 flash image batch	$0.0000	$0.00	-	-
Generate_content image output token count for Gemini 3 Pro Image	$0.0001	$0.09	-	-
Veo 3 Audio Generation (output)	$0.4000	$292.00	-	-
GenerateContent input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens	$0.0000	$0.00	-	-
Gemini ER1.6 Video Caching Input - Online Predictions	$0.0000	$0.00	-	-
GenerateContent output image token count for Gemini 2.0 Flash MMGen	$0.0000	$0.02	-	-
Gemini 3.1 Flash Image Text Output - Batch Predictions	$0.0000	$0.00	-	-
Generate content cached input token count gemini 3.1 flash lite preview image	$0.0000	$0.00	-	-
Generate_content text cached input token count for Gemini 3.1 Flash Lite	$0.0000	$0.00	-	-
Generate_content video input token count for gemini 3 pro short	$0.0000	$0.00	-	-
BatchGenerate_content text output token count for gemini 3 pro long	$0.0000	$0.01	-	-
Generate_content storage token count for Gemini 3.5 Flash Flex Caching	$0.0000	$0.00	-	-
Generate content output token count gemini 2.5 pro short text priority	$0.0000	$0.01	-	-
Gemini 3.1 Flash Image Text Input - Batch Predictionss	$0.0000	$0.00	-	-
Generate content output token count Gemini 2.5 Pro short output text	$0.0000	$0.01	-	-
GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is over 128k tokens	$0.0000	$0.00	-	-
Veo Lite Generation 1080p with Audio	$0.0800	$58.40	-	-
Generate content input token count gemini 2.5 pro short text priority	$0.0000	$0.00	-	-
Generate_content text output token count for gemini 3 pro long	$0.0000	$0.01	-	-
Generate content output token count gemini 3.1 flash lite preview text priority	$0.0000	$0.00	-	-
GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens	$0.0000	$0.00	-	-
EmbedContent input token count for gemini-embedding-2 text	$0.0000	$0.00	-	-
BatchGenerate_content text input token count for gemini 3 pro short	$0.0000	$0.00	-	-
Generate content search query gemini 2.5 paid one	$0.0350	$25.55	-	-
BatchGenerate content output token count gemini 2.5 flash short output text non-thinking	$0.0000	$0.00	-	-
BatchGenerate content input token count gemini 2.5 flash long input text	$0.0000	$0.00	-	-
BidiGenerateContent audio output token count for Gemini 2.5 Flash Native Audio Thinking	$0.0000	$0.01	-	-
Generate content input token count gemini 2.5 pro video flex	$0.0000	$0.00	-	-
Generate_content image input token count for gemini 3 flash	$0.0000	$0.00	-	-
Generate content output token count gemini 2.5 flash native image generation flex	$0.0000	$0.01	-	-
veo3_upsampler_video_generation	$0.2000	$146.00	-	-
GenerateContent output token count for Gemini 1.5 Pro when input is longer than 128k tokens	$0.0000	$0.01	-	-
Generate_content image cached input token count for Gemini 2.5 Flash Lite	$0.0000	$0.00	-	-
Generate_content image batched input token count for Gemini 3.1 Flash Lite	$0.0000	$0.00	-	-
Gemini 3.1 Flash Image Image Output - Predictions	$0.0001	$0.04	-	-
Generate_content_cached_input_token_count	$0.0000	$0.00	-	-
Generate content cached input token count gemini 2.5 flash input image	$0.0000	$0.00	-	-
Generate_content text input token count for Gemini 3.5 Flash Priority Caching	$0.0000	$0.00	-	-

Showing 50 of 200 rows

How Gemini API Pricing Works

On-Demand

Pay per hour with no long-term commitment. Ideal for variable workloads and development environments.

Reserved / Committed Use

Commit to 1 or 3 years for significant discounts.

Spot / Preemptible

Use spare capacity at steep discounts. Best for fault-tolerant, batch, and stateless workloads.

Monthly Cost Examples

Small Workload

GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens

0 vCPU, 0 GB RAM

$0.00/mo

Medium Workload

Generate content cached input token count gemini 3.1 flash lite preview video

0 vCPU, 0 GB RAM

$0.00/mo

Large Workload

Veo 3 Audio Generation (output)

0 vCPU, 0 GB RAM

$292.00/mo

GCP Gemini API Free Tier

GCPFree tierAlways Free

Free limit: 15 RPM / 1M TPM (Gemini Flash). This free tier never expires and renews monthly.

Frequently Asked Questions

What is GCP Gemini API?

GCP Gemini API is a cloud service offered by Google Cloud Platform. It provides various configurations (200 pricing tiers available) with pay-as-you-go and committed-use pricing options.

How much does GCP Gemini API cost per month?

Prices range from $0.00/month for GenerateContent cached input token count for Gemini 1.5 Flash 8B when input is up to 128k tokens to $292.00/month for Veo 3 Audio Generation (output) on On-Demand pricing in us-east1.

Does GCP Gemini API have a free tier?

GCP offers various free tier options. Check the official GCP pricing page for the most current free tier details for Gemini API.

How many GCP Gemini API pricing tiers are available?

There are 200 pricing tiers available for GCP Gemini API. These range from entry-level configurations to high-performance options for enterprise workloads.

What pricing models does GCP Gemini API offer?

GCP Gemini API offers On-Demand (pay-per-hour, no commitment), Reserved/Committed Use (1-3 year commitments for significant discounts), and in some cases Spot/Preemptible pricing for interruptible workloads at the lowest cost.

How is GPU instance pricing structured?

GPU instances are priced per hour and vary significantly by GPU model (T4, A10G, A100, H100). On-demand rates are highest; spot/preemptible instances can cut costs 60-90% for fault-tolerant training jobs. Some providers also offer per-second billing and committed-use discounts for GPUs.