CloudPriceCheck

Azure OpenAI Service Pricing (2026)

Updated Apr 6, 2026AzureAI / ML

AI API services provide access to foundation models and generative AI capabilities through simple API calls. Build intelligent applications with large language models, image generation, and multimodal AI. OpenAI Service offers GPT-4, DALL-E, and Whisper models with enterprise security and compliance. Starting from $0.0000/hr ($0.01/mo) for text-embedding-3-small-glbl - text-embedding-3-small-glbl Tokens.

Key Features

  • Access to multiple foundation models through a single API
  • Pay-per-token pricing with no upfront commitment
  • Fine-tuning and customization options
  • Content filtering and safety guardrails
  • Low-latency inference endpoints

Common Use Cases

Chatbots & Assistants

Build conversational AI applications powered by large language models.

Content Generation

Generate text, code, and images for marketing, documentation, and creative workflows.

RAG Applications

Combine retrieval-augmented generation with your data for accurate, grounded AI responses.

On-Demand Pricing

Pay-as-you-go pricing with no upfront commitment. You are billed per hour of usage and can start or stop at any time. Hourly rates start at $0.0000/hr ($0.01/mo) for text-embedding-3-small-glbl - text-embedding-3-small-glbl Tokens.

InstancevCPUMemoryPrice/hrPrice/mo
gpt 4.1 Inp regnl - gpt 4.1 Inp regnl Tokens416 GB$0.0022$1.61
Code-Interpreter-global - Code-Interpreter-global Session--$0.0300$21.90
gpt-4-8K-Batch-Outp-glbl - gpt-4-8K-Batch-Outp-glbl Tokens416 GB$0.0300$21.90
54 nano Batch cd Inp Gl - 5.4 nano Batch cd Inp Gl 1M Tokens54216 GB$0.0100$7.30
gpt-4o-rt-txt-0603 cchd Inp DZn - gpt-4o-rt-txt-0603 cchd Inp DZn Tokens416 GB$0.0027$2.01
gpt 4o 0513 Input Data Zone - gpt 4o 0513 Input Data Zone Tokens416 GB$0.0055$4.01
gpt 4o 0806 cached Inp glbl - gpt 4o 0806 cached Inp glbl Tokens416 GB$0.0013$0.91
gpt aud mini txt Inp DZone - gpt aud mini txt Inp DZone Tokens--$0.0007$0.48
gpt 4.1 nano Inp Data Zone - gpt 4.1 nano Inp Data Zone Tokens416 GB$0.0001$0.08
GPT 5 Nano Batch Inpt cchd Dzone - GPT 5 Nano Batch Inpt cchd Dzone 1M Tokens520 GB$0.0027$2.01
o3-pro Inp glbl - o3-pro Inp glbl Tokens312 GB$0.0200$14.60
o3 mini 0131 output glbl - o3 mini 0131 output glbl Tokens312 GB$0.0044$3.21
Phi-3-Medium-128K-Instruct-Finetuned - Phi-3-Medium-128K-Instruct-Finetuned Deployment Hosting Unit312 GB$0.8000$584.00
gpt-4.1-mini-dev-ft cchd inpt glbl - gpt-4.1-mini-dev-ft cchd inpt glbl Tokens416 GB$0.0001$0.07
GPT 51 chat inp Dz - GPT 5.1 chat inp Dz 1M Tokens51204 GB$1.3750$1,003.75
Phi-3.5-Mini-128K-Instruct-Output - Phi-3.5-Mini-128K-Instruct-Output Tokens312 GB$0.0005$0.38
GPT 5 Batch Inpt cchd Dzone - GPT 5 Batch Inpt cchd Dzone 1M Tokens520 GB$0.0688$50.19
GPT 51 chat cd inp Dz - GPT 5.1 chat cd inp Dz 1M Tokens51204 GB$0.1375$100.38
GPT 5.2 pro Batch inp Gl - GPT 5.2 pro Batch inp Gl 1M Tokens520 GB$10.5000$7,665.00
Phi-3-Medium-4K-Instruct-Output - Phi-3-Medium-4K-Instruct-Output Tokens312 GB$0.0007$0.50
GPT 5 Mini Batch outpt Glbl - GPT 5 Mini Batch outpt Glbl 1M Tokens520 GB$1.0000$730.00
53 codex opt Dz - 5.3 codex opt Dz 1M Tokens53212 GB$15.4000$11,242.00
gpt rt 15 txt opt Gl - gpt rt 1.5 txt opt Gl 1M Tokens1560 GB$16.0000$11,680.00
o3-deep research 0626-inp-cchd-glbl - o3-deep research 0626-inp-cchd-glbl 1M Tokens312 GB$2.5000$1,825.00
gpt-4.1-nano-ft hosting regional - gpt-4.1-nano-ft hosting regional Unit416 GB$1.7000$1,241.00
o1 model ft grader cched input - o1 model ft grader cched input Tokens14 GB$0.0083$6.02
o3-deep research 0626-inp-dzone - o3-deep research 0626-inp-dzone 1M Tokens312 GB$11.0000$8,030.00
Codestral Inp glbl - Codestral Inp glbl Tokens--$0.0003$0.22
Code Fast 1 Outp glbl - Code Fast 1 Outp glbl Tokens14 GB$0.0015$1.09
54 pro longco opt Gl - 5.4 pro longco opt Gl 1M Tokens54216 GB$270.0000$197,100.00
gpt img 1.5 in img DZ - gpt img 1.5 in img DZ 1M Tokens14 GB$8.8000$6,424.00
Phi-3-Mini-128K-Instruct-Finetuned - Phi-3-Mini-128K-Instruct-Finetuned Tokens312 GB$0.0030$2.19
gpt-4.1-dev-ft inpt glbl - gpt-4.1-dev-ft inpt glbl Tokens416 GB$0.0020$1.46
OSS-20b FT - OSS-20b FT Tokens2080 GB$0.0036$2.63
gpt-4.1-nano-ft output regional - gpt-4.1-nano-ft output regional Tokens416 GB$0.0004$0.32
gpt 4o 0513 Batch Inp glbl - gpt 4o 0513 Batch Inp glbl Tokens416 GB$0.0025$1.82
R1 Inp regnl - R1 Inp regnl Tokens14 GB$0.0015$1.08
Llama 4 Maverick 17B Outp regnl - Llama 4 Maverick 17B Outp regnl Tokens416 GB$0.0011$0.80
Qwen3 32B FT - Qwen3 32B FT Tokens312 GB$0.0032$2.34
o3-ft mdl grdr inpt - o3-ft mdl grdr inpt Tokens312 GB$0.0022$1.61
gpt4o realtimePrvwTxtInp DataZone - gpt4o realtimePrvwTxtInp DataZone Tokens416 GB$0.0055$4.01
Phi-3-Medium-4K-Instruct-Input - Phi-3-Medium-4K-Instruct-Input Tokens312 GB$0.0002$0.12
54 Batch cd inp Gl - 5.4 Batch cd inp Gl 1M Tokens54216 GB$0.1300$94.90
gpt rt img mn cchd in gl 1215 - gpt rt img mn cchd in gl 1215 1M Tokens12154860 GB$0.0800$58.40
54 pro Batch inp Gl - 5.4 pro Batch inp Gl 1M Tokens54216 GB$15.0000$10,950.00
gpt rt txt 0828 Outp glbl - gpt rt txt 0828 Outp glbl Tokens8283312 GB$0.0160$11.68
gpt 4o 0513 Input global - gpt 4o 0513 Input global Tokens416 GB$0.0050$3.65
gpt-4.1-mini-ft output global - gpt-4.1-mini-ft output global Tokens416 GB$0.0016$1.17
gpt 4o 0806 Inp Data Zone - gpt 4o 0806 Inp Data Zone Tokens416 GB$0.0027$2.01
gpt-4o-rt-aud-0603 Outp DZone - gpt-4o-rt-aud-0603 Outp DZone Tokens416 GB$0.0880$64.24
Showing 50 of 200 rows

Reserved Instance & Savings Plans Pricing

Commit to 1 or 3 years for lower hourly rates.

InstancevCPUMemoryPrice/hrPrice/mo1yr RI/hr3yr RI/hr
gpt 4.1 Inp regnl - gpt 4.1 Inp regnl Tokens416 GB$0.0022$1.61--
Code-Interpreter-global - Code-Interpreter-global Session--$0.0300$21.90--
gpt-4-8K-Batch-Outp-glbl - gpt-4-8K-Batch-Outp-glbl Tokens416 GB$0.0300$21.90--
54 nano Batch cd Inp Gl - 5.4 nano Batch cd Inp Gl 1M Tokens54216 GB$0.0100$7.30--
gpt-4o-rt-txt-0603 cchd Inp DZn - gpt-4o-rt-txt-0603 cchd Inp DZn Tokens416 GB$0.0027$2.01--
gpt 4o 0513 Input Data Zone - gpt 4o 0513 Input Data Zone Tokens416 GB$0.0055$4.01--
gpt 4o 0806 cached Inp glbl - gpt 4o 0806 cached Inp glbl Tokens416 GB$0.0013$0.91--
gpt aud mini txt Inp DZone - gpt aud mini txt Inp DZone Tokens--$0.0007$0.48--
gpt 4.1 nano Inp Data Zone - gpt 4.1 nano Inp Data Zone Tokens416 GB$0.0001$0.08--
GPT 5 Nano Batch Inpt cchd Dzone - GPT 5 Nano Batch Inpt cchd Dzone 1M Tokens520 GB$0.0027$2.01--
o3-pro Inp glbl - o3-pro Inp glbl Tokens312 GB$0.0200$14.60--
o3 mini 0131 output glbl - o3 mini 0131 output glbl Tokens312 GB$0.0044$3.21--
Phi-3-Medium-128K-Instruct-Finetuned - Phi-3-Medium-128K-Instruct-Finetuned Deployment Hosting Unit312 GB$0.8000$584.00--
gpt-4.1-mini-dev-ft cchd inpt glbl - gpt-4.1-mini-dev-ft cchd inpt glbl Tokens416 GB$0.0001$0.07--
GPT 51 chat inp Dz - GPT 5.1 chat inp Dz 1M Tokens51204 GB$1.3750$1,003.75--
Phi-3.5-Mini-128K-Instruct-Output - Phi-3.5-Mini-128K-Instruct-Output Tokens312 GB$0.0005$0.38--
GPT 5 Batch Inpt cchd Dzone - GPT 5 Batch Inpt cchd Dzone 1M Tokens520 GB$0.0688$50.19--
GPT 51 chat cd inp Dz - GPT 5.1 chat cd inp Dz 1M Tokens51204 GB$0.1375$100.38--
GPT 5.2 pro Batch inp Gl - GPT 5.2 pro Batch inp Gl 1M Tokens520 GB$10.5000$7,665.00--
Phi-3-Medium-4K-Instruct-Output - Phi-3-Medium-4K-Instruct-Output Tokens312 GB$0.0007$0.50--
GPT 5 Mini Batch outpt Glbl - GPT 5 Mini Batch outpt Glbl 1M Tokens520 GB$1.0000$730.00--
53 codex opt Dz - 5.3 codex opt Dz 1M Tokens53212 GB$15.4000$11,242.00--
gpt rt 15 txt opt Gl - gpt rt 1.5 txt opt Gl 1M Tokens1560 GB$16.0000$11,680.00--
o3-deep research 0626-inp-cchd-glbl - o3-deep research 0626-inp-cchd-glbl 1M Tokens312 GB$2.5000$1,825.00--
gpt-4.1-nano-ft hosting regional - gpt-4.1-nano-ft hosting regional Unit416 GB$1.7000$1,241.00--
o1 model ft grader cched input - o1 model ft grader cched input Tokens14 GB$0.0083$6.02--
o3-deep research 0626-inp-dzone - o3-deep research 0626-inp-dzone 1M Tokens312 GB$11.0000$8,030.00--
Codestral Inp glbl - Codestral Inp glbl Tokens--$0.0003$0.22--
Code Fast 1 Outp glbl - Code Fast 1 Outp glbl Tokens14 GB$0.0015$1.09--
54 pro longco opt Gl - 5.4 pro longco opt Gl 1M Tokens54216 GB$270.0000$197,100.00--
gpt img 1.5 in img DZ - gpt img 1.5 in img DZ 1M Tokens14 GB$8.8000$6,424.00--
Phi-3-Mini-128K-Instruct-Finetuned - Phi-3-Mini-128K-Instruct-Finetuned Tokens312 GB$0.0030$2.19--
gpt-4.1-dev-ft inpt glbl - gpt-4.1-dev-ft inpt glbl Tokens416 GB$0.0020$1.46--
OSS-20b FT - OSS-20b FT Tokens2080 GB$0.0036$2.63--
gpt-4.1-nano-ft output regional - gpt-4.1-nano-ft output regional Tokens416 GB$0.0004$0.32--
gpt 4o 0513 Batch Inp glbl - gpt 4o 0513 Batch Inp glbl Tokens416 GB$0.0025$1.82--
R1 Inp regnl - R1 Inp regnl Tokens14 GB$0.0015$1.08--
Llama 4 Maverick 17B Outp regnl - Llama 4 Maverick 17B Outp regnl Tokens416 GB$0.0011$0.80--
Qwen3 32B FT - Qwen3 32B FT Tokens312 GB$0.0032$2.34--
o3-ft mdl grdr inpt - o3-ft mdl grdr inpt Tokens312 GB$0.0022$1.61--
gpt4o realtimePrvwTxtInp DataZone - gpt4o realtimePrvwTxtInp DataZone Tokens416 GB$0.0055$4.01--
Phi-3-Medium-4K-Instruct-Input - Phi-3-Medium-4K-Instruct-Input Tokens312 GB$0.0002$0.12--
54 Batch cd inp Gl - 5.4 Batch cd inp Gl 1M Tokens54216 GB$0.1300$94.90--
gpt rt img mn cchd in gl 1215 - gpt rt img mn cchd in gl 1215 1M Tokens12154860 GB$0.0800$58.40--
54 pro Batch inp Gl - 5.4 pro Batch inp Gl 1M Tokens54216 GB$15.0000$10,950.00--
gpt rt txt 0828 Outp glbl - gpt rt txt 0828 Outp glbl Tokens8283312 GB$0.0160$11.68--
gpt 4o 0513 Input global - gpt 4o 0513 Input global Tokens416 GB$0.0050$3.65--
gpt-4.1-mini-ft output global - gpt-4.1-mini-ft output global Tokens416 GB$0.0016$1.17--
gpt 4o 0806 Inp Data Zone - gpt 4o 0806 Inp Data Zone Tokens416 GB$0.0027$2.01--
gpt-4o-rt-aud-0603 Outp DZone - gpt-4o-rt-aud-0603 Outp DZone Tokens416 GB$0.0880$64.24--
Showing 50 of 200 rows

How OpenAI Service Pricing Works

On-Demand

Pay per hour with no long-term commitment. Ideal for variable workloads and development environments.

Reserved / Committed Use

Commit to 1 or 3 years for significant discounts.

Spot / Preemptible

Use spare capacity at steep discounts. Best for fault-tolerant, batch, and stateless workloads.

Monthly Cost Examples

Small Workload
text-embedding-3-small-glbl - text-embedding-3-small-glbl Tokens
3 vCPU, 12 GB RAM
$0.01/mo
Medium Workload
R1 Inp glbl - R1 Inp glbl Tokens
1 vCPU, 4 GB RAM
$0.99/mo
Large Workload
54 pro longco opt Dz - 5.4 pro longco opt Dz 1M Tokens
54 vCPU, 216 GB RAM
$216,810.00/mo

Frequently Asked Questions

What is Azure OpenAI Service?

Azure OpenAI Service is a cloud service offered by Microsoft Azure. It provides various configurations (200 pricing tiers available) with pay-as-you-go and committed-use pricing options.

How much does Azure OpenAI Service cost per month?

Prices range from $0.01/month for text-embedding-3-small-glbl - text-embedding-3-small-glbl Tokens to $216,810.00/month for 54 pro longco opt Dz - 5.4 pro longco opt Dz 1M Tokens on On-Demand pricing in eastus.

Does Azure OpenAI Service have a free tier?

Azure offers various free tier options. Check the official Azure pricing page for the most current free tier details for OpenAI Service.

How many Azure OpenAI Service pricing tiers are available?

There are 200 pricing tiers available for Azure OpenAI Service. These range from entry-level configurations to high-performance options for enterprise workloads.

What pricing models does Azure OpenAI Service offer?

Azure OpenAI Service offers On-Demand (pay-per-hour, no commitment), Reserved/Committed Use (1-3 year commitments for significant discounts), and in some cases Spot/Preemptible pricing for interruptible workloads at the lowest cost.

How is GPU instance pricing structured?

GPU instances are priced per hour and vary significantly by GPU model (T4, A10G, A100, H100). On-demand rates are highest; spot/preemptible instances can cut costs 60-90% for fault-tolerant training jobs. Some providers also offer per-second billing and committed-use discounts for GPUs.

Compare with Other Providers

Related Azure Services