API Pricing

Flagship models

Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.

GPT-5.5

A new class of intelligence for coding and professional work.

Price

Input: $5.00 / 1M tokens
Cached input: $0.50 / 1M tokens
Output: $30.00 / 1M tokens

GPT-5.4

A more affordable model for coding and professional work.

Price

Input: $2.50 / 1M tokens
Cached input: $0.25 / 1M tokens
Output: $15.00 / 1M tokens

GPT-5.4 mini

Our strongest mini model yet for coding, computer use, and subagents.

Price

Input: $0.75 / 1M tokens
Cached input: $0.075 / 1M tokens
Output: $4.50 / 1M tokens

Pricing above reflects standard processing rates for context lengths under 270K tokens.
Learn more about Batch Processing and Data residency & Regional Processing.
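
As a rough illustration of how the per-token rates above combine into a request cost, here is a minimal sketch. The rates are copied from the standard prices listed above; the function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes cached input tokens are a subset of the input tokens and are billed at the cached rate instead of the regular input rate.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the standard-rate prices listed above.
PRICES = {
    "GPT-5.5": {"input": Decimal("5.00"), "cached_input": Decimal("0.50"), "output": Decimal("30.00")},
    "GPT-5.4": {"input": Decimal("2.50"), "cached_input": Decimal("0.25"), "output": Decimal("15.00")},
    "GPT-5.4 mini": {"input": Decimal("0.75"), "cached_input": Decimal("0.075"), "output": Decimal("4.50")},
}
PER_MILLION = Decimal(1_000_000)


def estimate_cost(model, input_tokens, output_tokens, cached_input_tokens=0):
    """Estimate the USD cost of one request at standard rates.

    Assumption (not stated on the page): cached_input_tokens is the part of
    input_tokens served from cache, billed at the cached rate only.
    """
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")
    rates = PRICES[model]
    uncached_tokens = input_tokens - cached_input_tokens
    total = (
        uncached_tokens * rates["input"]
        + cached_input_tokens * rates["cached_input"]
        + output_tokens * rates["output"]
    )
    return total / PER_MILLION


if __name__ == "__main__":
    # 100K input tokens (40K of them cached) and 10K output tokens on GPT-5.4.
    print(estimate_cost("GPT-5.4", 100_000, 10_000, cached_input_tokens=40_000))
```

Under these assumptions the example request costs $0.31: $0.15 for uncached input, $0.01 for cached input, and $0.15 for output. `Decimal` is used so that small per-token amounts are not distorted by binary floating point.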

Multimodal models

Power applications across text, image, and audio with models built for real-time interaction and rich media generation.

GPT-Realtime-2

Our most capable model for realtime voice interactions.

Price

Audio: $32.00 / 1M tokens for inputs, $0.40 / 1M tokens for cached inputs, $64.00 / 1M tokens for outputs
Text: $4.00 / 1M tokens for inputs, $0.40 / 1M tokens for cached inputs, $24.00 / 1M tokens for outputs
Image: $5.00 / 1M tokens for inputs, $0.50 / 1M tokens for cached inputs

GPT-Realtime-Translate

A new live translation model that translates speech in real time and keeps pace with the speaker.

Price

$0.034 per minute / $0.00057 per second

GPT-Realtime-Whisper

A new streaming speech-to-text model that transcribes speech live as the speaker talks.

Price

$0.017 per minute / $0.00028 per second
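
The per-minute and per-second figures for the two duration-priced models can be cross-checked with a short sketch. The helper name `audio_cost` is hypothetical, and the sketch assumes billing is linear in duration with no minimum charge or rounding, which the page does not state.

```python
from decimal import Decimal

# USD per minute of audio, copied from the prices listed above.
PER_MINUTE_USD = {
    "GPT-Realtime-Translate": Decimal("0.034"),
    "GPT-Realtime-Whisper": Decimal("0.017"),
}


def audio_cost(model, seconds):
    """Estimate the USD cost of `seconds` of audio.

    Assumption: cost scales linearly with duration (no minimums or rounding).
    """
    return PER_MINUTE_USD[model] * Decimal(seconds) / Decimal(60)


if __name__ == "__main__":
    # A 90-minute live translation session.
    print(audio_cost("GPT-Realtime-Translate", 90 * 60))
    # One second of each model, rounded to five decimal places.
    for name in PER_MINUTE_USD:
        print(name, audio_cost(name, 1).quantize(Decimal("0.00001")))
```

Under that linear assumption, a 90-minute translation session comes to $3.06, and the one-second costs round to the per-second rates listed above ($0.00057 and $0.00028).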

GPT-Image-2

State-of-the-art image generation model.

Price

Image: $8.00 / 1M tokens for inputs, $2.00 / 1M tokens for cached inputs, $30.00 / 1M tokens for outputs
Text: $5.00 / 1M tokens for inputs, $1.25 / 1M tokens for cached inputs

Tools

Extend model capabilities with built-in tools for retrieval, execution, and external data access.

Web search

Retrieve up-to-date information from the web to ground model responses.

Price

$10.00 / 1k calls
Search content tokens are free.

Containers

Run code and tools in secure, scalable environments alongside your models.

Price

Now: 1 GB for $0.03 / 64 GB for $1.92 per container
Starting March 31, 2026: 1 GB for $0.03 / 64 GB for $1.92 per 20-minute session per container
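
To make the tool prices concrete, the following sketch estimates web search and container charges. It models the per-session container billing that starts March 31, 2026. The function names are hypothetical, only the two listed container sizes are supported, and rounding a partial 20-minute session up to a full one is an assumption the page does not confirm.

```python
import math
from decimal import Decimal

# USD per 1,000 web search calls, from the price listed above.
WEB_SEARCH_PER_1K_CALLS = Decimal("10.00")

# Container size in GB -> USD per 20-minute session (pricing from March 31, 2026).
CONTAINER_PER_SESSION = {1: Decimal("0.03"), 64: Decimal("1.92")}
SESSION_MINUTES = 20


def web_search_cost(calls):
    """USD cost of `calls` web search calls (search content tokens are free)."""
    return WEB_SEARCH_PER_1K_CALLS * calls / 1000


def container_cost(memory_gb, minutes):
    """USD cost of running one container of a listed size for `minutes`.

    Assumption: a partial 20-minute session is billed as a full session.
    """
    sessions = math.ceil(minutes / SESSION_MINUTES)
    return CONTAINER_PER_SESSION[memory_gb] * sessions


if __name__ == "__main__":
    print(web_search_cost(250))   # 250 searches
    print(container_cost(1, 45))  # 1 GB container for 45 minutes
```

With these assumptions, 250 searches cost $2.50, and a 1 GB container used for 45 minutes is counted as three sessions, or $0.09.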

Service tiers

Balance performance, predictable costs, and availability based on your needs.

Batch API

Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours.
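
A small sketch of what the 50% Batch API saving means in practice, using the GPT-5.5 standard rates from this page. The function names are hypothetical, and the sketch assumes the discount applies uniformly to input and output tokens; cached input is not modeled.

```python
from decimal import Decimal

# GPT-5.5 standard rates in USD per 1M tokens, from the prices on this page.
INPUT_RATE = Decimal("5.00")
OUTPUT_RATE = Decimal("30.00")
BATCH_DISCOUNT = Decimal("0.50")  # "Save 50% on inputs and outputs"


def standard_cost(input_tokens, output_tokens):
    """USD cost at standard processing rates."""
    return (input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE) / Decimal(1_000_000)


def batch_cost(input_tokens, output_tokens):
    """USD cost with the Batch API discount applied to both inputs and outputs."""
    return standard_cost(input_tokens, output_tokens) * (1 - BATCH_DISCOUNT)


if __name__ == "__main__":
    # 1M input tokens and 200K output tokens.
    print(standard_cost(1_000_000, 200_000), batch_cost(1_000_000, 200_000))
```

For 1M input tokens and 200K output tokens, the standard cost is $11.00 and the batch cost is $5.50.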

Priority processing

Offers reliable, high-speed performance with the flexibility to pay as you go.

Flex processing

Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks.

Enterprise offerings

Contact our sales team to learn more about Data residency, Scale Tier, and Reserved Capacity, designed for cutting-edge customers running larger workloads.

FAQ

Which model should I use?

We recommend that developers use our large and mini GPT models for everyday tasks. Our large GPT models generally perform better on a wide range of tasks, while our mini GPT models are fast and inexpensive for simpler tasks.

Our large and mini reasoning models are ideal for complex, multi-step tasks and STEM use cases that require deep thinking about tough problems. You can choose the mini reasoning model if you're looking for a faster, less expensive option.

We recommend experimenting with all of these models in the Playground to find which ones provide the best price-performance trade-off for your usage.

How can I manage my spending on the API platform?

You can set a monthly budget in your billing settings, after which we’ll stop serving your requests. There may be a delay in enforcing the limit, and you are responsible for any overage incurred. You can also configure an email notification threshold to receive an alert once you cross that threshold each month. We recommend checking your usage tracking dashboard regularly to monitor your spend.

If you manage work with Projects, you can set and manage billing restrictions per project in the Dashboard.

How is pricing calculated for images?

Images are converted into tokens and charged per token. Text models price image tokens at standard text token rates, while GPT Image and gpt-realtime use a separate image token rate. Models like gpt-4.1-mini, gpt-4.1-nano, and o4-mini convert images into tokens differently. Learn more in our docs.

Pricing calculator

Example calculation (low resolution setting):

Price per 1M tokens (fixed)	$1.25
512 × 512 tiles	1 × 1
Total tiles	1
Base tokens	70
Tile tokens	140 × 1 = 140
Total tokens	210
Total price	$0.000263
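
The arithmetic in the calculator table can be reproduced with a short sketch. The constants (70 base tokens, 140 tokens per tile, 512 × 512 tiles, $1.25 per 1M tokens) come from the table; the function names are hypothetical, and counting tiles by simple ceiling division with no resizing step is an assumption, since models may rescale an image before tiling it.

```python
import math
from decimal import Decimal, ROUND_HALF_UP

# Values taken from the calculator table above.
PRICE_PER_1M_TOKENS = Decimal("1.25")
BASE_TOKENS = 70
TOKENS_PER_TILE = 140
TILE_SIZE = 512  # tiles are 512 x 512 pixels


def image_tokens(width, height):
    """Token count for an image of the given pixel size.

    Assumption: tiles are counted by ceiling division, with no resizing first.
    """
    tiles = math.ceil(width / TILE_SIZE) * math.ceil(height / TILE_SIZE)
    return BASE_TOKENS + TOKENS_PER_TILE * tiles


def image_price(width, height):
    """USD price for the image's tokens at the fixed per-1M-token rate."""
    return PRICE_PER_1M_TOKENS * image_tokens(width, height) / Decimal(1_000_000)


if __name__ == "__main__":
    tokens = image_tokens(512, 512)
    price = image_price(512, 512).quantize(Decimal("0.000001"), rounding=ROUND_HALF_UP)
    print(tokens, price)
```

For a 512 × 512 image this gives 210 tokens and an exact price of $0.0002625, which rounds half-up to the $0.000263 shown in the table.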

Start creating with OpenAI’s powerful models.

Which model should I use?

Do you offer an enterprise package or SLAs?

Will I be charged for API usage in the Playground?

How will I know how many tokens I’ve used each month?

How can I manage my spending on the API platform?

Is access to the API included in ChatGPT Plus, Business, Enterprise or Edu?

How is pricing calculated for images?
