Skip to main content
OpenAI

API Pricing

Flagship models

Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.

Choose your processing mode

GPT-5.5

A new class of intelligence for coding and professional work.

Price

Input:$5.00 / 1M tokensCached input:$0.50 / 1M tokensOutput:$30.00 / 1M tokens

GPT-5.4

A more affordable model for coding and professional work.

Price

Input:$2.50 / 1M tokensCached input:$0.25 / 1M tokensOutput:$15.00 / 1M tokens

GPT-5.4 mini

Our strongest mini model yet for coding, computer use, and subagents.

Price

Input:$0.75 / 1M tokensCached input:$0.075 / 1M tokensOutput:$4.50 / 1M tokens

Pricing above reflects standard processing rates for context lengths under 270K.
Learn more about Batch Processing(opens in a new window) and Data residency & Regional Processing(opens in a new window)

Multimodal models

Power applications across text, image, and audio with models built for real-time interaction and rich media generation.

Choose your processing mode

GPT-Realtime-2

Our most capable model for realtime voice interactions.

Price

Audio:$32.00 / 1M tokens for inputs$0.40 / 1M tokens for cached inputs$64.00 / 1M tokens for outputsText:$4.00 / 1M tokens for inputs$0.40 / 1M tokens for cached inputs$24.00 / 1M tokens for outputsImage:$5.00 / 1M tokens for inputs$0.50 / 1M tokens for cached inputs

GPT-Realtime-Translate

A new live translation model that translates speech in real time and keeps pace with the speaker.

Price

$0.034 per minute / $0.00057 per second

GPT-Realtime-Whisper

A new streaming speech-to-text that transcribes speech live as the speaker talks.

Price

$0.017 per minute / $0.00028 per second

GPT-Image-2

State-of-the-art image generation model.

Price

Image:$8.00 / 1M tokens for inputs$2.00 / 1M tokens for cached inputs$30.00 / 1M tokens for outputsText:$5.00 / 1M tokens for inputs$1.25 / 1M tokens for cached inputs

Tools

Extend model capabilities with built-in tools for retrieval, execution, and external data access.

Web search

Retrieve up-to-date information from the web to ground model responses.

Price

$10.00 / 1k callsSearch content tokens are free.

Containers

Run code and tools in secure, scalable environments alongside your models.

Price

Now:1 GB for $0.03 / 64GB for $1.92 per containerStarting March 31, 2026:1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container

Service tiers

Balance performance, predictable costs, and availability based on your needs.

Stack icon

Batch API

Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours.

Timer icon

Priority processing

Offers reliable, high-speed performance with the flexibility to pay-as-you-go.

Arrow up and down icon

Flex processing

Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks.

Enterprise offerings

Contact our sales team to learn more about Data residency(opens in a new window), Scale Tier and Reserved Capacity designed for cutting-edge customers running larger workloads.

FAQ

Start creating with OpenAI’s powerful models.