API Pricing
Flagship models
Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.
GPT-5.5
A new class of intelligence for coding and professional work.
Price
$5.00 / 1M tokens for inputs
$0.50 / 1M tokens for cached inputs
$30.00 / 1M tokens for outputs
GPT-5.4
A more affordable model for coding and professional work.
Price
$2.50 / 1M tokens for inputs
$0.25 / 1M tokens for cached inputs
$15.00 / 1M tokens for outputs
GPT-5.4 mini
Our strongest mini model yet for coding, computer use, and subagents.
Price
$0.75 / 1M tokens for inputs
$0.075 / 1M tokens for cached inputs
$4.50 / 1M tokens for outputs
Pricing above reflects standard processing rates for context lengths under 270K tokens.
Learn more about Batch Processing and Data residency & Regional Processing.
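As a rough illustration of how per-token rates translate into a request cost, the sketch below multiplies token counts by the flagship prices listed above. It assumes the three figures shown for each model are, in order, the input, cached input, and output rates, and that cached tokens are counted separately from uncached input tokens. The names `PRICES` and `estimate_cost` are hypothetical and not part of any SDK.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the flagship price list.
# Assumption: the three figures per model are input, cached input, output.
PRICES = {
    "GPT-5.5": {
        "input": Decimal("5.00"),
        "cached_input": Decimal("0.50"),
        "output": Decimal("30.00"),
    },
    "GPT-5.4": {
        "input": Decimal("2.50"),
        "cached_input": Decimal("0.25"),
        "output": Decimal("15.00"),
    },
    "GPT-5.4 mini": {
        "input": Decimal("0.75"),
        "cached_input": Decimal("0.075"),
        "output": Decimal("4.50"),
    },
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model, input_tokens, cached_input_tokens, output_tokens):
    """Estimate the USD cost of one request at standard processing rates.

    Assumption: input_tokens excludes the tokens served from cache.
    """
    rates = PRICES[model]
    total = (
        rates["input"] * input_tokens
        + rates["cached_input"] * cached_input_tokens
        + rates["output"] * output_tokens
    )
    return total / TOKENS_PER_PRICE_UNIT
```

Under these assumptions, a GPT-5.5 request with 100,000 uncached input tokens, 50,000 cached input tokens, and 10,000 output tokens comes to $0.825. Decimal arithmetic is used so that small per-token amounts do not pick up floating-point rounding noise.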
Multimodal models
Power applications across text, image, and audio with models built for real-time interaction and rich media generation.
GPT-Realtime-2
Our most capable model for realtime voice interactions.
Price
Audio:
$32.00 / 1M tokens for inputs
$0.40 / 1M tokens for cached inputs
$64.00 / 1M tokens for outputs
Text:
$4.00 / 1M tokens for inputs
$0.40 / 1M tokens for cached inputs
$24.00 / 1M tokens for outputs
Image:
$5.00 / 1M tokens for inputs
$0.50 / 1M tokens for cached inputs
GPT-Realtime-Translate
A new live translation model that translates speech in real time and keeps pace with the speaker.
Price
$0.034 per minute / $0.00057 per second
GPT-Realtime-Whisper
A new streaming speech-to-text model that transcribes speech live as the speaker talks.
Price
$0.017 per minute / $0.00028 per second
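The per-second figures listed for GPT-Realtime-Translate and GPT-Realtime-Whisper appear to be the per-minute rates divided by 60 and rounded. The sketch below estimates the cost of a stream from the per-minute rate, prorated by the second. How usage is actually metered and rounded is not stated here, so treat the proration as an assumption. `PER_MINUTE` and `audio_cost` are hypothetical names.

```python
from decimal import Decimal

# USD per minute of audio, copied from the price list.
PER_MINUTE = {
    "GPT-Realtime-Translate": Decimal("0.034"),
    "GPT-Realtime-Whisper": Decimal("0.017"),
}


def audio_cost(model, seconds):
    """Estimate USD cost for `seconds` of audio.

    Assumption: billing is prorated per second from the per-minute rate.
    """
    return PER_MINUTE[model] * Decimal(seconds) / Decimal(60)
```

For example, ten minutes (600 seconds) of live translation would come to $0.34 under this assumption.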
GPT-Image-2
State-of-the-art image generation model.
Price
Image:
$8.00 / 1M tokens for inputs
$2.00 / 1M tokens for cached inputs
$30.00 / 1M tokens for outputs
Text:
$5.00 / 1M tokens for inputs
$1.25 / 1M tokens for cached inputs
Tools
Extend model capabilities with built-in tools for retrieval, execution, and external data access.
Web search
Retrieve up-to-date information from the web to ground model responses.
Price
$10.00 / 1k calls
Search content tokens are free.
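A rate of $10.00 per 1k calls works out to $0.01 per call. The short sketch below estimates web search tool spend for a given number of calls, assuming calls are charged pro rata rather than in blocks of 1,000 (this page does not say which). The function name is hypothetical.

```python
from decimal import Decimal

# USD per 1,000 web search calls, copied from the price list.
WEB_SEARCH_PER_1K_CALLS = Decimal("10.00")


def web_search_cost(calls):
    """Estimate USD cost of web search tool calls.

    Assumption: calls are charged pro rata, not in whole blocks of 1,000.
    Search content tokens are listed as free, so they add nothing here.
    """
    return WEB_SEARCH_PER_1K_CALLS * calls / 1000
```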
Containers
Run code and tools in secure, scalable environments alongside your models.
Price
Now:
1 GB for $0.03 / 64 GB for $1.92 per container
Starting March 31, 2026:
1 GB for $0.03 / 64 GB for $1.92 per 20-minute session per container
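To show what the per-session pricing starting March 31, 2026 could mean for a longer-running container, the sketch below counts 20-minute sessions and multiplies by the listed price. It assumes a partially used session is billed as a whole session, which this page does not confirm, and it only accepts the two memory sizes listed above. `PRICE_PER_SESSION` and `container_cost` are hypothetical names.

```python
import math
from decimal import Decimal

SESSION_MINUTES = 20

# USD per 20-minute session per container, for the two sizes listed.
PRICE_PER_SESSION = {
    1: Decimal("0.03"),   # 1 GB
    64: Decimal("1.92"),  # 64 GB
}


def container_cost(memory_gb, minutes):
    """Estimate USD cost for one container kept alive for `minutes`.

    Assumption: any started 20-minute session is billed in full.
    Raises KeyError for memory sizes not shown in the price list.
    """
    if minutes <= 0:
        return Decimal("0")
    sessions = math.ceil(minutes / SESSION_MINUTES)
    return PRICE_PER_SESSION[memory_gb] * sessions
```

Under these assumptions, a 64 GB container used for 45 minutes spans three sessions and costs $5.76.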
Service tiers
Balance performance, predictable costs, and availability based on your needs.
Batch API
Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours.
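The 50% Batch API saving can be applied to any standard per-token rate as a simple multiplier. The sketch below is a minimal illustration. It assumes the discount applies uniformly to the listed input and output rates, and says nothing about cached inputs, which the sentence above does not mention. `BATCH_DISCOUNT` and `batch_rate` are hypothetical names.

```python
from decimal import Decimal

# Fraction saved on inputs and outputs when using the Batch API.
BATCH_DISCOUNT = Decimal("0.50")


def batch_rate(standard_rate_per_1m):
    """Return the discounted USD rate per 1M tokens for batch processing.

    Assumption: the 50% saving is applied directly to the standard rate.
    """
    return standard_rate_per_1m * (1 - BATCH_DISCOUNT)
```

For instance, a standard rate of $15.00 per 1M tokens would become $7.50 per 1M tokens when run through the Batch API.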
Priority processing
Offers reliable, high-speed performance with pay-as-you-go flexibility.
Flex processing
Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower-priority tasks.
Enterprise offerings
Contact our sales team to learn more about Data residency, Scale Tier, and Reserved Capacity, designed for cutting-edge customers running larger workloads.