OpenAI

API Pricing

Flagship models

Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.

GPT-5.4

Our most capable model for professional work.

Price

Input:
$2.50 / 1M tokens
Cached input:
$0.25 / 1M tokens
Output:
$15.00 / 1M tokens

GPT-5.4 mini

Our strongest mini model yet for coding, computer use, and subagents.

Price

Input:
$0.750 / 1M tokens
Cached input:
$0.075 / 1M tokens
Output:
$4.500 / 1M tokens

GPT-5.4 nano

Our cheapest GPT-5.4-class model for simple high-volume tasks.

Price

Input:
$0.20 / 1M tokens
Cached input:
$0.02 / 1M tokens
Output:
$1.25 / 1M tokens

Pricing above reflects standard processing rates for context lengths under 270K.
Data residency and Regional Processing(opens in a new window)⁠ endpoints are charged an additional 10% for all models released after 3/5/26.

Multimodal models

Power applications across text, image, audio, and video with models built for real-time interaction and rich media generation.

GPT-realtime-1.5

Our most capable model for realtime voice interactions.

Price

Audio:
$32.00 for inputs
$0.40 for cached inputs
$64.00 for outputs

Text:
$4.00 for inputs
$0.40 for cached inputs
$16.00 for outputs

Image:
$5.00 for inputs
$0.50 for cached inputs

GPT-image-1.5

State-of-the-art image generation model.

Price

Image:
$8.00 for inputs
$2.00 for cached inputs
$32.00 for outputs

Text:
$5.00 for inputs
$1.25 for cached inputs
$10.00 for outputs

Sora-2

Our latest video generation model.

Price

Price per second:
$0.10

For media with dimensions:
Size: 720p
Portrait: 720 x 1280
Landscape: 1280 x 720

Tools

Extend model capabilities with built-in tools for retrieval, execution, and external data access.

Web search

Retrieve up-to-date information from the web to ground model responses.

Price

$25.00 / 1k calls

Search content tokens are free.

Containers

Run code and tools in secure, scalable environments alongside your models.

Price

Now:
1 GB for $0.03 / 64GB for $1.92 per container

Starting March 31, 2026:
1 GB for $0.03 / 64GB for $1.92 per 20-minute session per container

Service tiers

Balance performance, predictable costs, and availability based on your needs.

Stack icon

Batch API

Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours.

Timer icon

Priority processing

Offers reliable, high-speed performance with the flexibility to pay-as-you-go.

Arrow up and down icon

Flex processing

Provides lower costs for requests in exchange for slower response times and occasional resource unavailability. Ideal for non-production or lower priority tasks.

Enterprise offerings

Contact our sales team to learn more about Data residency(opens in a new window), Scale Tier and Reserved Capacity designed for cutting-edge customers running larger workloads.

FAQ

Start creating with OpenAI’s powerful models.