API Pricing
Pricing below reflects standard processing rates. To optimize cost and performance for different use cases, we also offer:
- Batch API: Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours.
- Priority processing: Reliable, high-speed performance with pay-as-you-go flexibility.
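As a quick illustration of the Batch API saving, here is a minimal sketch that applies the 50% discount stated above to a cost already computed at standard rates. The function name `batch_cost` is hypothetical, and the sketch assumes the discount applies uniformly to the whole input-plus-output total.

```python
def batch_cost(standard_cost_usd: float, discount: float = 0.50) -> float:
    """Apply the Batch API discount (50% per the text above) to a standard-rate cost.

    Assumption: the discount applies equally to input and output charges,
    so it can be taken off the combined total.
    """
    if not 0.0 <= discount <= 1.0:
        raise ValueError("discount must be between 0 and 1")
    return standard_cost_usd * (1.0 - discount)


# A job that would cost $12.50 at standard rates.
print(batch_cost(12.50))  # 6.25
```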
Flagship models
Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.
GPT-5
The best model for coding and agentic tasks across industries
Price
Input: $1.250 / 1M tokens
Cached input: $0.125 / 1M tokens
Output: $10.000 / 1M tokens
GPT-5 mini
A faster, cheaper version of GPT-5 for well-defined tasks
Price
Input: $0.250 / 1M tokens
Cached input: $0.025 / 1M tokens
Output: $2.000 / 1M tokens
GPT-5 nano
The fastest, cheapest version of GPT-5—great for summarization and classification tasks
Price
Input: $0.050 / 1M tokens
Cached input: $0.005 / 1M tokens
Output: $0.400 / 1M tokens
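To make the per-token arithmetic concrete, the sketch below estimates the cost of a request from the rates listed above. It assumes the three prices for each model are, in order, the input, cached input, and output rates, and that cached tokens are a subset of input tokens billed at the cached rate instead of the input rate. The names `Rates`, `RATES`, and `estimate_cost` are hypothetical and not part of any SDK.

```python
from dataclasses import dataclass

TOKENS_PER_UNIT = 1_000_000  # prices are quoted per 1M tokens


@dataclass(frozen=True)
class Rates:
    """USD per 1M tokens. The order (input, cached input, output) is an assumption."""
    input: float
    cached_input: float
    output: float


# Rates copied from the listing above.
RATES = {
    "GPT-5": Rates(1.250, 0.125, 10.000),
    "GPT-5 mini": Rates(0.250, 0.025, 2.000),
    "GPT-5 nano": Rates(0.050, 0.005, 0.400),
}


def estimate_cost(model: str, input_tokens: int, cached_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a request at standard processing rates.

    cached_tokens is the portion of input_tokens assumed to be billed
    at the cached-input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    rates = RATES[model]
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * rates.input
        + cached_tokens * rates.cached_input
        + output_tokens * rates.output
    )
    return total / TOKENS_PER_UNIT


# 2M input tokens (500K of them cached) and 300K output tokens on GPT-5.
print(estimate_cost("GPT-5", 2_000_000, 500_000, 300_000))  # 4.9375
```

In this example the output tokens account for $3.00 of the $4.9375 total, so the output volume is usually worth estimating first.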
Fine-tuning our models
Customize our models to get even higher performance for your specific use cases.
GPT-4.1
Fine-tuning price
Input: $3.00 / 1M tokens
Cached input: $0.75 / 1M tokens
Output: $12.00 / 1M tokens
Training: $25.00 / 1M tokens
GPT-4.1 mini
Fine-tuning price
Input: $0.80 / 1M tokens
Cached input: $0.20 / 1M tokens
Output: $3.20 / 1M tokens
Training: $5.00 / 1M tokens
GPT-4.1 nano
Fine-tuning price
Input: $0.20 / 1M tokens
Cached input: $0.05 / 1M tokens
Output: $0.80 / 1M tokens
Training: $1.50 / 1M tokens
o4-mini
Reinforcement fine-tuning price
Input: $4.00 / 1M tokens
Cached input: $1.00 / 1M tokens
Output: $16.00 / 1M tokens
Training: $100.00 / training hour
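The two training billing units above (per 1M tokens and per training hour) can be compared with a short sketch. It assumes the last price listed for each model is the training rate, and that billed training tokens equal the dataset size multiplied by the number of epochs; neither assumption is stated on this page. Both function names are hypothetical.

```python
def token_training_cost(dataset_tokens: int, epochs: int, usd_per_million: float) -> float:
    """Training cost for models priced per 1M training tokens.

    Assumption: billed tokens = dataset_tokens * epochs.
    """
    return dataset_tokens * epochs * usd_per_million / 1_000_000


def hourly_training_cost(training_hours: float, usd_per_hour: float) -> float:
    """Training cost for models priced per training hour (o4-mini above)."""
    return training_hours * usd_per_hour


# GPT-4.1: a 2M-token dataset trained for 3 epochs at $25.00 / 1M tokens.
print(token_training_cost(2_000_000, 3, 25.00))  # 150.0

# o4-mini: 1.5 training hours at $100.00 / training hour.
print(hourly_training_cost(1.5, 100.00))  # 150.0
```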
Our APIs
Realtime API
Build low-latency, multimodal experiences including speech-to-speech.
Image Generation API
Precise, high-fidelity image generation and editing with our latest multimodal model.
Responses API
Our newest API combining the simplicity of Chat Completions with the built-in tool use of Assistants.
Chat Completions API
Build text-based conversational experiences.
Assistants API
Build assistant-like experiences with our tools.
Built-in tools
Extend model capabilities with built-in tools in the API Platform.
Models | Search Content | Cost |
---|---|---|
gpt-4o, gpt-4o-mini, gpt-4.1, and gpt-4.1-mini models* | Search content tokens free | $25.00 / 1K calls |
GPT-5, GPT-5-mini, GPT-5-nano, o3, o4-mini, o3-pro, and deep research models | Search content tokens billed at model rate | $10.00 / 1K calls |
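
The per-call rates in the table can be turned into a spend estimate as follows. This is a minimal sketch: `tool_call_cost` is a hypothetical helper, and it covers only the per-call charge. For the second row, search content tokens are billed separately at the model rate and are not included here.

```python
def tool_call_cost(calls: int, usd_per_thousand_calls: float) -> float:
    """Per-call charge for a built-in tool priced per 1K calls.

    Excludes any search content tokens billed at the model rate.
    """
    return calls * usd_per_thousand_calls / 1_000


# 4,000 calls at each of the two rates in the table.
print(tool_call_cost(4_000, 10.00))  # 40.0
print(tool_call_cost(4_000, 25.00))  # 100.0
```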
Explore our offerings for Enterprise customers: Priority processing, Scale Tier and Reserved Capacity.
FAQ
We recommend that developers use our large and mini GPT models for everyday tasks. Our large GPT models generally perform better on a wide range of tasks, while our mini GPT models are fast and inexpensive for simpler tasks.
Our large and mini reasoning models are ideal for complex, multi-step tasks and STEM use cases that require deep thinking about tough problems. You can choose the mini reasoning model if you're looking for a faster, less expensive option.
We recommend experimenting with all of these models in the Playground to explore which models provide the best price-performance trade-off for your usage.