Compare models | OpenAI API

1 min read Original article ↗

Best intelligence at scale for agentic, coding, and professional workflows

Reasoning

Speed

Input

Output

Reasoning tokens

Pricing

Per 1M tokens

Input

$2.50

Cached Input

$0.25

Output

$15.00

Context

Window

1,050,000

Max Output Tokens

128,000

Knowledge Cutoff

Aug 31, 2025

Endpoints

v1/chat/completions

v1/responses

v1/batch

Supported Features

Streaming

Function calling

Structured outputs

Distillation

Image input

Rate Limits

TPM

Free

-

Tier 1

500,000

Tier 2

1,000,000

Tier 3

2,000,000

Tier 4

4,000,000

Tier 5

40,000,000

Our strongest mini model yet for coding, computer use, and subagents

Reasoning

Speed

Input

Output

Reasoning tokens

Pricing

Per 1M tokens

Input

$0.75

Cached Input

$0.08

Output

$4.50

Context

Window

400,000

Max Output Tokens

128,000

Knowledge Cutoff

Aug 31, 2025

Endpoints

v1/chat/completions

v1/responses

v1/batch

Supported Features

Streaming

Function calling

Structured outputs

Distillation

Image input

Rate Limits

TPM

Free

-

Tier 1

500,000

Tier 2

2,000,000

Tier 3

4,000,000

Tier 4

10,000,000

Tier 5

180,000,000

Our cheapest GPT-5.4-class model for simple high-volume tasks

Reasoning

Speed

Input

Output

Reasoning tokens

Pricing

Per 1M tokens

Input

$0.20

Cached Input

$0.02

Output

$1.25

Context

Window

400,000

Max Output Tokens

128,000

Knowledge Cutoff

Aug 31, 2025

Endpoints

v1/chat/completions

v1/responses

v1/batch

Supported Features

Streaming

Function calling

Structured outputs

Distillation

Image input

Rate Limits

TPM

Free

-

Tier 1

200,000

Tier 2

2,000,000

Tier 3

4,000,000

Tier 4

10,000,000

Tier 5

180,000,000