GPT-5 API CHEAT SHEET

// Base API URL

https://api.openai.com/

gpt-5-nano-2025-08-07

// Model Details

Use Case: Ultra-low latency, speed-critical tasks.
Context: 128,000 tokens
Max Output: 32,000 tokens

// Pricing (per 1M)

Input: $0.05
Output: $0.50

// API Endpoints

v1/chat/completions
v1/responses
v1/embeddings
v1/moderations

gpt-5-mini-2025-08-07

// Model Details

Use Case: Balanced speed, cost, and performance.
Context: 400,000 tokens
Max Output: 128,000 tokens

// Pricing (per 1M)

Input: $0.25
Output: $2.00

// API Endpoints

v1/chat/completions
v1/responses
v1/realtime
v1/batch
v1/embeddings
v1/audio/speech
v1/audio/transcriptions
v1/moderations

gpt-5-2025-08-07

// Model Details

Use Case: Max power for reasoning, coding, agents.
Context: 400,000 tokens
Max Output: 128,000 tokens

// Pricing (per 1M)

Input: $1.25
Output: $10.00

// API Endpoints

v1/chat/completions
v1/responses
v1/realtime
v1/assistants
v1/batch
v1/fine-tuning
v1/embeddings
v1/images/generations
v1/images/edits
v1/audio/speech
v1/audio/transcriptions
v1/audio/translations
v1/moderations
v1/completions (legacy)