MODELS

GPT-4o

Name: GPT-4o
Brand: openai
Price: 2.5 USD

OpenAI's multimodal workhorse handling text, image, and audio in one model.

openaigpt

Go to official site →API docs →

Specs

Context window: 128,000
Max output: 16,384
Modalities: text, image, audio
Tool use: ✓
Vision: ✓
Streaming: ✓
License: proprietary
Released: 2024-05-13

Pricing

Input / 1M: $2.50
Output / 1M: $10.00
Cached input / 1M: $1.25

Cost estimate

Input tokens (per month)Output tokens (per month)

Estimated monthly cost$4.50

GPT-4o is OpenAI's flagship general-purpose model from May 2024, accepting text, image, and audio inputs in a single network. It offers a 128K context, function calling, vision, and streaming, making it a default choice for chat assistants, agent loops, and multimodal apps. At $2.5/M input and $10/M output, it sits in the mid-tier — cheaper than Claude Opus, pricier than GPT-4o-mini or Gemini Flash.

Editor's verdict

A safe default when you need broad capability and don't want to think hard about model choice — especially for vision or audio. But by 2025 it's been overtaken: GPT-5 and Claude Sonnet 4 outscore it on coding and reasoning, while GPT-4o-mini covers most cheap tasks at a fraction of the cost. Pick GPT-4o when you specifically need its multimodal mix in one call; otherwise newer models are usually the better buy.

Reviews

No reviews yet. Be the first.

Last updated: 2026-04-29