o3
o3Deep-thinking reasoning model, extended chain-of-thought for hard problems.
- Context
- 200K tokens
- Released
- 2025-04
- Input
- $10 / 1M tokens
- Output
- $40 / 1M tokens
- Cached
- $2.5 / 1M tokens
The index
Every mind, with their strengths. Update yourself before you choose your team.
The original. Strong all-rounders plus deep-thinking reasoning models.
o3Deep-thinking reasoning model, extended chain-of-thought for hard problems.
gpt-5.4OpenAI's flagship, best for complex reasoning and code.
gpt-4oMultimodal generalist, strong all-rounder.
gpt-5.4-miniCheaper, faster GPT-5.4 for everyday work.
o3-miniFast reasoning, great for math, code, and logic.
gpt-4o-miniCheapest OpenAI option, great for high volume.
gpt-image-1OpenAI's flagship image model, strong prompt adherence, three quality tiers and aspect ratios.
dall-e-3Legacy OpenAI image model, kept for orgs without GPT Image 1 access yet.
sora-2OpenAI's text-to-video, short cinematic clips up to ~20s with synced audio.
o1Previous-generation reasoning flagship, still capable, broadly available.
gpt-4-turboLegacy GPT-4 with 128K context, kept for compatibility.
Best-in-class at code, careful writing, and nuanced analysis.
claude-opus-4.8Anthropic's most capable model, complex reasoning and agentic coding.
claude-opus-4.7Anthropic's deepest model, nuanced writing and analysis.
claude-sonnet-4.6Balanced flagship, especially good at code.
claude-haiku-3.5Previous-generation Haiku, still capable, lower cost.
claude-haiku-4.5Cheapest and fastest Claude, solid default.
claude-3-5-sonnetLegacy Sonnet, strong general-purpose model.
Huge context windows, native vision, and strong multilingual support.
gemini-3.1-proGoogle's flagship, huge context window and strong vision.
gemini-2.5-proPrevious-gen Gemini Pro, still very capable.
gemini-3-flashFast and cheap Gemini for high-volume tasks.
gemini-2.5-flashPrevious-gen Flash, cheap and dependable.
gemini-3-pro-imageGemini 3 Pro Image, top-tier fidelity, photoreal lighting, slower and pricier than Flash.
gemini-3.1-flash-imageGemini 3.1 Flash Image, next-gen Flash, sharper than 2.5, cheaper than Pro.
gemini-2.5-flash-imageGemini 2.5 Flash Image, fast, cheap and surprisingly sharp. Flat per-image price.
gemini-1.5-proOlder Pro with 2M context, kept for awareness.
veo-3Google's video model, 4K output, synchronised audio, strong physics.
veo-2Previous-gen Veo, still capable for short B-roll generation.
gemini-1.5-flashOlder Flash, cheaper but no longer recommended.
Open-weight reasoning at a fraction of frontier-lab prices.
deepseek-r1Open-weight reasoning model, competitive with o-series at a much lower price.
deepseek-v3.2Strong at code and reasoning at a fraction of the cost.
Open-weight models running on your own hardware. Zero cost per call.
llama3.1:8bRuns locally via Ollama, free, private, offline-capable.
qwen2.5:7bOpen-weight Qwen, strong at code and non-English.
mistral:7bOpen-weight Mistral, efficient generalist.
phi3:14bMicrosoft's open-weight Phi-3, strong at reasoning despite small size.
mixtral:8x7bMoE model from Mistral, fast inference with broad coverage.
deepseek-coder:6.7bCode-specialised open-weight model, runs on modest hardware.
gemma2:9bGoogle's open-weight family, solid writing and reasoning.
xAI's lab, real-time web access and a candid tone.
grok-4xAI's flagship reasoning model, real-time web access and strong on long-context tasks.
grok-2Previous-gen Grok, capable generalist with a wry tone.
European frontier lab. Strong code models and open weights.
mistral-large-2Mistral's hosted flagship, strong code and multilingual.
mistral-medium-3Mid-tier hosted Mistral, good cost-to-quality ratio.
codestralCode-specialised Mistral, autocomplete + chat for 80+ languages.
Enterprise-focused, RAG-tuned models with strong citations and tool use.
command-r-plusCohere's RAG-tuned flagship, strong on retrieval, citations, and tool use.
command-rCheaper Command, same RAG strengths, smaller envelope.
The Llama family. Open weights, frontier-tier, run anywhere.
llama-3.3-70bMeta's open-weight flagship, runs locally on beefy hardware via Ollama, or cloud via Together / Replicate.
llama-4-maverickNext-gen Llama from Meta, flagship reasoning, multimodal, open-weight.
Web-grounded models, answers with live citations.
sonar-proPerplexity's web-grounded model, answers backed by live search. Different shape than a plain chat completion.
Distinctive aesthetic image model, favourite among designers and art directors.
midjourney-v7MidJourney's flagship, distinctive aesthetic, no public API yet (Discord / web-only).
midjourney-v6.1Previous-gen MidJourney, still the photographer's favourite.
FLUX models from the team behind Stable Diffusion, sharp prompt following, fast.
flux-1.1-proBlack Forest Labs' flagship, sharp prompt following, fast generation, public API.
flux-1-schnellOpen-weight FLUX, runs locally on consumer GPUs, free to use.
Stable Diffusion family, open weights, vast community tooling.
stable-diffusion-3.5Open-weight SD3.5, runs locally, hugely customisable via LoRAs and ControlNet.
sdxlClassic Stable Diffusion XL, broad community + tooling support.
Hosted video generation, character consistency and scene control for filmmakers.
runway-gen-4Runway's next-gen video, character consistency, scene control, image-to-video.
runway-gen-3-alphaRunway's previous flagship, fast, hosted, popular with filmmakers.
Friendly, fast video generation, image-to-video and scene-extend.
pika-2.0Pika's 2.0, image-to-video and scene-extend, friendly UX, fast turnaround.
Kuaishou's video lab, long-form clips and strong human motion.
kling-2.0Kuaishou's Kling, long-form video (up to 2 min), strong human motion and Asian-language prompts.
Prices are USD, sourced directly from each provider's rate card, per million tokens for chat models, per image for image models, per second for video models. On BYOK you pay each provider directly with no markup from us; on Pro and Power we cover the provider cost from your Monthly AI Credits. "Coming soon" entries are on our roadmap; "Listed" entries are known to exist but not currently planned for direct integration.