Models Breaking

OpenAI Ships GPT-5.4 With Pro and Thinking Variants for Paid Users

March 5, 2026 2 min read

Image: OpenAI

What Happened

OpenAI released GPT-5.4 on March 5, 2026, positioning it as "our most capable and efficient frontier model for professional work." The release includes two variants: a Pro version and a Thinking version, each targeting different use cases.

The Pro version appears aimed at users who need fast, high-quality output for daily professional tasks. The Thinking version follows the pattern OpenAI established with its o-series reasoning models, giving the model more room to work through complex problems step by step before responding.

This follows OpenAI's pattern of rapid iteration on the GPT-5 line. Rather than giant version jumps, we're seeing incremental point releases that refine capabilities for specific workflows. The "5.4" numbering suggests several internal iterations since GPT-5's initial launch.

Why It Matters

If you're using ChatGPT daily for work, the Pro vs. Thinking split is the real story here. It means OpenAI is acknowledging what power users already know: you don't want the same model behavior for a quick email rewrite as you do for debugging a complex system architecture.

The "most efficient" framing matters too. Model efficiency directly translates to speed and cost. A model that's both more capable and more efficient means faster responses in ChatGPT and potentially lower API costs for developers building on GPT-5.4.

For teams locked into the OpenAI ecosystem, this is a straightforward upgrade. For everyone else, it raises the competitive bar. Claude, Gemini, and other frontier models now have a new benchmark to match.

Our Take

The Pro/Thinking split is smart product design, not just a model release. OpenAI is learning that shipping one model and expecting users to prompt-engineer their way to different behaviors doesn't scale. Giving people an explicit toggle between "fast and good" and "slow and thorough" is what users actually want.

That said, "most capable" is a claim that needs benchmarks to back it up. OpenAI's internal evals rarely tell the full story. What matters is whether GPT-5.4 Pro actually outperforms Claude and Gemini on the messy, real-world tasks people care about: long document analysis, code generation, nuanced writing.

The efficiency angle could be the bigger deal here. If GPT-5.4 delivers GPT-5 level quality at meaningfully lower latency, that shifts the calculus for anyone choosing between API providers. Speed compounds across thousands of daily interactions.

We'll be running our own comparisons across ChatGPT, Claude, and Gemini in the coming days. Until then, don't swap your whole workflow based on a press release.

What Happened

Why It Matters

Our Take

Related Tools

More from today

Google's February Roundup: Gemini 3.1 Pro, Deep Think, and Nano Banana 2

ByteDance's Seedance 2.0 Hit a Wall: Compute Limits and Copyright Complaints

GPT-5.4 System Card: First OpenAI Model With Cyber Mitigations Built In

Cookie Preferences