Hacker News

Slight increase in model cost, but looks like benefits across the board to match.

  model    input  cached  output  (per 1M tokens)
  gpt-5.2  $1.75  $0.175  $14.00
  gpt-5.1  $1.25  $0.125  $10.00




40% increase is not "slight."

Not the OP, but I think "slight" here is in relation to Anthropic and Google. Claude Opus 4.5 comes at $25/MT (million tokens), Sonnet 4.5 at $22.5/MT, and Gemini 3 at $18/MT. GPT 5.2 at $14/MT is still the cheapest.

Your numbers are very off.

  $25 - Opus 4.5
  $15 - Sonnet 4.5
  $14 - GPT 5.2
  $12 - Gemini 3 Pro
Even if you're including input, your numbers are still off.

I used the pricing for long context (>200k) in all cases. I personally use AI as coding assistants, like lots of other people, and as such, hitting and exceeding 200k is quite the norm. The numbers you are showing are for <200k context length.

I also use them as coding assistants, among other things, like lots of other people, and hitting and exceeding 200k is absolutely not the norm unless you're using a large number of huge MCP servers. At those context sizes output quality declines significantly, despite claims of "we support long context". This is why all those coding assistants use auto-compression: not just to save money, but largely to maintain quality. In any case, >200k input calls are a small fraction of all calls.

Ironically, at that input size input costs dominate output costs, so if that's the use case you're going for, you want to be including input in your quoted prices anyway.
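As a rough sketch of why input dominates at that size (rates assumed from the GPT-5.2 figures quoted upthread; the 250k/4k call shape is an illustrative assumption):

```python
# Back-of-envelope cost of one long-context call at GPT-5.2's listed
# rates (assumed from the thread: $1.75/M input, $14.00/M output).
INPUT_PER_M = 1.75
OUTPUT_PER_M = 14.00

def call_cost(input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of a single API call at the rates above."""
    return (input_tokens / 1e6) * INPUT_PER_M + (output_tokens / 1e6) * OUTPUT_PER_M

# A 250k-token context with a 4k-token response:
input_cost = (250_000 / 1e6) * INPUT_PER_M    # $0.4375
output_cost = (4_000 / 1e6) * OUTPUT_PER_M    # $0.056
# Input is ~8x the output cost at this call shape, so for long-context
# coding work the input rate matters more than the headline output rate.
```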


In particular, the API pricing for GPT-5.2 Pro has me wondering what on earth the possible market for that model is beyond getting to claim a couple of percent higher benchmark performance in press releases.

>Input: $21.00 / 1M tokens

>Output: $168.00 / 1M tokens

That's the most "don't use this" pricing I've seen on a model.

https://openai.com/api/pricing/


Last year o3 (high) scored 88% on ARC-AGI 1 at more than $4,000/task. This model, in its X-high configuration, scores 90.5% at just $11.64 per task.

General intelligence has gotten ridiculously less expensive. I don't know if it's because of compute and energy abundance, or attention mechanisms improving in efficiency, or both, but we have to acknowledge the bigger picture and relative prices.
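The relative-price point in concrete terms, using the figures from the comment above:

```python
# Cost-per-task comparison on ARC-AGI 1, figures quoted in the thread.
o3_high_per_task = 4000.00   # o3 high, reported as "more than $4,000/task"
gpt52_xhigh_per_task = 11.64 # GPT-5.2 X-high configuration

reduction = o3_high_per_task / gpt52_xhigh_per_task  # ~344x cheaper per task
```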


Sure, but the reason I'm confused by the pricing is that the pricing doesn't exist in a vacuum.

Pro barely performs better than Thinking in OpenAI's published numbers, but comes at ~10x the price with an explicit disclaimer that it's slow, on the order of minutes.

If the published performance numbers are accurate, it seems like it'd be incredibly difficult to justify the premium.

At least on the surface level, it looks like it exists mostly to juice benchmark claims.


It could be using the same trick Grok used early on (at least in earlier versions): spawn 10 agents that work on the problem in parallel, then take a consensus on the answer. That would explain both the price and the latency.

Essentially a newbie trick that works really well but isn't efficient, while still looking like an amazing breakthrough.

(if someone knows the actual implementation I'm curious)
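A minimal sketch of that parallel-sample-and-vote trick (a guess at the mechanism, not OpenAI's actual implementation; the sample answers are made up):

```python
from collections import Counter

def consensus(answers: list[str]) -> str:
    """Majority vote over n independently sampled answers."""
    return Counter(answers).most_common(1)[0][0]

# With n parallel samples, cost scales linearly with n and latency with
# the slowest sample, which would account for a ~10x price premium and
# minutes-long response times over a single-sample model.
samples = ["42", "42", "41", "42", "43", "42", "42", "41", "42", "42"]
print(consensus(samples))  # -> 42
```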


The magic number appears to be 12 in the case of GPT-5.2 Pro.

Those prices seem geared toward people who are completely price insensitive, who just want "the best" at any cost. If the margins on that premium model are as high as they should be, it's a smart business move to give them what they want.

gpt-4-32k pricing was originally $60.00 / $120.00.

Pro solves many problems for me on first try that the other 5.1 models are unable to after many iterations. I don't pay API pricing but if I could afford it I would in some cases for the much higher context window it affords when a problem calls for it. I'd rather spend some tens of dollars to solve a problem than grind at it for hours.

Less an issue if your company is paying

Even less an issue when OpenAI provides you free credits

Someone on Reddit reported being charged $17 for one prompt on 5-pro, which suggests around 125,000 reasoning tokens.
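A hedged sanity check of that estimate, assuming an output rate of roughly $120/M for 5-pro (that rate is my assumption, not stated in the thread):

```python
# Does ~125k reasoning tokens square with a $17 charge on 5-pro?
OUTPUT_PER_M = 120.00             # assumed 5-pro output rate, not from the thread
reasoning_tokens = 125_000

output_cost = reasoning_tokens / 1e6 * OUTPUT_PER_M  # $15.00
# The remaining ~$2 of the $17 charge would be input tokens, so the
# 125k figure is plausible under this assumed rate.
```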

Makes me feel guilty for spamming pro with any random question I have multiple times a day.


They probably just beefed up compute runtime on what is the same underlying model

In what world is that a slight increase?


