undefined

points

[-]

I believe you are looking at GPT 5.4 Pro. It's confusing in the context of subscription plan names, Gemini naming and such. But they've had the Pro version of the GPT 5 models (and I believe o3 and o1 too) for a while.

It's the one you have access to with the top ~$200 subscription and it's available through the API for a MUCH higher price ($2.5/$15 vs $30/$180 for 5.4 per 1M tokens), but the performance improvement is marginal.

Not sure what it is exactly, I assume it's probably the non-quantized version of the model or something like that.

by nsingh26 hours ago|

parent|

[-]

From what I've read online it's not necessarily a unquantized version, it seems to go through longer reasoning traces and runs multiple reasoning traces at once. Probably overkill for most tasks.

by ZeroCool2u6 hours ago|

parent|

prev|

[-]

Yup, that was it. Didn't realize they're different models. I suppose naming has never been OpenAI's strong suit.

by logicchains5 hours ago|

parent|

prev|

[-]

>It's the one you have access to with the top ~$200 subscription and it's available through the API for a MUCH higher price ($2.5/$15 vs $30/$180 for 5.4 per 1M tokens), but the performance improvement is marginal.

The performance improvement isn't marginal if you're doing something particularly novel/difficult.

by highfrequency6 hours ago|

prev|

[-]

Can you be more specific about which math results you are talking about? Looks like significant improvement on FrontierMath esp for the Pro model (most inference time compute).

by ZeroCool2u6 hours ago|

parent|

[-]

Frontier Math, GPQA Diamond, and Browsecomp are the benchmarks I noticed this on.

by csnweb6 hours ago|

parent|

[-]

Are you may be comparing the pro model to the non pro model with thinking? Granted it’s a bit confusing but the pro model is 10 times more expensive and probably much larger as well.

by ZeroCool2u6 hours ago|

parent|

[-]

Ah yes, okay that makes more sense!

by andoando6 hours ago|

prev|

[-]

The thinking models are additionally trained with reinforcement learning to produce chain of thought reasoning

by aplomb10266 hours ago|

prev|

[-]

[dead]