GPT 5.4 Pro is extremely slow but thorough, so it's not meant for the usual agentic work. It's better suited to research or to solving hard bugs/math problems, provided you give it all the context.
And do you mean to say that you don't really use GPT 5.4 Pro unless it's for a hard bug? Curious which models you use for system design/architecture/planning vs execution of a plan/design.
TIA! I'm still trying to figure out an optimal system for leveraging all of the LLMs available to us. I've just been throwing 100% of my work at Claude Code in recent months, but I'd like to branch out.
- internally, the same best-of-N architecture
- not available in a coding harness like Codex, only in the UI (GPT also has an API)
- GPT-5.4 Pro is extremely expensive: $30.00 for input vs. $180.00 for output
- both DT and Pro are really good at solving math problems
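
For anyone unfamiliar with the "best of N" idea mentioned in the list above, here is a minimal sketch of the generic technique: sample several candidate answers, score each one, and keep the best. This is not a claim about how either model works internally. `best_of_n`, `toy_generate`, and `toy_score` are hypothetical names made up for illustration, and the toy generator and scorer stand in for a real model call and a real verifier.

```python
import random
from typing import Callable, List


def best_of_n(prompt: str,
              generate: Callable[[str], str],
              score: Callable[[str, str], float],
              n: int = 8) -> str:
    """Sample n candidates for the prompt and return the highest-scoring one."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates: List[str] = [generate(prompt) for _ in range(n)]
    return max(candidates, key=lambda c: score(prompt, c))


# Toy stand-ins (hypothetical): a real system would call a model here.
rng = random.Random(0)


def toy_generate(prompt: str) -> str:
    # Pretend each sample is a noisy guess at the answer to "17 * 3".
    return str(51 + rng.randint(-3, 3))


def toy_score(prompt: str, candidate: str) -> float:
    # Pretend verifier: guesses closer to the true product score higher.
    return -abs(int(candidate) - 17 * 3)


answer = best_of_n("17 * 3", toy_generate, toy_score, n=8)
print(answer)
```

The trade-off is visible even in the toy version: every extra candidate is another full generation, which is one plausible reason a setup like this would be both slow and expensive.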