Hacker News
by postalcoder 4 hours ago
by alexdobrenko 2 hours ago
can we please make the bluey bench the gold standard for all models always
by mnicky 3 hours ago
Can you compare it to Opus 4.6 with thinking disabled? It seems to have very impressive benchmark scores. Could also be pretty fast.
by postalcoder 3 hours ago
Added a thinking-disabled Opus 4.6 timing. It took 1m 4s – coincidentally the same as 5.3-codex-low.
by Squarex 3 hours ago
I wonder why they named it so similarly to the normal codex model when it is much worse, though still cool of course.