by CSMastermind 13 hours ago

by dools 12 hours ago

Yeah, I’ve been consistently underwhelmed by Anthropic models, but then I don’t use their harness, so maybe that’s it.

by wwind123 11 hours ago

In my experience, for more mechanical refactoring work (like splitting a big source code file into multiple smaller ones), GPT 5.5 runs way faster than any of the Claude models. But for other tasks that require deeper reasoning, it's not that clear who the winner is.

by iLoveOncall 11 hours ago

It's just too funny to see people arguing "no, it's my religion that's the right one!" on Hacker News.

You guys are all a lost cause.

by goosejuice 8 hours ago

How is attempting to benchmark LLMs like religion?

by iLoveOncall 8 hours ago

Re-read the comment I'm replying to. It's not talking about benchmarks, just models.