That claim keeps getting contradicted hard by other parties, who say Mythos beats 5.5 resoundingly at both autonomous search and discovery and the creation of complex exploit chains.

There might be a harness difference, but this CTF-type benchmark also might not fully capture the capability difference.

by nimchimpsky, 5 hours ago

[dead]

by abirch, 9 hours ago

Anthropic can sell Mythos to Fortune 500 companies and bypass the average user. I'm not sure how much of it is hype, but I see things like this: https://blog.cloudflare.com/cyber-frontier-models/

by Sevii, 10 hours ago

It's doubtful they have the compute to make Mythos publicly available, even after the SpaceX datacenter deal. And why sell it publicly if people are still willing to pay for Opus 4.7?

by outside1234, 10 hours ago

I suspect that Mythos doesn't have a business model that works.