Hacker News
new
past
comments
ask
show
jobs
points
by
inigyou
4 hours ago
|
comments
by
cassianoleal
3 hours ago
|
[-]
How do you qualify what makes a model "Mythos class", and how do you reliably test for it?
reply
by
generalizations
3 hours ago
|
parent
|
[-]
Presumably a deepswe benchmark, which IIRC puts GLM 5.2 between opus 4.8 and fable.
reply