Hacker News
new
past
comments
ask
show
jobs
points
by
seaal
21 hours ago
|
comments
by
winwang
21 hours ago
|
next
[-]
There's the other (orthogonal) possible explanation of using more GPUs for stress-testing before product launch.
reply
by
zamadatix
6 hours ago
|
parent
|
[-]
That's less an orthogonal explanation and more an example of why they'd do something like serve a quantized model.
reply
by
MagicMoonlight
21 hours ago
|
prev
|
[-]
Nope, they deliberately enshittify the old model right before release to fake the metrics.
reply
by
recursive
18 hours ago
|
parent
|
[-]
Good ol' sawtooth step change.
reply