Hacker News
new
past
comments
ask
show
jobs
points
by
osti
3 hours ago
|
comments
by
sdenton4
2 hours ago
|
[-]
Doing great on public datasets and underperforming on private benchmarks is not a good look.
reply
by
Deegy
2 hours ago
|
parent
|
[-]
Is it though? Do we still have the expectation that LLMs will eventually be able to solve problems they haven't seen before? Or do we just want the most accurate auto complete at the cheapest price at this point?
reply