Hacker News
new
past
comments
ask
show
jobs
points
by
vidarh
12 hours ago
|
comments
by
irthomasthomas
11 hours ago
|
[-]
Here is an example where the prompt was only a few hundred tokens and the output reasoning chain was correct, but the actual function call was wrong
https://x.com/xundecidability/status/2005647216741105962?s=2...
reply