Hacker News
by jameshush 12 hours ago
by angry_octet 10 hours ago
It seems that tool calling shouldn't add 500ms of latency?
by hobofan 5 hours ago
If you have tool calling complex enough that it necessitates a higher reasoning level, and you would otherwise have reasoning set to "none", this can easily come out to 500ms.