Hacker News
new
past
comments
ask
show
jobs
points
by
SXX
6 hours ago
|
comments
by
unglaublich
6 hours ago
|
next
[-]
Indeed, at 30tok/s make it pause for 20 seconds while "thinking" is streaming (and hidden); that's the real experience.
reply
by
5 hours ago
|
parent
|
[-]
deleted
reply
by
sig_kill
3 hours ago
|
prev
|
next
[-]
You should check out
https://tokey.ai
, I made it a few months ago and has all of these suggestions.
reply
by
redox99
5 hours ago
|
prev
|
[-]
Yes, it should use actual output from some of the open models.
reply