by SXX 6 hours ago

by unglaublich 6 hours ago

Indeed. At 30 tok/s, make it pause for 20 seconds while the "thinking" is being streamed (and hidden); that's the real experience.

by sig_kill 3 hours ago

You should check out https://tokey.ai. I made it a few months ago, and it has all of these suggestions.

by redox99 5 hours ago

Yes, it should use actual output from some of the open models.
