Hacker News
new
past
comments
ask
show
jobs
points
by
xienze
12 hours ago
|
comments
by
fgfarben
11 hours ago
|
next
[-]
That prefill number isn't right. M4 Max hits 200-300:
https://github.com/antirez/ds4/blob/main/speed-bench/m4_max_...
reply
by
hadlock
9 hours ago
|
parent
|
[-]
M5 studio is gonna sell like hot cakes
reply
by
throwdbaaway
5 hours ago
|
prev
|
next
[-]
Hah, that's because the prompt itself was only about 30 tokens. We need a much bigger prompt to properly test PP.
reply
by
aiscoming
12 hours ago
|
prev
|
[-]
if it's just the coding agent system prompt and tools, you can cache that
reply
by
xienze
12 hours ago
|
parent
|
[-]
Yeah the problem is that's just the start of the context. There's, you know, all the tool call results and file reads and stuff.
reply
by
7 hours ago
|
parent
|
[-]
deleted
reply