H200s are not cheap, and I don't think you can run DeepSeek at full weights, without any quantization, on even two of them.
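
A rough back-of-the-envelope check of that claim, as a sketch only. The numbers here are my assumptions, not something stated above: I'm taking "DeepSeek" to mean DeepSeek-V3/R1 at roughly 671B total parameters and an H200 to have 141 GB of HBM, and I'm counting weights only (no KV cache, activations, or runtime overhead, which only make it worse).

```python
# Weights-only memory estimate. All constants are assumptions for illustration.
PARAMS = 671e9         # assumed: DeepSeek-V3/R1 total parameter count
H200_MEMORY_GB = 141   # assumed: HBM capacity of one H200, in GB
NUM_GPUS = 2

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"bf16": 2, "fp8": 1}


def weight_memory_gb(params: float, bytes_per_param: int) -> float:
    """Memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


def fits(params: float, bytes_per_param: int, num_gpus: int, gpu_gb: float) -> bool:
    """True if the weights alone fit in the combined memory of the GPUs."""
    return weight_memory_gb(params, bytes_per_param) <= num_gpus * gpu_gb


if __name__ == "__main__":
    budget = NUM_GPUS * H200_MEMORY_GB
    for precision, nbytes in BYTES_PER_PARAM.items():
        need = weight_memory_gb(PARAMS, nbytes)
        verdict = "fits" if fits(PARAMS, nbytes, NUM_GPUS, H200_MEMORY_GB) else "does not fit"
        print(f"{precision}: ~{need:.0f} GB needed vs {budget} GB available -> {verdict}")
```

Under those assumptions the weights need about 1342 GB in BF16 or 671 GB in FP8, against 282 GB across two cards, so neither fits.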

Open weights are good in theory, especially for developers and market competition, but they are not as wonderful as you might think.

reply
Darling, we'll always have W_q, W_k, W_v, and W_o.
reply
I think my main concern was productivity, but tell me more about this AI Girlfriend
reply
It's not just the weights. It's also the system prompt, harness, safety filters, etc. These can significantly affect the performance of the same underlying model.
reply