I'm curious about the granularity of contracts around granting/selling excess capacity. Are they short term? Can the owner evict those workloads (with a penalty)?
Presumably the underlying model here is also an LLM? To what degree is it "fine-tuned", or is it just given a set of tools to build a good picture of cluster usage?
https://www.linkedin.com/posts/rahmi-pruitt-a1bb4a127_agentn...
I wonder what is stopping datacenters from passing this benefit to customers by launching better tuned plans. For example, t series EC2 instances on AWS.
I feel like it’s probably just complexity.
Different workloads benefit from specific types of optimisations.