For a business with ten or more engineers (or other heavy AI users), it might still make sense to set this up. For an individual, though, I can't imagine you'd reach positive ROI before the hardware ages out.
reply
It's hard to tell for sure because the local inference engines/frameworks we have today are not really that capable. We have barely started exploring the implications of SSD offload, saving KV-caches to storage for reuse, setting up distributed inference in multi-GPU setups or over the network, making use of specialty hardware such as NPUs, etc. All of these can make use of fairly ordinary, run-of-the-mill hardware.
reply
Since you need at least a few H100-class cards, I'd guess you need at least a few tens of coders to justify the cost.
reply
What near SOTA open models are you referring to?
reply
I'm backing up a big dataset onto tapes and wanted to automate it. I have an idle 64 GB VRAM setup in my basement, so I decided to experiment and tasked it with writing an LTFS implementation. LTFS is an open standard for a filesystem on tape, and there's an implementation in C that can be used as the baseline.

So far, within the last two days, Qwen 3.6 has produced a functionally equivalent Golang implementation that works against the flat-file backend. I'm extremely impressed.

reply
It is surprisingly competent. It's not Opus 4.6, but it works well for well-structured tasks.
reply