Interesting how Vulkan and ROCm are roughly the same age (~9 years), yet one is far more stable (and sometimes even more performant) for AI use cases despite treating them as a side gig, while the other has AI as its primary raison d'être. That tells you a lot about the development teams behind them.
Keep an eye out for a stable rocm PR to stagex in the next week or so if all goes well.
I've built llama.cpp against both Vulkan and ROCm on a Strix Halo dev box. I agree Vulkan is good enough, at least for my hobbyist purposes. ROCm has improved, but I'd say it's not worth the administrative overhead.
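For anyone wanting to reproduce this, a rough sketch of building llama.cpp with each backend is below. The CMake flags (GGML_VULKAN, GGML_HIP) and the gfx1151 target for Strix Halo match the upstream build docs at time of writing, but verify them against the repo's current instructions, since option names have changed before.

```shell
git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp

# Vulkan backend: only needs the Vulkan SDK/headers installed.
cmake -B build-vulkan -DGGML_VULKAN=ON
cmake --build build-vulkan --config Release -j

# ROCm backend: needs a full ROCm install plus the right GPU target.
# gfx1151 is the Strix Halo architecture ID -- confirm for your chip
# with `rocminfo`. The HIPCXX path assumes a default /opt/rocm install.
HIPCXX=/opt/rocm/llvm/bin/clang++ cmake -B build-rocm \
    -DGGML_HIP=ON -DAMDGPU_TARGETS=gfx1151
cmake --build build-rocm --config Release -j
```

The Vulkan build's smaller dependency footprint is a large part of why it has less "administrative overhead": no ROCm userspace stack to install and keep in sync with the kernel driver.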
I realize it does not address the OP's security concerns, but I'm having success running ROCm containers[0] on Alpine Linux specifically for llama.cpp. I also got vLLM to run in a ROCm container, but I didn't have time to diagnose perf problems, and llama.cpp is working well for my needs.

[0] https://github.com/kyuz0/amd-strix-halo-toolboxes

FWIW, Alpine now has native packages for llama.cpp (using Vulkan).
Nice! Will check it out.

edit: and thanks for the packaging work!
