But if you want to write an article that uses only open-source tools, there is an incentive not to use it, because it isn't open source.
Basically, one has two real choices for local LLMs: llama.cpp (for a single user) or vLLM (for multi-user/enterprise use).