In the meantime, Ollama seems to default to "Q4_K_M", which is barely usable for anything and really won't cut it for agentic coding; the quantization level is just too low. Not sure why Ollama defaults to basically unusable quantizations, but that train left the station a long time ago: they seem more interested in people thinking they can run stuff than in flagging the tradeoffs up front, and it's been that way since day 1.
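If you're stuck with Ollama, you can usually sidestep the default by pulling an explicitly quantization-tagged variant instead of the bare model name. A rough sketch (the exact model/tag names here are assumptions; check the model's page in the Ollama library for which quants are actually published):

```shell
# Pull a higher-precision variant by naming the quant in the tag,
# rather than taking whatever the default tag (often Q4_K_M) gives you.
# Tag shown is illustrative -- verify it exists on the library page first.
ollama pull qwen2.5-coder:32b-instruct-q8_0

# Inspect the pulled model; recent Ollama versions report the
# quantization level in the `ollama show` output.
ollama show qwen2.5-coder:32b-instruct-q8_0
```

Q8_0 roughly doubles the download size versus Q4_K_M, so this only helps if you have the VRAM/RAM headroom for it.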