1. There are small local models that match the capabilities frontier models had a year ago

2. They aren't harvesting your data for government files or training purposes

3. They won't be altered overnight to push advertising or a political agenda

4. They won't have their pricing raised at will

5. They won't disappear as soon as their host wants you to switch

reply
Good points. What local models have you found work best for your use cases? I feel like if we get to Opus 4.6-level intelligence running on local hardware, we're in the clear for a lot of day-to-day use cases.
reply
Thanks. I understand that.

What are you doing with it?

Why do you want it?

reply
Chat is certainly an option, but the real deal is agents, which have access to way more sensitive information.
reply
Thanks. What do you do with such an agent? What is the use case?
reply
Most of the LLM tooling can handle different models. Ollama makes it easy to install and run different models locally, so you can configure Aider or VS Code or whatever you're using to connect to ChatGPT to point at your local models instead.
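
For example, with the Python openai client the swap is just a base-URL change. This is a minimal sketch, assuming Ollama is serving its OpenAI-compatible API on the default port (11434) and that you've already pulled a model; the model name below is a placeholder:

    # Point an OpenAI-compatible client at a local Ollama server instead of
    # the hosted API. Assumes `ollama serve` is running on the default port
    # and a model has been pulled, e.g. `ollama pull llama3.1`.
    from openai import OpenAI

    client = OpenAI(
        base_url="http://localhost:11434/v1",  # Ollama's OpenAI-compatible endpoint
        api_key="ollama",  # the client requires a key, but Ollama ignores it
    )

    resp = client.chat.completions.create(
        model="llama3.1",  # placeholder: whatever local model you've pulled
        messages=[{"role": "user", "content": "Explain what a LoRA adapter is."}],
    )
    print(resp.choices[0].message.content)

Most tools that speak the OpenAI API accept the same kind of base-URL override, so the same trick works beyond plain scripts.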

None of them are as good as the big hosted models, but you might be surprised at how capable they are. I like running things locally when I can, and I also like not worrying about accidentally burning through tokens.

I think the future is multiple locally run models that call out to hosted models when necessary. I can imagine every device coming with a base model and using LoRAs to learn about the user's needs, with companies and maybe even households having their own shared models that do the heavier lifting, while companies like OpenAI and Anthropic continue to host the most powerful and expensive options.
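
A hypothetical router sketch of that local-first idea: try the local model by default and escalate to a hosted one only when a task demands it. The endpoints, model names, and the crude `heavy` flag are all illustrative placeholders, not a real routing heuristic:

    # Hypothetical local-first router: local model by default, hosted model
    # only for tasks flagged as heavy. All names here are placeholders.
    from openai import OpenAI

    local = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")
    hosted = OpenAI()  # reads OPENAI_API_KEY from the environment

    def ask(prompt: str, heavy: bool = False) -> str:
        client, model = (hosted, "gpt-4o") if heavy else (local, "llama3.1")
        resp = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
        )
        return resp.choices[0].message.content

    print(ask("Summarize these release notes: ..."))       # stays local
    print(ask("Refactor this whole module", heavy=True))   # escalates to hosted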

reply
What models have you found capable? I was recently recommended Qwen3 Coder Next and did not find it very good. I have a good amount of VRAM/RAM, so I would love to run something locally.
reply
Thanks.

I still don’t understand. What are you using this LLM you’re running locally to actually do?

What is the use case?

reply