upvote
Stop thinking billion dollar publicly traded companies are "cool" just because they make widget you like.

You will be backstabbed

You will be squeezed for all they can.

And you will be betrayed.

> Phase N: People can't afford computing anymore. Everything is thin clients and rented. It's become like the private railroad industry. End of the PC era. Like kids growing up on smartphones, there's nothing to tinker with anymore. And certainly no gradient for entrepreneurship for once-skilled labor capital.

Thankfully none of them actually makes money and just runs on investment so there is a good chance bubble will drop and the price of PC equipment will... continue to rise as US gives up Taiwan to China

reply
What leads you to say China AI is giving up on open weights?

I've been using GLM for over 6 months and pretty happy.

reply
Why would any company release open weights once the investment money stops ?

Releasing open weights have been basically a PR move, the moment those companies need to actually make money they will cut it out as that reduces their client base.

They DO NOT want you to run AI. They want you to pay them to do it

reply
Minimax just released a new model yesterday. You're conflating one company with a countries entire industry. There's more than just Qwen coming out of China.
reply
ok. maybe. I don't know. I'm asking how you know.

z.ai did go public on the HK exchange. They are under pressures similar to other public companies.

I know that China models are increasingly being trained and run using Huawei chips instead of Nvidia. I know China has a surplus of electricity from renewables (wind, solar, hydro).

reply
open weights is a way to nerf your opponent and is meaningless to your business if you need to retrain a model because your trailing

So, it makes a lot of sense to get people a "demo" and claim the paid product is better.

i think a lot of people have no idea how capable local models are atm.

reply
People keep repeating this without any real thought behind it because of the high profile resignations on the Qwen team. Meanwhile the Minimax team just released a new open weights version of their 229B model yesterday. So much for that narrative.

The AI landscape in China is larger than just Qwen and Alibaba.

reply
What I want to know is how did they make the only LLM that doesn't sound cringe?

I think it has something to do with mode collapse (although Claude certainly has its own "tells"), but I'm not sure.

It sounds trivial but even for Agentic, I found the writing style to be really important. When you give Claude a persona, it sounds like the thing. When you give GPT a persona, it sounds like GPT half-assedly pretending to be the thing.

---

Some other interesting points about Anthropic's models. I don't know if any of these relate to my LLM style question, but seems worth mentioning:

Claude models also use way less tokens for the same task (on ArtificialAnalysis, they are a clear outlier on this metric).

And there's a much stronger common sense, subjectively. (Not sure if we have a good way to actually measure that, though.) It takes context and common sense into account, to a much greater degree.

(Which ties in with their constitution. Understanding why things are wrong at a deeper level, rather than just surface level pattern matching.)

Opus is great but it should be bigger. You notice the difference between Sonnet and Opus, but with heavy use you notice Opus's limitations, too.

reply
Good read on the situation.

It all boils down to a brilliant but extremely expensive technology. Both to build and to run.

We've been sold a product with heavy subsidy. The idea (from Sam) scale out and see what happens.

Those who care to read between the lines can see what's happening. A perfect storm of demand that attract VCs who can't understand they are the real customers. Once they understand that it will be too late.

Regarding open weight models: eventually we will, as humanity, benefit from the astronomical capital poured into developing a technology ahead of its time. In a few years this and even more will run on edge.

Written by open source developers, likely former openai and anthropic employees who got so much cash in the bank they don't need to worry about renting their knowledge.

reply
> We need open weights companies now more than ever.

If you're objective it to democratize AI, sure. But for those fed up with it and the devastating effects it's having on students, for example, can opt to actively avoid paying for products with AI (I say this as someone who uses it every day, guilty). At some point large companies will see that they're bleeding money for something that most people don't seem to want, and cancel those $100k/mo deals. I've already experienced one AI-developer-turned company crash and burn.

Personally, I don't think this LLM-based AI generation will have any significant positive impacts. Time, energy (CO2) and money would have been far better spent elsewhere.

reply
> End of the PC era, there's nothing to tinker with anymore. And certainly no gradient for entrepreneurship for once-skilled labor capital.

This one seems too far fetched. Training models is widespread. There will always be open weight models in some form, and if we assume there will be some advancements in architecture, I bet you could also run them on much leaner devices. Even today you can run models on Raspberry Pis. I don't see a reason this will stop being a thing, there will be plenty of ways to tinker.

However, keep in mind the masses don't care about tinkering and never have. People want a ChatGPT experience, not a pytorch experience. In essence this is true for all tech products, not just AI.

reply
New theory of HN: every post on LLMs would involve at least a few comments hinting on class warfare and Marxism
reply
New theory: every post to HN will be about LLM or other AI. Or written by one. Usually both
reply
New theory of HN: every post on LLMs will attract the "what is wrong with AI? I don't get it [even though I've posted to HN every day for weeks/months on LLM/AI topics]. Please enlighten me" types
reply