What I want is more fully open models where everything is shared: data, training algorithms, weights. That way we can figure out whether we should trust them.
I think it's also unfair to say their success is solely due to stealing data. They are contributing a lot of advances to the literature about what they are doing. The proof is in the results: we have 27B models you can vibe code with, not 1T+.
It's murky, sure. But there are smear campaigns about how people can't trust China, too. There's some truth to that, but we can't trust the US either, so local models are an interesting way for China to offer us some level of sovereignty.
Their models would be completely useless if they didn't train on stolen data, so no, it's not unfair at all.
If anything, it's a Robin Hood story.
Regardless, I doubt they would be useless at all. Alibaba has access to tons of data and they make Qwen. Qwen models are insane for their size.
That's what I said.