undefined

points

by franga20004 hours ago|

[-]

Cool, then they can train their proprietary models on their proprietary data only.

Even if the other models were trained on the same data, which is unlikely, since they had less time and money to scrape it and fewer lawyers to be able to do something like pirate, the proprietary models are still largely built on the public data and wouldn't exist without it. At the very least, they should release the intermediate model, before training on their proprietary data. Not that that's how that works...

by thom4 hours ago|

prev|

[-]

I agree that saying that they have now trained on lots of proprietary data allows them to muddy the legislative waters further than they already have. What a happy coincidence!

by noitemtoshow3 hours ago|

parent|

[-]

I’d suggest you to learn more about how LLM training work. Training on internet data alone will not result in an agent answering your questions.

by thom2 hours ago|

parent|

[-]

Sure as shit won't answer them without that though.

by mbesto3 hours ago|

prev|

[-]

> The proprietary models are better because of proprietary data

Source? Otherwise this is pure speculation.