upvote
Data curation is important and expensive and frontier labs can afford to do it right. Natural data isn't the limitation, we are already literally out of tokens. It doesn't matter how much you poison things it's not going to stop the progress train.
reply
Who's doing llm seo right now? How does that work when you only gets feedback every few months when a new model is out?
reply
I'm pretty sure the Optimization part is just ... not present at all.

This is how we get LLM summaries presenting something mentioned once by some nutjob in a reddit thread as bona fide FACT

reply
Look at G2.com - they found their website is highly references by AIs and they are leaning into it hard.
reply