upvote
This isn’t accurate - most of the style comes from the fine tuning and reinforcement learning, not from the original training data.

At some point people got this idea that LLMs just repeat or imitate their training data, and that’s completely false for today’s models.

reply
>This isn’t accurate - most of the style comes from the fine tuning and reinforcement learning, not from the original training data.

Fine-tuning, reinforcement learning, etc. are all 'training' in my book. Perhaps that's the confusion behind 'people got this idea'.

reply
> Fine tuning, reinforcement, etc are all 'training' in my books.

They are, but they have nothing to do with how frequent anything is in literature, which was your main point.

reply
Agreed. The pre-2025 base models don't write like this.
reply
So LLMs have gotten creativity recently?
reply
No, my point has nothing to do with creativity. It's that their output is tailored to look and sound a certain way in the later stages of model training; it's not representative of the original text data the base model was trained on.
reply
I used to use the em dash a lot. I refrain from doing it now. I hate that outcome.
reply
I refuse to let genAI determine what my writing style should be on principle. I may not be able to do much about the rest of the various degradations genAI brings, but I can at least stand my ground when it comes to my personal expression.
reply
>I used to use the em dash a lot. I refrain from doing it now. I hate that outcome.

I don't think it was ever taught in my schooling; the semicolon is what they taught us to use.

reply