upvote
I’m given to understand that Anthropic uses something called Constitutional AI, where there is a central document of desirable and undesirable qualities (as well as reinforcement learning) whereas OpenAI relies more heavily on direct human feedback and rating with human trainers evaluating responses and the model conforming to those preferences.

I also much prefer the output of Claude at present.

reply
Yeah and for much of the HN crowd, we aspire to have better tastes than the average. So if the supervised learning uses average human trainers it will most likely be seen as having poor taste for much of HN.
reply
Speak for yourself my taste is average and I aspire for it to remain so.
reply
I think the “taste” approach at Apple died with Steve Jobs.
reply
Eh, Facebook today is farther from what anybody "wants" than macOS 26, and Facebook is about as blindly data-driven as they come.

Turns out you can get away with a lot when you have a quasi-monopoly on an addictive product, and you buy out your realistic competitors...

reply
There was a time when also Claude would absolutely fill code with emojis, which is why now their system prompt has

> Claude does not use emojis unless the person in the conversation asks it to

reply
I think it's funny how we are all tweaking LLM output by adding instructional tokens instead of, say, finding a vector that indicates "user asked for emojis", and forbidding emoji tokens in the sampling unless that vector passes a threshold.
reply