undefined

points

[-]

I’m given to understand that Anthropic uses something called Constitutional AI, where there is a central document of desirable and undesirable qualities (as well as reinforcement learning) whereas OpenAI relies more heavily on direct human feedback and rating with human trainers evaluating responses and the model conforming to those preferences.

I also much prefer the output of Claude at present.

by kccqzy14 hours ago|

parent|

[-]

Yeah and for much of the HN crowd, we aspire to have better tastes than the average. So if the supervised learning uses average human trainers it will most likely be seen as having poor taste for much of HN.

by vasco6 hours ago|

parent|

[-]

Speak for yourself my taste is average and I aspire for it to remain so.

by jimbokun9 hours ago|

prev|

[-]

I think the “taste” approach at Apple died with Steve Jobs.

by tikhonj11 hours ago|

prev|

[-]

Eh, Facebook today is farther from what anybody "wants" than macOS 26, and Facebook is about as blindly data-driven as they come.

Turns out you can get away with a lot when you have a quasi-monopoly on an addictive product, and you buy out your realistic competitors...

by stefan_4 hours ago|

prev|

[-]

There was a time when also Claude would absolutely fill code with emojis, which is why now their system prompt has

> Claude does not use emojis unless the person in the conversation asks it to

by tardedmeme2 hours ago|

parent|

[-]

I think it's funny how we are all tweaking LLM output by adding instructional tokens instead of, say, finding a vector that indicates "user asked for emojis", and forbidding emoji tokens in the sampling unless that vector passes a threshold.