upvote
A friend of mine prepared an arsenal of hooks and the like to address this and LLMs still disobey them at times.

I don't have high hopes that there exists a bulletproof solution to this.

reply
A friend of mine prepared an arsenal of hooks and the like to address this and LLMs still disobey them at times.

It's a model of language, yes? Trained on a big corpus of text.

I have read a lot of stories and accounts in which people were told not to do something and inevitably they did it. Like, lots. Far more than stories and accounts in which people were told not to do something and they then didn't do it.

If I'm reading a story or account of something, and it's really hammered home that they've been told not to do something, it's kind of inevitable that they will then do that. I'm not even an LLM and I noticed that's the way these things usually go.

So is an LLM just doing what it's been trained to do? Sometimes in the stories and accounts, there's a whole lot of time and tension before the bad thing happens, but that's just part of the fun.

reply
Personally, for me that represents job security. Having a human with a high level of domain knowledge in the loop seems pretty required to get any meaningful results.
reply
solution is to always do what it has seen in training data and how it was RL. But ai companies dont tell you that. so you have to reverse engineer its training and stick to to that.

These are no general purpose machines. They are shipping a subset mindset not general intelligence like they want us to belive .

reply
Not sure if this was ironic. I guess it mainly fails because a lot of knowledge and experience is intuitive and not codified.
reply
If compliance was the main issue we wouldn't have had to invent ways for computers to do something other than exactly what they were asked.
reply
We didn't have to do that. It is, in fact, extremely stupid that we have done that. Computers are valuable because they are fast and deterministic. Fast but stochastic has no value.
reply
What about slow and stochastic? 9 out of 10 managers seems to agree about this, at least it seems to be that, judging through their actions.
reply
> Fast but stochastic has no value.

He valuations of a bunch of AI unicorns disagree.

reply
Its all about a longshot gamble to replace workers and wages.
reply