upvote
That rule can be followed, but it gets a little tricky when mixed up with the other ten thousand rules that it's following at any given time.
reply
"The model refuses to follow my specific word detail prompts" and "The model refuses to perform hacking attempts" are on the same side of the model refusing to do something baked into it though.
reply