But I avoid unnecessary emotion in my prompts because I don't want potentially distracting activations. Kind of like communicating with humans.
> impolite prompts consistently outperformed polite ones, with accuracy ranging from 80.8% for Very Polite prompts to 84.8% for Very Rude prompts.
Unless the mechanism is understood, my assumption is that this is a moving target.
How so? Plenty of swearing in lots of training data, especially older code, e.g. in Linux.
https://www.anthropic.com/research/emotion-concepts-function