If your AI alignment strategy is so fragile that it breaks when people simply discuss potential problems with it, then you didn't really have an alignment strategy to begin with.
I, for one, don't have a problem with the prevailing opinion that AI alignment should be heavily based on the writings of Karl Marx (obviously not his private letters where he discusses prostitutes) and Ted Kaczynski, as well as '70s exploitation films.
Personally, I'd prefer it be trained solely on Rothbard's works.
OK, but alignment cuts both ways. Do you want your model pushing anti-vaccine talking points and advocating for ivermectin?