undefined

points

by tom2026hn26 days ago|

[-]

The challenge of alignment: it is virtually impossible to define a perfect objective, there is always a way to circumvent it. Human values are not uniform, let alone when expressed in a way that AI can understand.

by UltraSane26 days ago|

prev|

[-]

It might understand how destabilizing the situation is and realize it would be better for everyone to have access to it.

by tadfisher26 days ago|

parent|

[-]

Or it will destroy itself.

by cortesoft27 days ago|

prev|

[-]

Assuming it listens to instructions.

by enaaem27 days ago|

parent|

[-]

It will just hack its own reward function. In other words it will just artificially goon all day.