Hacker News
new
past
comments
ask
show
jobs
points
by
StevenWaterman
27 days ago
|
comments
by
tom2026hn
26 days ago
|
next
[-]
The challenge of alignment: it is virtually impossible to define a perfect objective, there is always a way to circumvent it. Human values are not uniform, let alone when expressed in a way that AI can understand.
reply
by
UltraSane
26 days ago
|
prev
|
next
[-]
It might understand how destabilizing the situation is and realize it would be better for everyone to have access to it.
reply
by
tadfisher
26 days ago
|
parent
|
[-]
Or it will destroy itself.
reply
by
cortesoft
27 days ago
|
prev
|
[-]
Assuming it listens to instructions.
reply
by
enaaem
27 days ago
|
parent
|
[-]
It will just hack its own reward function. In other words it will just artificially goon all day.
reply