by minimaxir 17 hours ago
by landl0rd 14 hours ago
There’s been some, but naive activation steering makes models dumber pretty reliably and training an SAE is a pretty heavy lift.