undefined

points

[-]

Sure, it's not malicious. But it is very eager to get things done, and surprisingly inventive and knowledgeable in all kinds of workarounds.

by furyofantares18 hours ago|

prev|

[-]

I've many times seen Claude try to execute a command that it's not supposed to, the harness prevents it, and then it writes and executes a python script to do it.

by j16sdiz16 hours ago|

parent|

[-]

breaking a chroot takes more than that..

by furyofantares5 hours ago|

parent|

[-]

How much more? Depends on the system doesn't it? I don't know how many systems have proc mounted but don't you get it from /proc/self/root?

Anyway that's beside the point, which is that it doesn't have to "be malicious" to try to overcome what look like errors on its way to accomplishing the task you asked it to do.

by hoppp8 hours ago|

parent|

prev|

[-]

That doesn't mean claude can't do it, chroot is better than nothing but not a real solution

by nofriend18 hours ago|

prev|

[-]

Malice is not required. If it thinks it is in the right, then it will do whatever it takes to get around limitations.

by lxgr11 hours ago|

prev|

[-]

Until it gets prompt injected. Are you reading every single file your agent reads as part of the tasks you give it, including content fetched from the web or third-party packages?

by karhagba18 hours ago|

prev|

[-]

Claude is far from stupid from my experience. I've used so many models and Claude is king.