undefined

points

[-]

I think this is basically obvious to anyone using one of these but they're just they like the utility trade off like sure it may leak and exfiltrate everything somewhere but the utility of these tools is enough where they just deal with that risk.

by mhher12 hours ago|

parent|

[-]

While I understand the premise I think this is a highly flawed way to operate these tools. I wouldn't want to have someone with my personal data (whichever part) that might give it to anyone who just asks nicely because the context window has reached a tipoff point for the models intelligence. The major issue is a prompt attack may have taken place and you will likely never find out.

by suprjami3 hours ago|

parent|

prev|

[-]

It feels to me there are plenty of people running these because "just trust the AI bro" who are one hallucination away from having their entire bank account emptied.

by ramoz2 hours ago|

prev|

[-]

Information Flow Control is highly idealistic unless there are global protocol changes across any sort of integration channel to deem trusted vs untrusted.

by dgellow13 hours ago|

prev|

[-]

could you share that study?

by mhher12 hours ago|

parent|

[-]

https://arxiv.org/abs/2512.13914

Among many more of them with similar results. This one gives a 39% drop in performance.

https://arxiv.org/abs/2506.18403

This one gives 60-80% after multiple turns.

by 12 hours ago|

parent|

prev|

[-]

deleted