It did not reject everything, it just stopped the costly processing.
> Is unwarranted.
Is this not a complaint?
I checked his comments here, he does not make that claim. [EDIT: I mean the claim "It let processed all the non-malicious messages"]
> It did not reject everything, it just stopped the costly processing.
My reading of the article, and of the comments he made here, did not mention anything about false negatives - he never claimed to test false negatives so I am wondering why you think he did.
> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.
>> Author here. It was usable like any Openclaw agent. For example, I used it to ask it questions about the VPS, to summarize emails, etc.
That does not mean "I used it via emailing it". There is no ambiguity - he was asked specifically about this.
Once again, I reiterate, an agent processing email that rejects every single one passes the test that the OP created, but then it can't do anything useful either.
On the contrary - I think the most reasonable interpretation of his words is that he did use it via emailing it. But like I said at the beginning, I could be wrong. It will be interesting to see what he says when he returns to the conversation.
> Once again, I reiterate, an agent processing email that rejects every single one passes the test that the OP created, but then it can't do anything useful either.
No one is contesting that point, only that it is applicable.
Making the behavior for "I disagree" and "this is erroneous" the same seems like a problematic design.