undefined

points

[-]

If you wouldn't mind chatting about your usage, my email is in my profile, and I'd love to share experiences with other HNers using self-hosted models.

by jeffbee6 hours ago|

prev|

[-]

Does spam filtering really need a better model? My impression is that the whole game is based on having the best and freshest user-contributed labels.

by drob5182 hours ago|

parent|

[-]

He said it’s a benchmark.

by hrmtst938374 hours ago|

parent|

prev|

[-]

Better models help on the day the spam mutates, before you have fresh labels for the new scam and before spammers can infer from a few test runs which phrasing still slips through. If you need labels for each pivot you're letting them experiment on your users.

by jeffbee3 hours ago|

parent|

[-]

In my experience the contents of the message are all but totally irrelevant to the classification, and it is the behavior of the mailing peer that gives all the relevant features.