[Gllug] How do I improve spamassassin's usefulness?
Nix
nix at esperi.org.uk
Sat Aug 15 18:39:08 UTC 2009
On 11 Aug 2009, Richard Jones spake thusly:
> I'm using spamassassin on my work email, classifying email mostly by
> hand, and running sa-learn. Unfortunately SA is still very bad at
> sorting my email. After about 6 months of doing this, I still get
> lots of (very obvious) spam arriving in my inbox. I'm left thinking
> how long will SA take to get the idea that "EUROMILLIONS LOTTERY" is
> not a subject I'm interested in.
Are you using Justin Mason's sought rules?
<http://wiki.apache.org/spamassassin/SoughtRules>
> My email comes in from fetchmail and is filtered using a .procmailrc
> using the recommended recipe at the top:
>
> # Send mail through spamassassin
> :0fw: spamassassin.lock
> * < 512000
> | spamassassin
(Not spamc? spamassassin(1) has very high invocation overhead: it takes
over a second of maxed-out-core on my Nehalem...)
> :0:
> * ^X-Spam-Level: \*\*\*\*\*\*\*\*\*\*\*\*\*\*\*
> mail/spam
I've always found this annoyingly hard to read: ${MATCH} is much
clearer. Personally I do this:
,----
| # SpamAssassin, assassinate!
| :0 fw
| | spamc -d spamfilter.srvr.nix
|
| # Is this certain to be spam?
| :0 H:
| * ^X-Spam-Status: yes, +score=\/[^. ]*
| * ? (( ${MATCH} > 9 ))
| spambox
|
| # Is this merely likely to be spam?
| :0 H:
| * ^X-Spam-Flag: YES$
| blockbox
`----
> And I do get spam appearing in mail/inspect and mail/spam, so I am
> sure that SA does run. But I also get obvious spam with ridiculously
> low scores in my inbox.
>
> What am I doing wrong?
I'm not sure. Stick a sample of spam that got through (with headers)
somewhere and I'll see if it gets equally low scores here (with 3.2.x
and 3.3-to-be). Maybe it's a config error: there are some important
configuration items (like trusted_networks) that are often set wrongly
and badly damage accuracy until they're fixed.
Also, note that SA intentionally biases strongly towards allowing a
little spam through rather than mistakenly blocking legitimate email. I
get a spam or two a day through (and 2000-or-so blocked by SA). Even the
'likely' box only gets an FP once every year or so.
--
Gllug mailing list - Gllug at gllug.org.uk
http://lists.gllug.org.uk/mailman/listinfo/gllug
More information about the GLLUG
mailing list