[Gllug] A few words on the topic of stock spam

Nix nix at esperi.org.uk
Tue Jul 10 20:56:07 UTC 2007


On 10 Jul 2007, Martin A. Brooks verbalised:

> May be of interest to some, mostly won't be of interest to anyone.
>
> http://blog.hinterlands.org/2007/07/10#20070710

FWIW, FuzzyOCR with a pipeline that converts the JPEGs to PNM images and
then OCRs them as usual does a reasonable job on this (if you ignore the
hokey, horrible method FuzzyOCR uses to identify spammy words: I really
must get this stuff fed through Bayes like everything else).
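
As a rough illustration of the word-identification step only (this is
not FuzzyOCR's actual code), here is a minimal Python sketch of
approximate matching of OCR output against a word list. The word list,
the function names and the 0.75 cutoff are all hypothetical, chosen
purely for the example:

```python
import difflib
import re

# Hypothetical word list; a real setup would load its own.
SPAM_WORDS = ["stock", "investor", "shares", "symbol", "price"]


def tokenize(ocr_text):
    """Lower-case the OCR output and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", ocr_text.lower())


def fuzzy_hits(ocr_text, words=SPAM_WORDS, cutoff=0.75):
    """Return the set of list words approximately matched by some token.

    OCR often misreads characters (0 for o, 1 for i), so an exact
    lookup would miss them; difflib's similarity ratio tolerates that.
    """
    hits = set()
    for token in tokenize(ocr_text):
        match = difflib.get_close_matches(token, words, n=1, cutoff=cutoff)
        if match:
            hits.add(match[0])
    return hits
```

With this, fuzzy_hits("St0ck pr1ce alert") picks up both "stock" and
"price" despite the misread characters. A Bayesian approach would
instead hand the tokens from tokenize() to the classifier and let it
learn which ones are spammy, rather than relying on a fixed list.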

-- 
`... in the sense that dragons logically follow evolution so they would
 be able to wield metal.' --- Kenneth Eng's colourless green ideas sleep
 furiously
-- 
Gllug mailing list  -  Gllug at gllug.org.uk
http://lists.gllug.org.uk/mailman/listinfo/gllug



