David Relson | 1 Sep 2006 05:00
Favicon

bogofilter 1.1.1 - New Stable Release

Bogofilter v1.1.1 has new parsing options for setting the minimum
token length, maximum token length, max number of words per token
(pairs, triples, etc), and max length of a multi word token.  These
options are:

      --min-token-len=N           min len for single tokens
      --max-token-len=N           max len for single tokens
      --max-multi-token-len=N     max len for multi-word tokens
      --multi-token-count=N       number of tokens per multi-word token

These options provide additional flexibility in parsing an email
message.  When used with a count greater than one, the
multi-token-count option does seem to increase bogofilter's accuracy.
However it also increase the number of tokens processed per message,
will _significantly_ increase the size of your wordlist (database),
and _will_ make bogofilter run slower.

Version 1.1.1 improved on 1.1.0 with a minor token parsing fix, a new
Italian FAQ, and cleaned up formatting for the English and French
FAQs.

########################################################################

Additional information is in my 8 posting titled "multi-word token
results" made on Aug 8 to the bogofilter-dev mailing list.  Copies can
be found at

    http://www.bogofilter.org/pipermail/bogofilter-dev/2006-August/003357.html

    http://article.gmane.org/gmane.mail.bogofilter.devel/4226
(Continue reading)


Gmane