l i n u x - u s e r s - g r o u p - o f - d a v i s
Next Meeting:
July 7: Social gathering
Next Installfest:
Latest News:
Jun. 14: June LUGOD meeting cancelled
Page last updated:
2003 Oct 14 19:36

The following is an archive of a post made to our 'vox-tech mailing list' by one of its subscribers.

Report this post as spam:

(Enter your email address)
Re: [vox-tech] bogofilter question
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [vox-tech] bogofilter question

På tisdag, 14 oktober 2003, skrev p@dirac.org:
> the bogofilter docs recommend that i should do this at about 10,000
> emails.  a bogofilter website (one of the developers) said this number
> should be more like 20,000.

That is rather extreme. I found that matching because really good after 1000

> that's absurd.  i've only seen a false positive once or twice, when
> i first started to use bogofilter.  false negatives are rare.  maybe one
> or two a week.
> are there any more experienced bogofilter people out there who thought
> about this issue?  if so, what was your conclusion?  i can't see doing
> this for much past 1000 emails in each ham/spam bin.

I think 1000 mails is plenty, unless you are obsessed with never getting a
false positive. LWN.net did a study on this and came to a similar conclusion
(sorry, i don't have the url handy.)

> lastly, the docs recommend not to share databases with other people
> because the whole point is to tailor bogofilter for the type of spam and
> ham that arrives in YOUR inbox.  not other people's inboxes.  otherwise,
> you might as well use a lexical analyzer like spamcop.  are there any
> experienced bogofilter users here that have thought about this issue?  i
> suspect the docs may overstate this claim.  we all get offered XXX
> videos, penis enlargements and international bank transfers.  but then
> again, i'm still vaguely a bogofilter newbie, so i'd like some guidance
> if anybody has actually thought about this issue.

I used to discount this, but I am starting to agree that sharing is bad since
I am now getting degraded matching quality (spams getting through when they
should not) with a shared database.

Henry House
The unintelligible text that may follow is a digital signature. 
See <http://hajhouse.org/pgp> for information.  My OpenPGP key:

Attachment: pgp00010.pgp
Description: PGP signature

LUGOD Group on LinkedIn
Sign up for LUGOD event announcements
Your email address:
LUGOD Group on Facebook
'Like' LUGOD on Facebook:

Hosting provided by:
Sunset Systems
Sunset Systems offers preconfigured Linux systems, remote system administration and custom software development.

LUGOD: Linux Users' Group of Davis
PO Box 2082, Davis, CA 95617
Contact Us

LUGOD is a 501(c)7 non-profit organization
based in Davis, California
and serving the Sacramento area.
"Linux" is a trademark of Linus Torvalds.

Sponsored in part by:
O'Reilly and Associates
For numerous book donations.