Source: Pajunas Interactive Blog

Per-domain Bayesian databases in SQL

We store all Bayesian and whitelist data for SpamAssassin in a PostgreSQL database. Keeping it in a database allows all members of our email cluster to access the same data.

Bayesian processing works by noticing lots of small, distinctive snippets (tokens) and storing them for future reference. If many tokens were found in messages flagged as spam, then future messages containing those tokens are more likely to be spam as well. The same weighting applies to non-spam messages. Over time, tracking these tokens improves the quality of your spam detection.

We have been running a SQL-backed Bayesian instance for several months, with remarkable results.
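
The token weighting described above can be illustrated with a minimal sketch. This is not SpamAssassin's actual algorithm or schema: the `TokenStore` class, its method names, the tokenizer, and the Laplace smoothing are all assumptions made for illustration, and an in-memory counter stands in for the shared SQL tables.

```python
import math
import re
from collections import Counter


class TokenStore:
    """Toy in-memory token store (hypothetical; stands in for shared SQL tables)."""

    def __init__(self):
        self.spam_tokens = Counter()  # token -> number of spam messages containing it
        self.ham_tokens = Counter()   # token -> number of non-spam messages containing it
        self.spam_msgs = 0
        self.ham_msgs = 0

    @staticmethod
    def tokenize(text):
        # Assumed tokenizer: lowercase words, each counted once per message.
        return set(re.findall(r"[a-z0-9$']+", text.lower()))

    def learn(self, text, is_spam):
        """Record the tokens of a message that was flagged as spam or non-spam."""
        tokens = self.tokenize(text)
        if is_spam:
            self.spam_tokens.update(tokens)
            self.spam_msgs += 1
        else:
            self.ham_tokens.update(tokens)
            self.ham_msgs += 1

    def spam_probability(self, text):
        """Combine per-token evidence as log-odds (naive Bayes, Laplace-smoothed)."""
        log_odds = math.log((self.spam_msgs + 1) / (self.ham_msgs + 1))
        for tok in self.tokenize(text):
            p_spam = (self.spam_tokens[tok] + 1) / (self.spam_msgs + 2)
            p_ham = (self.ham_tokens[tok] + 1) / (self.ham_msgs + 2)
            log_odds += math.log(p_spam / p_ham)
        return 1 / (1 + math.exp(-log_odds))


store = TokenStore()
store.learn("cheap pills buy now", is_spam=True)
store.learn("buy cheap watches now", is_spam=True)
store.learn("meeting notes attached", is_spam=False)
store.learn("lunch at noon tomorrow", is_spam=False)

spammy = store.spam_probability("buy cheap pills")
clean = store.spam_probability("meeting at noon")
```

Tokens seen mostly in spam push the score toward 1, and tokens seen mostly in non-spam push it toward 0. In a clustered setup the two counters would live in shared database tables so that every mail server learns from, and scores against, the same data.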
