<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Erning.write() &#187; spambayes</title>
	<atom:link href="http://erning.net/tags/spambayes/feed" rel="self" type="application/rss+xml" />
	<link>http://erning.net</link>
	<description>Reloading</description>
	<lastBuildDate>Thu, 15 Sep 2011 10:04:33 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Filter spam and virus</title>
		<link>http://erning.net/archives/filter-spam-and-virus</link>
		<comments>http://erning.net/archives/filter-spam-and-virus#comments</comments>
		<pubDate>Mon, 27 Oct 2003 21:46:49 +0000</pubDate>
		<dc:creator>erning</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[anti-spam]]></category>
		<category><![CDATA[spambayes]]></category>

		<guid isPermaLink="false">http://www.erning.net/archives/2003/10/28/filter-spam-and-virus/</guid>
		<description><![CDATA[I&#8217;m currently working on a large email system which will serives millions of mailboxes &#8211; like hotmail.com. Except sending/receiving emails, the most important feature is spam and virus filtering. I&#8217;m using SpamBayes for spam filtering and AnomySanitizer plus ClamAV for virus filtering. They look working well. Here&#8217;s a Chinese document about how to configurate SpamBayes, [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m currently working on a large email system which will serives millions of mailboxes &#8211; like hotmail.com.</p>
<p>Except sending/receiving emails, the most important feature is spam and virus filtering.  I&#8217;m using <a href="http://spambayes.sourceforge.net/">SpamBayes</a> for spam filtering and <a href="http://mailtools.anomy.net/">AnomySanitizer</a>  plus <a href="http://clamav.sourceforge.net/">ClamAV</a> for virus filtering.  They look working well.</p>
<p><a href="http://www.dragonsoft.net/cgi-bin/moin.cgi/_e5_9c_a8_e5_a4_a9_e5_a5_87_e9_82_ae_e4_bb_b6_e7_b3_bb_e7_bb_9f_e4_b8_8a_e5_8a_a0_e8_a3_85_e5_9e_83_e5_9c_be_e9_82_ae_e4_bb_b6_e5_92_8c_e7_97_85_e6_af_92_e8_bf_87_e6_bb_a4_e5_99_a8">Here&#8217;s a Chinese document</a> about how to configurate SpamBayes, AnomySanitizer and ClamAV on an exists email system.</p>
<p>The next problem is to make the email system effective and scalable. Anyone has suggestion?</p>
]]></content:encoded>
			<wfw:commentRss>http://erning.net/archives/filter-spam-and-virus/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Server Site Spambayes II</title>
		<link>http://erning.net/archives/server-site-spambayes-ii</link>
		<comments>http://erning.net/archives/server-site-spambayes-ii#comments</comments>
		<pubDate>Thu, 25 Sep 2003 21:38:55 +0000</pubDate>
		<dc:creator>erning</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[anti-spam]]></category>
		<category><![CDATA[spambayes]]></category>

		<guid isPermaLink="false">http://www.erning.net/archives/2003/09/26/server-site-spambayes-ii/</guid>
		<description><![CDATA[Since everyone has a different idea of what is spam. The spam filter with a global database will filter some useful emails. It&#8217;s not acceptable for customers although there&#8217;re only 1% useful emails filtered as spam. So we individual spam databases for each mailbox and let the customers to train the spam filter themselves. Three [...]]]></description>
			<content:encoded><![CDATA[<p>Since everyone has a different idea of what is spam. The spam filter with a global database will filter some useful emails. It&#8217;s not acceptable for customers although there&#8217;re only 1% useful emails filtered as spam. So we individual spam databases for each mailbox and let the customers to train the spam filter themselves.</p>
<p>Three buttons was added to the web mail. &#8216;Delete as Spam&#8217;, &#8216;Recover from Spam&#8217; and &#8216;Reset Spam Filter&#8217;. It&#8217;s similar to SpamBayes plug-in for Outlook.</p>
<p>I&#8217;ve configured the solution on a production server. Below is the summary of the e-mail server&#8217;s log summary for Sep. 25,</p>
<pre>
<code>
Grand Totals
------------
messages

  87812   received
  56351   delivered
   2012   forwarded
    606   deferred  (5096  deferrals)
   1342   bounced
  54150   rejected (49%)
      0   reject warnings
      0   held
      0   discarded (0%)

   3486m  bytes received
   3816m  bytes delivered
  13591   senders
   4993   sending hosts/domains
   6937   recipients
   1750   recipient hosts/domains
</code>
</pre>
]]></content:encoded>
			<wfw:commentRss>http://erning.net/archives/server-site-spambayes-ii/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Server Site Spambayes</title>
		<link>http://erning.net/archives/server-site-spambayes</link>
		<comments>http://erning.net/archives/server-site-spambayes#comments</comments>
		<pubDate>Sat, 20 Sep 2003 21:18:26 +0000</pubDate>
		<dc:creator>erning</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[anti-spam]]></category>
		<category><![CDATA[spambayes]]></category>

		<guid isPermaLink="false">http://www.erning.net/archives/2003/09/21/server-site-spambayes/</guid>
		<description><![CDATA[Installed and tested Spambayes on one of the productive email server of our company. It works very well. About 30,000 (around 1GByte) junk emails are filtered per day. The email server hosts more than 2,500 domains and 30,000 mailboxes. Currently, we use global spam recognition database and only drop the very certain junk emails. I [...]]]></description>
			<content:encoded><![CDATA[<p>Installed and tested Spambayes on one of the productive email server of our company. It works very well. About 30,000 (around 1GByte) junk emails are filtered per day.<br />
The email server hosts more than 2,500 domains and 30,000 mailboxes.</p>
<p>Currently, we use global spam recognition database and only drop the very certain junk emails. I think it&#8217;s better to setup database for each mailbox and provide an user friendly control panel.</p>
]]></content:encoded>
			<wfw:commentRss>http://erning.net/archives/server-site-spambayes/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Anti-Spam</title>
		<link>http://erning.net/archives/anti-spam</link>
		<comments>http://erning.net/archives/anti-spam#comments</comments>
		<pubDate>Sat, 26 Jul 2003 21:15:12 +0000</pubDate>
		<dc:creator>erning</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[anti-spam]]></category>
		<category><![CDATA[spambayes]]></category>

		<guid isPermaLink="false">http://www.erning.net/archives/2003/07/27/anti-spam/</guid>
		<description><![CDATA[Do you hate spam? I do. I get hundreds of spam a day! I&#8217;ve tried several methods to stop spam. Outlook rules, procmail regex filters, black/white lists and others. But all of them are not clever enough. After tried SpamBayes, I believe the filters can kill spam. I&#8217;ve installed a spam filter plug-in (SpamBayes) for [...]]]></description>
			<content:encoded><![CDATA[<p>Do you hate spam? I do. I get hundreds of spam a day! I&#8217;ve tried several methods to stop spam. Outlook rules, procmail regex filters, black/white lists and others. But all of them are not clever enough. After tried SpamBayes, I believe the filters can kill spam.</p>
<p>I&#8217;ve installed a spam filter plug-in (SpamBayes) for my Outlook 2002. I just gave 10 spam emails and 10 non-spam emails to the filter. After the training the filter works very well. And I was able to disable the stupid Outlook&#8217;s rules.</p>
<p>But it still have a problem. I have to receive all mails and filter them locally. It should be better to recognize the spams at the server-side. So I decided to setup a server-side filter. Fortunately, SpamBayes works well with procmail.</p>
<p>Our email system is powered by the last stable postfix. I am very care about secure issue, so SMTPs, POP3s, IMAPs and HTTPs have been configuated for sending, receiving and reading email. To enable the SpamBayes filter, I just add a few rules to the procmailrc file. Now, all coming emails will be handled by SpamBayes first and delivered to different folders. Spam will be delivered to &#8220;Spam Certain&#8221; folder, unsure will be delivered to &#8220;Spam Unsure&#8221; folder and ham will be leave in the Inbox.</p>
<p>SpamBayes&#8217; accuracy is dependent upon having a &#8220;sufficient&#8221; sample from which to make its decisions. I have to teach SpamBayes which is spam and which is not. There&#8217;re two special folders named &#8216;Filter Train/Ham&#8217; and &#8216;Filter Train/Spam&#8217; in my mailbox. I wrote a small script to let SpamBayes train from the two folders and empty them after a successful training. Copy the wrong delivered emails and unsure emails to the Ham and/or Spam folders. SpamBayes will train and handle them correctly next time.</p>
<p>Everyone has a different idea of what is spam. So we have individual spam databases for each user.</p>
<p>Now, guys use our email system are able to receive non-spam emails to local mailbox using POP3s and train the filter using IMAPs or the webmail. It saves the time, bandwidth and money.</p>
<p><a href="http://www.flickr.com/photos/zendragon/19431086/" title="Photo Sharing"><img src="http://photos14.flickr.com/19431086_e8b30b9cbf_m.jpg" width="240" height="181" alt="anti-spam-webmail" /></a></p>
<p>Next, I am trying to find a better anti-spam solution for large scale. For instance, OnlineNIC Inc. the company I am working for currently. It hosts thousands of virtual hosts, email boxes. The solution should be clever, effective, customizable. And it should be able to filter both incoming and outgoing emails.</p>
<p>Resources,<br />
<a href="http://spambayes.sourceforge.net/">SpamBayes</a><br />
<a href="http://www.paulgraham.com/wfks.html">Will Filters Kill Spam? </a><br />
<a href=" http://www.onlinenic.com/">OnlineNIC Inc.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://erning.net/archives/anti-spam/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

