DSPAM v3.6 Released

Posted by ScuttleMonkey on Monday October 17, 2005 @08:04AM from the spam-canned dept.

Nuclear Elephant writes "After six months of development, DSPAM v3.6 has been released. The most notable change is the series of new features added to make an anti-spam gateway appliance possible (Knoppix anyone?). Version 3.6 also includes a highly accurate alternative to Bayesian filtering known as Markovian discrimination, based on Bill Yerazunis' research. Other significant enhancements include trusted sender whitelisting, integrated Clam Antivirus and LDAP support, a centralized spam training alias, and a new dependency-free storage driver. Much of the documentation has also been rewritten to make installation easier. A change log and release notes are also available. Slashdot has recently featured a review of the author's book, Ending Spam, and an interview as well."

This discussion has been archived. No new comments can be posted.

Comparison to other tools (Score:2, Insightful)

by Puramoca ( 703835 ) writes:

It would be interesting to compare this version to other spam filters and see how it measures up.
- Re:Comparison to other tools (Score:3, Informative)
  
  by gvc ( 167165 ) writes:
  
  TREC [nist.gov]'s Spam Track [uwaterloo.ca] will evaluate several spam filters. There's also a toolkit for do-it-yourself comparison.
  Although DSPAM is not an official participant at TREC, three configurations will be evaluated for comparison - with TUM, TOE, and TEFT training modes. Zdziarski reported some of the preliminary results in his interview, but complete and comparative results won't be available until TREC in November.
- Re:Comparison to other tools (Score:2, Informative)
  
  by pushf popf ( 741049 ) writes:
  
  While it's great that it learns and makes decisions about the "spamminess" of various incoming items, the most reliable method I've found so far is Greylisting.
  
  The moment I installed and started GLD (gasmi.net), the spam simply stopped. It was like flipping the "nospam" switch on. The spam just stopped. No false positives, no missed spam, nothing.
  
  Every now and then I get unwanted email, but at least now it's from an actual, identifiable SMTP server, not a spam-bot.
  
  It's an amazing improvement from i
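For readers who have not met greylisting, the idea in the comment above can be sketched briefly: mail from an unseen (client IP, sender, recipient) triplet is temporarily refused, and it is accepted only when the sending server retries after a delay. This is a minimal Python sketch and not how GLD itself is implemented; the class name, the 300-second delay, and the reply strings are assumptions chosen for illustration.

```python
import time


class Greylist:
    """Toy in-memory greylist keyed on (client IP, envelope sender, recipient)."""

    def __init__(self, min_delay=300, clock=time.time):
        self.min_delay = min_delay  # seconds a new triplet must wait (assumed value)
        self.clock = clock          # injectable clock, handy for testing
        self.first_seen = {}        # triplet -> time of the first delivery attempt

    def check(self, client_ip, sender, recipient):
        """Return an SMTP-style reply for one delivery attempt."""
        triplet = (client_ip, sender.lower(), recipient.lower())
        now = self.clock()
        # Remember when this triplet was first seen; later attempts reuse that time.
        first = self.first_seen.setdefault(triplet, now)
        if now - first >= self.min_delay:
            return "250 OK"
        return "451 Temporary failure, please retry later"
```

A standards-following mail server queues the message and retries after a 4xx reply, so legitimate mail is only delayed. The approach depends on spam bots not retrying, which matches the experience reported above but is not guaranteed.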
finally (Score:1)

by antivoid ( 751399 ) writes:

Finally a decent anti-spamming utility. There's been a lot of hype around this product and it is not out of place. I like the way it's (at least partially) integrated with clam(win?). I still feel it won't be long before spammers find ways around this tool... but for now, great, I'm definitely using it.
Windows and Exchange. (Score:5, Interesting)

by Jaruzel ( 804522 ) writes: on Monday October 17, 2005 @08:15AM (#13808337) Homepage Journal

I know I'm going to get mauled over this question... but has anyone compiled it on Windows 2003 Server?

For practical reasons I don't have linux in my test lab, and I'd like to have DSpam on my Webserver which is running IIS6 and Windows 2003 Server.

I can see I need to run it in SMTP mode with a relay to my Exchange box, but I don't want to waste my time trying to compile it (using Visual Studio), if someone already knows it won't work.

-Jar.

- Most likely need cygwin. (Score:2)
  
  by khasim ( 1285 ) writes:
  
  That was how earlier versions worked. I don't know of anyone who actually got them to work natively under Windows.
- Re:Windows and Exchange. (Score:5, Informative)
  
  by myspys ( 204685 ) writes: on Monday October 17, 2005 @08:40AM (#13808400) Homepage
  
  from the FAQ (http://dspam.nuclearelephant.com/faq.shtml#1.15 [nuclearelephant.com])
  
  Q. Does it work with Windows?
  A. v3.2 is the first to include a Windows build supplement, which includes the necessary Visual C++ project files and portage to compile the agent and tools under Windows. Check out the win32/ directory in the source tree for more information. Win32 support is still unofficial, but seems to work well. Of course getting it compiled is one thing, getting it integrated is another. It's probably best to build it under Cygwin using the general distribution.
  
  - Re:Windows and Exchange. (Score:2)
    
    by wwwillem ( 253720 ) writes:
    
    A. v3.2 is the first to include a Windows build supplement
    I downloaded version 3.6.0, but there seems to be nada :) support for Visual C. No win32 directory to be found. However on the download page, in the unsupported section, there was also DSPAM v. 3.2.8 [nuclearelephant.com], which indeed does contain the Windows stuff.
- Linux Router (Score:4, Interesting)
  
  by Stavr0 ( 35032 ) writes: on Monday October 17, 2005 @08:46AM (#13808421) Homepage Journal
  
  I know I'm going to get mauled over this question... but has anyone compiled it on Windows 2003 Server? (Release the hounds!)
  How about getting it compiled into a Linksys WRT54G router firmware, e.g. the Sveasoft firmware?
  
  - Re:Linux Router (Score:4, Informative)
    
    by op00to ( 219949 ) writes: on Monday October 17, 2005 @08:57AM (#13808474)
    
    DSPAM, as it's running in my cluster, is using way more RAM than the WRT54G physically has. Probably not a good idea to run it on that little box.
    
  - Re:Linux Router (Score:2)
    
    by maggard ( 5579 ) writes:
    
    My understanding is this sort of filtering isn't practical on any of the consumer routers due to their limited memory. The applications load the email messages to scan them, and between the OS code, the scanning package, and the email being scanned there simply isn't enough memory to hold it all, even on the larger WRT54GS units. My own hope is that Cisco's Linksys subsidiary eventually 'gets smart' and releases a combination WRT54GS / NSLU2 / PAP2 appliance, with more RAM, that is Linux-based and hackable
- Re:Windows and Exchange. (Score:1)
  
  by Nuclear Elephant ( 700938 ) writes:
  
  Version 3.4 has win32 support, but nobody wanted to maintain the build kit. It stopped working with 3.6 and was removed. You can build 3.4 natively in Windows, or you can build 3.6 under Cygwin.
  - Re:Windows and Exchange. (Score:2)
    
    by Jaruzel ( 804522 ) writes:
    
    Cheers for that, Mr Elephant. :)
    
    I owe you One (1) Beer.
    
    -Jar.
SPAM (TM) (Score:1)

by Uukrul ( 835197 ) writes:

Aren't there any trademark problems with DSPAM?
SPAM [spam.com] is a registered trademark of Hormel Foods Corporation, and DSPAM isn't Monty Python [montypythonsspamalot.com].
Still getting on Hormel's nerves, I suppose (Score:1, Informative)

by Anonymous Coward writes:

DSPAM is also noted for their trademark spat with Hormel, who tend to be nice about "spam" as a term until it's spelled in all-caps. (Previous Slashdot coverage.) [slashdot.org]
Too late (Score:3, Funny)

by mordors9 ( 665662 ) writes: on Monday October 17, 2005 @08:30AM (#13808381)

But the great news is this product is no longer needed. After all the FBI has put a stop to all of that: http://www.detnews.com/2005/technology/0510/16/B01-349738.htm [detnews.com] (For those that are easily confused, the comment was tongue in cheek)

- Re:hiding your address (Score:5, Insightful)
  
  by Bogtha ( 906264 ) writes: on Monday October 17, 2005 @08:51AM (#13808440)
  
  Though this is only possibly with PHP, ideally running on a Debian system, it's the most important language to learn in the universe.
  
  What kind of fuckwittery is this? No, plenty of languages can code a simple contact form handler, the platform you run it on is pretty irrelevant, and PHP is by no means "the most important language to learn in the universe". It's a pretty typical scripting language, not the magic you make it out to be.
  
  - Re:hiding your address (Score:2)
    
    by shaitand ( 626655 ) writes:
    
    Aye. It is pretty obvious the gp is something of a fuckwit. However, for its intended purpose PHP is practically magic. Personally I have always been something of a Perl addict and then one day I was pondering some web work and decided to dive into php by recoding a couple of perl scripts in php. I was simply amazed at how much more simply one can do web cgi's in php.
    
    For just about everything else there is still perl (which is definitely superior to php in every NON web task) and when perl fails there is C (
    - Re:hiding your address (Score:2)
      
      by imroy ( 755 ) writes:
      
      Have you been living under a rock for the last ten years? Of course web programming in PHP is easier than CGI! Just about anything is easier than CGI, no matter what language the CGI script is programmed in. If you want a similar (but more powerful) PHP-like environment for Perl, I highly recommend HTML::Mason [masonhq.com]. Two other interesting mod_perl environments are AxKit [axkit.org] (centred around XML and XSLT) and Catalyst [perl.org] (a tight MVC framework). But they both are rougher to develop on, requiring restarts of Apache to lo
      - Re:hiding your address (Score:2)
        
        by shaitand ( 626655 ) writes:
        
        shhh don't tell anyone but when you program in PHP you ARE still programming to the CGI ;) In fact everything you mentioned above still interacts with the client via CGI and html/xhtml just like it has for the last 10 years.
        
        Re:hiding your address (Score:1)
        
        by hobbit ( 5915 ) writes:
        
        1990 called, they want their webserver back.
        
        Why not use Apache + mod_perl/mod_php, like the vast majority of souls in the known universe?
        
        Re:hiding your address (Score:2)
        
        by cloudmaster ( 10662 ) writes:
        
        Since you may have been serious - CGI stands for "Common Gateway Interface". In other words, CGI defines the "common" "interface" between the browser and the webserver (aka "gateway"). Many early CGI programs were written with perl, and several still are. I've written several CGI programs in C, PHP, perl, and bash - among others (Cold Fusion is something I'd like to forget - what a POS!). Using mod_blah generally just moves the interpreter (or parts of it) into the web server so you save the launch time
        
        Re:hiding your address (Score:2)
        
        by imroy ( 755 ) writes:
        
        Huh? My understanding of CGI was that it defines the interface between the web server and the program/script. It defines how the URL, headers, and POST variables are passed to it, and how the program/script returns the page and status code. The Apache modules like mod_perl, mod_php, and mod_python put the interpreter into the web server, eschewing the overhead of launching a program (and parsing the perl/python) for each request. Thus the interface is an internal Apache API instead of the CGI. Now, mod_perl
        
        Re:hiding your address (Score:2)
        
        by cloudmaster ( 10662 ) writes:
        
        You're right - it's the interface between the app and the server. Doh. :) Though, isn't the mod_* API more of a superset of CGI rather than a replacement? Trying to save a little face here... ;)
        
        Re:hiding your address (Score:2)
        
        by shaitand ( 626655 ) writes:
        
        Don't worry CGI is NOT just an interface between the webserver and the application. CGI also defines much of the information the browser is required to exchange with the webserver.
  - Re:hiding your address (Score:1)
    
    by dvaldenaire ( 52153 ) writes:
    
    I think this is the kind of things like, you know, "humour".
    
    As you know, comments on PHP vs. Other Scripting Languages are totally useless... because PHP is the best. (the same joke with Debian and Other Distribs is left as an exercise to the reader...)
  - Re:php problems -- too specialized (Score:1)
    
    by ToyKeeper ( 17042 ) writes:
    
    I must agree about PHP being un-magical. It's great for one or two specific purposes, but is pretty lacking for anything else. Want a simple web email form? It'd be hard to find an easier way to do it than PHP. But if you want a large web application, it's worth trying other languages. What's magical and amazing is that people have built incredible things with it despite its shortcomings -- projects like Drupal and Mediawiki are sheer wizardry.
    
    I've been keeping a list of problems with PHP [toykeeper.net], if anyo
- Re:hiding your address (Score:2)
  
  by kimba ( 12893 ) writes:
  
  The best defense against spam is never to type your personal address anywhere on the internet.
  
  You have to do more than that. You also have to not email anyone, and also not have an easy to guess username.
  
  The problem is, you can never publish your email address anywhere - and someone else will gladly do it for you. All it takes is one person you have emailed to come down with an email virus, which then propagates your address all over the net.
  
  Email address synthesis will also guarantee unless you have the m
- Re:hiding your address (Score:4, Interesting)
  
  by BigJim.fr ( 40893 ) writes: <jim@liotier.org> on Monday October 17, 2005 @08:53AM (#13808457) Homepage
  
  > The best defense against spam is never to type your
  > personal address anywhere on the internet.
  
  Hiding your address does not work because some viruses collect addresses from your correspondents' address books. Your address will percolate to spam lists, it is only a matter of time. If like me you have kept your address for many years, you absolutely need some form of spam defense.
  
  - Re:hiding your address (Score:2)
    
    by Antique Geekmeister ( 740220 ) writes:
    
    Also, spammers steal addressbooks or buy them from unethical employees, others make partnership contracts where you've submitted a contact address and use those contacts to get spam addresses, some spammers use alphabetical or name-guess spam, and any unethical sysadmin with a clue can use the mail logs of his servers to generate a list of valid email addresses from other sites for sale.
  - Re:hiding your address (Score:1)
    
    by Monkier ( 607445 ) writes:
    
    also if your email is a combination of firstname and/or surname - chances are the spammers will guess it..
  - Re:hiding your address (Score:2)
    
    by Nethead ( 1563 ) writes:
    
    And if you're running a mail system for 10,000 Real Estate agents..... 4x Barracuda 400 Spam Firewalls.
- Re:hiding your address (Score:2)
  
  by MichaelSmith ( 789609 ) writes:
  
  The best defense against spam is never to type your personal address anywhere on the internet.
  You still have to communicate with people, and many of them will have windows boxes which will get rooted at one time or another. It is made worse by people who innocently spam whole lists of people with documents or joke emails. Your address can get spread around that way.
- Re: (Score:2)
  
  by account_deleted ( 4530225 ) writes:
  
  Comment removed based on user account deletion
  - Re:hiding your address (Score:1)
    
    by Damer Face ( 910606 ) writes:
    
    > or you can always do stuff like: foo AT gmail DOT com
    > or you can always use the html encoding for the characters in the email
    
    These are no protection against a number of more advanced bots, and that number will increase over time.
    
    Also, in many situations, like signing up for stuff online, an encoded email address won't be seen as valid input and will be rejected out of hand.
    
    > or you can always just put the words inside an image.
    
    This might work on your personal website, but is useless in most situ
- Re:hiding your address (Score:1)
  
  by Damer Face ( 910606 ) writes:
  
  And make damn sure that your code isn't vulnerable to "e-mail injection" exploits; these will result in spammers using your simple form to spam others AND you getting your hosting revoked.
  
  See, eg, here: http://www.nyphp.org/phundamentals/email_header_injection.php [nyphp.org]
  - Re:hiding your address (Score:2)
    
    by Alioth ( 221270 ) writes:
    
    Or any injections at all. I host a modest number of people's domains (a dozen people). One user had PHPBB. When I told him what trouble his buggy, old version of PHPBB had caused, he swore he'd deleted it - all he'd actually done is removed the links to the board, but the code was still there.
    
    A Romanian phishing gang found it, and tried to send over 2 million phishing emails by uploading a PHP script via the exploit. Fortunately, the way I have the email relay configured (the firewall blocks port 25 egress
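The e-mail injection risk raised in the two comments above comes down to user input reaching mail headers with line breaks intact, which lets an attacker append headers such as Bcc. Below is a minimal sketch of the usual defence, written in Python rather than the PHP discussed in the thread. The function names and the `owner@example.com` address are hypothetical, and a real handler would also validate the address format.

```python
def is_safe_header_value(value):
    """Return True if the value cannot start a new header line."""
    return "\r" not in value and "\n" not in value


def build_contact_headers(visitor_email, subject, owner_address="owner@example.com"):
    """Build headers for a contact-form message, refusing injected line breaks.

    owner_address is a placeholder; the recipient is fixed on the server side
    so the form can never be pointed at arbitrary addresses.
    """
    for field in (visitor_email, subject):
        if not is_safe_header_value(field):
            raise ValueError("line break in header field, possible injection")
    return {
        "To": owner_address,
        "Reply-To": visitor_email,
        "Subject": subject,
    }
```

Rejecting the request outright, rather than stripping the line breaks, is a deliberate choice in this sketch: a legitimate visitor has no reason to submit a multi-line address or subject.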
- Re:hiding your address (Score:2)
  
  by ozbird ( 127571 ) writes:
  
  The best defense against spam is never to type your personal address anywhere on the internet.
  
  It's at least ten years too late for that for me, and I'll be damned if I'm going to give up my email address now just because of a few pesky spammers. Besides, the worst of the spam flood seems to be over. A year ago, I was getting hundreds of spam messages a day; now I might get ten, occasionally twenty a day. SpamAssassin + ClamAV identify the vast majority of those.
  - Re:hiding your address (Score:1)
    
    by samkass ( 174571 ) writes:
    
    "A year ago, I was getting hundreds of spam messages a day; now I might get ten, occasionally twenty a day. SpamAssassin + ClamAV identify the vast majority of those."
    
    For me, most spam (unwanted email not intended for me personally) I receive are either bounces or "confirmation" emails from other people's spam filters. Since spammers never send FROM their own address, they usually just pick a random address off their list and send from them (ie. Mine.) So bounces go to me.
    
    These days, I've started clicking
- Re:not "bulletproofly" reliable (Score:1)
  
  by NutMan ( 614868 ) writes:
  
  This isn't "bulletproofly" reliable either. My brothers and I run a small local ISP. Years ago I created an address for my youngest daughter. She never used it, it was never posted anywhere, and it wasn't an easy to guess address since it was a combination of her name and her nickname. However spammers are constantly trying to discover email addresses on our domain, we get about 2,000 invalid recipient attempts every hour of the day. So eventually they discovered her address and she now gets a small amount
- Re:hiding your address (Score:1)
  
  by edesio ( 93726 ) writes:
  
  You can also use a "short-term" e-mail like the ones provided at SpamGourmet.com.
- Re:hiding your address (Score:2)
  
  by HermanAB ( 661181 ) writes:
  
  Never heard of dictionary attacks on domains have you?
- Re:hiding your address (Score:2)
  
  by horza ( 87255 ) writes:
  
  just set up a simple form and use simple php to make it convenient for them to reach you while keeping your email address safely tucked away
  
  All you've done is swapped vigilance in maintaining anti-spam on your inbox to vigilance in protecting your contact form against spammers abusing your email form as a spam gateway. My contact form page gets an attempted hit every couple of days (usually a combination of MIME attachments in the comments field and injecting a BCC field to forward to the recipient) and thi
- - Mod parent into oblivion! (Score:1)
    
    by Thalagyrt ( 851883 ) writes:
    
    Nice troll. PHP has nothing to do with spam, if anything it was your blatant stupidity that got you on a spam list.
- Re:hiding your address (Score:2, Funny)
  
  by MasTRE ( 588396 ) writes:
  
  > Other than annoying whitelists, there is no anti spam warez that is bulletproofly reliable.
  
  Yeah yo, no bulletproofly reliable warez yo!
  
  > ...just set up a simple form and use simple php to make it convenient for them to...
  
  Make it convinient to root your server, yo! Yeah, yo! Bulletproofly warez, yo!
  
  > Though this is only possibly with PHP...
  
  Yeeeeaaaah, buddy! Warez, yo!
  
  NOT!
  
  Whatever TF this guy is smoking, you lemmings shouldn't mod it +4/Informative. It's a crap post.
Try DSPAM (Score:4, Informative)

by ajs ( 35943 ) writes: <{ajs} {at} {ajs.com}> on Monday October 17, 2005 @08:41AM (#13808407) Homepage Journal

I'm a long-time proponent of and rare contributor to SpamAssassin, and I'll continue to be, but fighting spam is much like fighting disease: you have to diversify your defenses. DSPAM is a nice package, and is very well designed. I've spoken to the author in the past, and he has an excellent understanding of the complexities of the issue (as opposed to the legions of people who seem to think that spam filtering should be easy, given the right algorithm).

As far as I'm concerned there are two tools for spam filtering: DSPAM and SpamAssassin. Try them both. See what fits your needs. My impression is that SpamAssassin provides more knobs and buttons and is more easily extended by the casual user, but DSPAM can be lighter weight. Both are highly accurate, with very low false positive rates.

- Re:Try DSPAM (Score:2)
  
  by gvc ( 167165 ) writes:
  
  There are lots of alternatives. Bogofilter, spamprobe, spambayes, popfile, dbacl, are all quite effective.
  - Re:Try DSPAM (Score:2)
    
    by imroy ( 755 ) writes:
    
    From what I know of those projects, they're all Bayesian filters and little more. Maybe a white/black list. That's what the GP post was referring to when he wrote "as opposed to the legions of people who seem to think that spam filtering should be easy, given the right algorithm". I don't know much about this DSPAM, but SpamAssassin covers a whole bunch of tests. It started off as a list of common-sense patterns looking for the usual penis/breast enlargement etc spam in the email body and suspicious info in
    - Re:Try DSPAM (Score:3, Informative)
      
      by gvc ( 167165 ) writes:
      
      I use Spamassassin with a special user configuration file [uwaterloo.ca] and I train it systematically. In this configuration it works pretty well (much, much, better than out-of-the box). But Bogofilter and Popfile work about as well. As does just the Bayesian component of Spamassassin, ignoring all the other cruft. DSPAM, on the other hand, doesn't work at all well for me.
      - Re:Try DSPAM (Score:2)
        
        by ajs ( 35943 ) writes:
        
        For the most part you seem to be:
        
        Shutting off auto-learn (mistake, see below)
        Upping BAYES scores (good plan, I do too)
        enabling a few knobs that are generally useful (though I've had too many false positives with RCVD_IN_DSBL).
        
        The only thing I would criticize is shutting off auto-learn. If you want to be conservative, just lower the ham threshold and raise the spam threshold a bit. I tried to manually train for a while, and what I found was that I was actually lying to SA. auto-learn means that a view of yo
        
        Re:Try DSPAM (Score:2)
        
        by gvc ( 167165 ) writes:
        
        Auto-learn in spamassassin is broken. In fact my mail script automatically calls sa-learn for every message, with ham or spam depending on what Spamassassin claims. Then if I want to correct it I call sa-learn over again with the correct classification. That's why the user-prefs file has it turned off.
        
        I should make this more clear in my notes. Thanks for pointing it out.
        
        Re:Try DSPAM (Score:2)
        
        by ajs ( 35943 ) writes:
        
        Explain "broken". Works great for me.....
        
        Training on everything is probably a mistake. Catching all of the edge conditions where that fails is going to be a very laborious task. Do all of your users do the same, or do you force their auto-learning off and have them use your bayes tokens? That has its own problems (you're not training on their mail), but at least would not leave an inattentive user in the horrible situation where they are constantly training incorrectly. That quickly leads to a broken classi
        
        Re:Try DSPAM (Score:2)
        
        by gvc ( 167165 ) writes:
        
        Explain "broken". Works great for me.....
        Some explanation appears here [uwaterloo.ca].
        In summary, auto-learn re-evaluates the message using only the static rules - not the bayes rules. Then, if the static rules give an extreme score that differs from the bayes score, and a couple of extra ad hoc conditions hold (number of "hits" exceeds some threshold) the bayes filter is trained.
        You can adjust the "extremeness" of the score under which Bayes is trained but training will not be on what Spamassassin reports; only on
        
        Re:Try DSPAM (Score:2)
        
        by ajs ( 35943 ) writes:
        
        "In summary, auto-learn re-evaluates the message using only the static rules - not the bayes rules. Then, if the static rules give an extreme score that differs from the bayes score, and a couple of extra ad hoc conditions hold (number of "hits" exceeds some threshold) the bayes filter is trained."
        
        Hrm... well, no.
        
        First off "number of hits" is not an "extra ad hoc condition". Number of "hits" is exactly "score". There's no difference, just two pieces of terminology for the same thing. "Level" is another thin
        
        Re:Try DSPAM (Score:2)
        
        by gvc ( 167165 ) writes:
        
        Hrm... well, no.
        All that said, you seem uncomfortable with static rules of any kind, so if you don't buy into what I've said above, then I suggest that you stop using SA. Static rules are a giant advantage, but if you are going to defeat most of their value, then you might as well not suffer their overhead.
        For further reading, I suggest: http://plg.uwaterloo.ca/~gvcormac/spamcormack.html [uwaterloo.ca]
        I wrote that paper, and the configuration I posted here is what was used in the best-scoring run.
        For your conven
- Re:Try DSPAM (Score:1)
  
  by jaseuk ( 217780 ) writes:
  
  The problem with SPAMD, SpamAssassin etc. is they rely too much on training and user interaction. If a user has to go into the SPAM box and double check that no mistakes have been made then the system is worse than not having any SPAM checking at all as most users will not check the SPAM box, this is especially true for larger deployments where it is much harder to train users and these environments usually cannot afford for these sorts of mistakes to be made.
  
  I've found greylisting to be the best solution
  - Re:Try DSPAM (Score:3, Insightful)
    
    by gvc ( 167165 ) writes:
    
    If a user has to go into the SPAM box and double check that no mistakes have been made then the system is worse than not having any SPAM checking at all.
    Not true. First, if the user's mailbox is cluttered with spam, the user is more likely to overlook good mail. More likely than a good spam filter. Second, it is way easier to scan a list of predominantly spam for occasional good mails (and vice versa) than to have everything jumbled together. Third, spam filters are good enough that one does not need n
    - Re:social effects (Score:1)
      
      by ToyKeeper ( 17042 ) writes:
      
      I've found that nearly all of my users actually prefer an interactive system like dspam over a fully-automatic system. Both systems make mistakes, but the interactive system gives the user a feeling of empowerment to fix mistakes and improve their accuracy over time.
      
      It's better for the admin, too... When a non-interactive system makes a mistake, I find that the users complain -- either to the admin or to each other. But with dspam, they reclassify the missed message and continue working, happy to know th
      - Re:social effects (Score:2)
        
        by gvc ( 167165 ) writes:
        
        Absolutely. It is cathartic to punish spam by reporting it to your spam filter. And, of course, fully automatic systems aren't nearly as good as claimed. (Neither are learning filters - 99.9...% accuracy? pshaw! - but they're better than non-learning ones.)
        
        Re:social effects (Score:1)
        
        by jaseuk ( 217780 ) writes:
        
        I used SPAM Assassin quite happily for many years but found the effectiveness started dropping, there are some messages that just can't be caught, usually these are the worst kinds of messages (ie. a face full of spunk) almost always received by the people most likely to be offended (ie. 55 year old female administrative staff).
        
        False positives seem to be more of a problem written in languages other than English. Pretty much all of our e-mail in Welsh language we receive through AOL has been tagged by AOL a
  - Solution (Score:2)
    
    by lorcha ( 464930 ) writes:
    
    If a user has to go into the SPAM box and double check that no mistakes have been made then the system is worse than not having any SPAM checking at all as most users will not check the SPAM box
    
    I use a three-outcome approach with SpamAssassin. Messages scored below 5 are delivered to the user's INBOX. Messages scored 5 or higher, but less than 10 go into the spam box. Messages scored 10 or higher are rejected during the SMTP session, with instructions on how to proceed.
    I did this because, in practice,
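The three-outcome routing described in the comment above reduces to two thresholds on the message score. Here is a minimal Python sketch using the cut-offs quoted there (5 and 10). The function name and return labels are hypothetical, and this is not SpamAssassin's own API, only the decision made once a score is available.

```python
def route_message(score, spam_threshold=5.0, reject_threshold=10.0):
    """Map a spam score to one of three outcomes."""
    if score >= reject_threshold:
        return "reject"    # refuse during the SMTP session
    if score >= spam_threshold:
        return "spam-box"  # deliver, but to the spam folder
    return "inbox"         # deliver normally
```

For example, `route_message(7.2)` returns `"spam-box"`. Checking the higher threshold first keeps the two comparisons independent of each other.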
- Re:Try DSPAM (Score:1)
  
  by ToyKeeper ( 17042 ) writes:
  
  This is just one admin's viewpoint... it may not reflect anyone else's experiences. It's just what I've found over the years, using both systems.
  
  Accuracy... SpamAssassin generally offers higher accuracy with less effort, at first, but the accuracy degrades over time. DSPAM takes more effort initially, but offers higher, sustained accuracy over the long term. I see an average of about 99.5% long-term accuracy with dspam. I can't tell what the accuracy was with spamassassin, since it doesn't include a wa
  - Re:Try DSPAM (Score:2)
    
    by cloudmaster ( 10662 ) writes:
    
    Were you using procmail and individual spamassassins, or using spamd/spamc for mail checking? I wonder if that's the reason people see such super-high CPU loads with SA. I was delivering around 10K-15K messages/day (roughly 50 users), with SA identifying around 85% as spam. The backup MX ran spamd with user prefs and bayesian keys stored in MySQL, and the primary MX delivered through procmail using spamc. The backup MX/spamd machine was a P3/800 with 512M RAM and the primary MX was an Athlon 1000 with 1
    - Re:Try DSPAM (Score:1)
      
      by ToyKeeper ( 17042 ) writes:
      
      I saw the SA load problem happen both with and without using the daemon setup. However, the systems were slower than what you described, and did a lot more than just handle email. They were dual-500MHz boxes, but couldn't keep up with the incoming mail. Mail arrived faster than SA could process it, even though it was just a few dozen accounts. It would tend to catch up at night, but email during the day was pretty lagged.
      
      I haven't tried dspam as a daemon yet, but intend to try it soon to see how it work
      - Re:Try DSPAM (Score:2)
        
        by cloudmaster ( 10662 ) writes:
        
        That backup MX was also the primary DNS and syslog server, but that's not much of a load. The primary MX was also the pop/imap/web server, for what it's worth. My home setup is about 5 users with around 5-7K messages/day, and I run spamd and MySQL on the same box - which is a dual Celeron 400 machine. Messages come in on an AMD 5x86-133 gateway which does the DNS lookups and then forwards to a PPro233 which calls spamc (that one's also the web server). All three machines combined have less computing pow
Thanks, but... (Score:1)

by Kagura ( 843695 ) writes:

I use Gmail. :)
- Re:Thanks, but... (Score:1, Funny)
  
  by Slashcrap ( 869349 ) writes:
  
  I use Gmail. :)
  
  "So I let Google spam me in a targeted and personal manner via HTML rather than random people spamming me through SMTP."
  
  I can understand why you're so proud.
  - Re:Thanks, but... (Score:1)
    
    by Talinth ( 855653 ) writes:
    
    I configured my gMail account to Moz Thunderbird. No targeted ads, and the benefit of the greatness that is the gMail spam filter. I would say that it is quite possible the GP poster does as well.
    - Re:Thanks, but... (Score:1)
      
      by Slashcrap ( 869349 ) writes:
      
      I configured my gMail account to Moz Thunderbird. No targeted ads, and the benefit of the greatness that is the gMail spam filter. I would say that it is quite possible the GP poster does as well.
      
      Yeah, I bet at least 99% of gMail users know how to do that.
      - Re:Thanks, but... (Score:2)
        
        by shaitand ( 626655 ) writes:
        
        If that was meant to be sarcastic, it should not be. Gmail is invite-only, and the first invitations went to an entirely tech-savvy crowd. Although Gmail has spread far and wide, I think the audience is still primarily tech-oriented.
- So do I... and it could so easily be improved! (Score:2)
  
  by hobbit ( 5915 ) writes:
  
  I get an incredible amount of spam bounces in my GMail account -- from somebody sending lots of spam using my GMail address as the From: or the Return-to: address.
  
  I really, really want an option for GMail to record the message-id of all messages I ever send through their server, and bounce any which are returned to me but which they haven't got on record as being sent by me.
  
  I requested this ages ago, and it should be relatively straightforward. Does anyone else have this problem?
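The Message-ID check suggested above can be sketched roughly as follows. This is only an illustration of the idea, not anything GMail is known to do: `SentLog`, `record_outgoing`, `is_bounce_for_sent_mail`, and `original_message_id` are hypothetical names, and the sketch assumes the bounce includes the original message or its headers as a `message/rfc822` or `text/rfc822-headers` part.

```python
from email import message_from_string
from email.message import Message
from typing import Optional


def original_message_id(bounce: Message) -> Optional[str]:
    """Return the Message-ID of the message a bounce claims to be about, if present."""
    for part in bounce.walk():
        content_type = part.get_content_type()
        if content_type == "message/rfc822" and part.is_multipart():
            # The parser exposes the returned message as a one-element payload list.
            return part.get_payload(0).get("Message-ID")
        if content_type == "text/rfc822-headers":
            # Only the original headers were returned; parse them as a header block.
            return message_from_string(part.get_payload()).get("Message-ID")
    return None


class SentLog:
    """Hypothetical record of the Message-IDs of mail the user really sent."""

    def __init__(self) -> None:
        self._sent_ids = set()

    def record_outgoing(self, raw_message: str) -> None:
        # Remember the Message-ID of every message that leaves through the server.
        msg_id = message_from_string(raw_message).get("Message-ID")
        if msg_id:
            self._sent_ids.add(msg_id.strip())

    def is_bounce_for_sent_mail(self, raw_bounce: str) -> bool:
        # A bounce is accepted only if it refers to a Message-ID on record.
        msg_id = original_message_id(message_from_string(raw_bounce))
        return msg_id is not None and msg_id.strip() in self._sent_ids
```

A bounce that carries no copy of the original headers cannot be verified this way, so a real deployment would still need a policy for those.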
A 'chicken-and-egg' random thought (Score:2)

by TVmisGuided ( 151197 ) writes:

This is one of those things that makes me wonder...which "side" is pushing the technological envelope further and faster, the {spammers | malware slimers | virus breeders} or those who develop to defeat them?

Since it's generally agreed that history is written by the winners of a given conflict, I guess we won't have an answer to that until the war's over.

This comment generously brought to you by a severe lack of caffeine.
- Re:A 'chicken-and-egg' random thought (Score:2)
  
  by EpsCylonB ( 307640 ) writes:
  
  This isn't really a chicken-and-egg situation. What's the answer to 99 out of 100 questions? Money.
  
  Spammers used email to sell things whilst at the same time pissing everybody off. Eventually people hate spam so much that they are willing to pay for services that try to eliminate it.
  
  It may not always be so, but spammers have always been one step ahead; they have more incentive.
curious about MD (Score:1)

by jkind ( 922585 ) writes:

How well does "Markovian discrimination" work in practice? It sounds fascinating, but what false-positive rate can be expected on average?
Geez, from dealing with spammers to working with the crap DiamondTouch, Yerazunis is a real glutton for punishment :)
- Re:curious about MD (Score:1)
  
  by junics ( 664175 ) writes:
  
  The CRM114 classifier/filter has used Markovian discrimination and derivatives thereof for quite some time and claims 99.984% accuracy.
  A downside is that Markovian is quite a lot more resource-intensive than simple Bayesian.
  
  I used bogofilter (a fast Bayesian filter) before CRM114. Even though CRM114 was harder to set up than bogofilter and used more resources, it was totally worth it.
- Re:curious about MD (Score:2)
  
  by Antique Geekmeister ( 740220 ) writes:
  
  You apparently missed Iglassware, Bill's contribution to measured drinking, and his role in the JunkYard Wars, at http://www.tms.org/pubs/journals/JOM/0310/Byko/Byko-0310.html [tms.org]
  - Re:curious about MD (Score:1)
    
    by jkind ( 922585 ) writes:
    
    Iglassware.. now there is a fun Masters thesis :)
- Re:curious about MD (Score:2, Interesting)
  
  by Nuclear Elephant ( 700938 ) writes:
  
  Below are some tests I ran with a pre-release version of DSPAM on a test corpus. As you can see, Markovian discrimination is significantly more efficient than any Bayesian methods and Chi-Square. Markovian showed slightly more (4 more than the top contender) false positives, but it also caught 100 more spam... some additional tuning, tweaking, and most importantly, training, can easily get this down to a very low error rate.
  
  Bayesian (burton)
  TP: 785 TN: 1003 FN: 218 FP: 4 SC: 4 IC: 0
  SR:
  - Re:curious about MD (Score:1)
    
    by jshaped ( 899227 ) writes:
    
    So in your opinion, are 4 more false positives worth the increase in true positives?
    
    This is one thing I'm struggling with: how to compare the results of 2 filters on the same corpus.
    We know FPs are substantially worse than any spam that gets through, but how much worse?
    - Re:curious about MD (Score:2, Insightful)
      
      by Nuclear Elephant ( 700938 ) writes:
      
      4 FPs for 100-something more TPs? Heck yeah. At least for me. But keep in mind these are just preliminary training numbers with 1000 messages in each corpus. After real-world training, any of these approaches will be much more accurate.
- Re:curious about MD (Score:2)
  
  by markhb ( 11721 ) writes:
  
  Yerazunis is a real glutton for punishment
  
  He leavened it with appearances on Junkyard Wars [the-nerds.org].
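One way to frame the "how much worse is a false positive?" question from this thread is a weighted error count, sketched below. The `Counts` class, `weighted_cost`, and `break_even_fp_weight` are hypothetical helpers, and the weight itself is a personal judgment rather than a measured quantity. The only figures used are ones quoted in the thread: the Bayesian (burton) counts, and the rough "4 more false positives for about 100 more spam caught" trade.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Counts:
    tp: int  # spam correctly caught
    tn: int  # legitimate mail correctly delivered
    fn: int  # spam that got through
    fp: int  # legitimate mail wrongly flagged as spam

    def weighted_cost(self, fp_weight: float) -> float:
        """Total errors, counting each false positive as fp_weight missed spams."""
        return self.fn + fp_weight * self.fp


def break_even_fp_weight(extra_spam_caught: int, extra_false_positives: int) -> float:
    """Weight at which the extra catches exactly pay for the extra false positives."""
    return extra_spam_caught / extra_false_positives


# Counts quoted in the thread for the Bayesian (burton) run.
burton = Counts(tp=785, tn=1003, fn=218, fp=4)
print(burton.weighted_cost(fp_weight=10.0))

# About 100 more spam caught for 4 more false positives, as described above.
print(break_even_fp_weight(extra_spam_caught=100, extra_false_positives=4))
```

Under this framing, the trade described above pays off for anyone who considers a false positive less than 25 times as bad as a missed spam, and does not for anyone who rates it worse than that.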
OpenBSD port (Score:3, Informative)

by chrysalis ( 50680 ) writes: on Monday October 17, 2005 @09:43AM (#13808727)

The OpenBSD port can be downloaded from ftp://ftp.00f.net/misc/port-dspam-3.6.0.tar.gz [00f.net]

Enhancement? (Score:1)

by hazzey ( 679052 ) writes:

...significant enhancements include trusted sender whitelisting...
I thought that whitelisting had been a feature of every email reader/server since spam filtering began.
- Re:Enhancement? (Score:1)
  
  by Nuclear Elephant ( 700938 ) writes:
  
  I thought that whitelisting had been a feature of every email reader/server since spam filtering began.
  
  DSPAM's trusted sender whitelisting is automatic, based on who you converse with. It's not quite social networking, but it is very useful and requires no effort on the end user's part.
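As a rough illustration of whitelisting "based on who you converse with", here is a naive sketch. It is not DSPAM's actual algorithm, which the thread does not describe; `ConversationWhitelist`, `note_outgoing`, `is_trusted`, and the `min_messages` threshold are all assumptions made for the example. The idea is simply that addresses the user has written to are treated as trusted senders.

```python
from collections import Counter
from email.utils import getaddresses, parseaddr


class ConversationWhitelist:
    """Hypothetical whitelist built from the recipients of the user's own mail."""

    def __init__(self, min_messages: int = 1) -> None:
        # Number of outgoing messages to an address before it is trusted (assumed).
        self.min_messages = min_messages
        self._written_to = Counter()

    def note_outgoing(self, to_header: str) -> None:
        # Count every recipient address found in an outgoing To: header.
        for _name, address in getaddresses([to_header]):
            if address:
                self._written_to[address.lower()] += 1

    def is_trusted(self, from_header: str) -> bool:
        # An incoming sender is trusted if the user has written to them often enough.
        _name, address = parseaddr(from_header)
        return self._written_to[address.lower()] >= self.min_messages


whitelist = ConversationWhitelist(min_messages=2)
whitelist.note_outgoing("Alice <alice@example.org>, bob@example.org")
whitelist.note_outgoing("alice@example.org")
print(whitelist.is_trusted("Alice <Alice@Example.org>"))
print(whitelist.is_trusted("bob@example.org"))
```

Because From: headers are easy to forge, a scheme like this is best treated as one signal among several rather than an unconditional pass.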
Not an advertisement... (Score:3, Interesting)

by pabl0 ( 228298 ) writes: on Monday October 17, 2005 @01:34PM (#13810333)

... but it'll sound like one: I recently converted from a rather involved anti-spam defense utilizing SpamAssassin with Razor, Pyzor, and several RBL checks. I spent a fair amount of time selecting RBLs that worked the best and tweaking SA test scores whenever I got false positive/negative messages. I even had all sorts of validity checks turned on in the MTA to block out badly formed messages and the like.

I replaced all those defenses with: DSPAM. And I'm seeing better results out of the box than I ever did with a multi-layered SA-based solution, even after a lot of time tweaking.

A quick anecdote: When I converted, I opened up a bunch of previously blocked spamtrap addresses, just to get some good training material for the filter. I've long since passed my initial training threshold but haven't even bothered to block the spamtraps again because I never see the spam. At the risk of sounding like I'm bragging, I literally don't have a spam problem anymore, and DSPAM is entirely responsible for that.

Now, I'm not necessarily advocating that you give up all your custom defenses and switch to DSPAM. (I've turned off all my other filters, but I haven't removed them completely.) There's always a chance that an ingenious spammer will find a weakness in DSPAM setups, but I can testify to the fact that DSPAM is "scary good" as of right now. Training the filter is a simple matter of dropping misclassified messages (and there aren't many) into an IMAP folder.

If what you have is working for you, stick with it. But if you're looking for a low-maintenance, high accuracy filter, you should definitely give DSPAM a shot.

What is wrong with DSPAM? (Score:1)

by frn123 ( 242374 ) writes:

Why is it not included in Debian?
Spamassassin is.
Bogofilter is.
Popfile is.

I thought it was the license, but it seems that DSPAM is GPL.
So, can anyone comment? I'm not installing it on my server if I cannot apt-get it and have Debian security support for it.
- Re:What is wrong with DSPAM? (Score:1)
  
  by Nuclear Elephant ( 700938 ) writes:
  
  Why is it not included in Debian?
  
  There's been a lot of interest in this area, but nobody's felt like taking it upon themselves to make a Debian package AFAIK. Part of it may have had to do with the storage driver backend, which supports several different approaches but required a recompile to switch from, say, Postgres to MySQL. In 3.6, the storage backend can be built dynamically, making packaging much easier. Perhaps someone will pick 3.6 up now.
