DSPAM v3.6 Released 100
Nuclear Elephant writes "After six months of development, DSPAM v3.6 has been released. The most notable change is the series of new features added to make an anti-spam gateway appliance possible (Knoppix anyone?). Version 3.6 also includes a highly accurate alternative to Bayesian filtering known as Markovian discrimination, based on Bill Yerazunis' research. Other significant enhancements include trusted sender whitelisting, integrated Clam Antivirus and LDAP support, a centralized spam training alias, and a new dependency-free storage driver. Much of the documentation has also been rewritten to make installation easier. A change log and release notes are also available. Slashdot has recently featured a review of the author's book, Ending Spam and an interview as well."
Comparison to other tools (Score:2, Insightful)
Re:Comparison to other tools (Score:3, Informative)
Although DSPAM is not an official participant at TREC, three configurations will be evaluated for comparison - with tum, toe, and teft training modes. Zdziarski reported some of the preliminary results in his interview, but complete and comparative results won't be available until TREC in November.
Re:Comparison to other tools (Score:2, Informative)
The moment I installed and started GLD (gasmi.net), the spam simply stopped. It was like flipping the "nospam" switch on. The spam just stopped. No false positives, no missed spam, nothing.
Every now and then I get unwanted email, but at least now it's from an actual, identifiable SMTP server, not a spam-bot.
It's an amazing improvement from i
finally (Score:1)
Windows and Exchange. (Score:5, Interesting)
For practical reasons I don't have linux in my test lab, and I'd like to have DSpam on my Webserver which is running IIS6 and Windows 2003 Server.
I can see I need to run it in SMTP mode with a relay to my Exchange box, but I don't want to waste my time trying to compile it (using Visual Studio), if someone already knows it wont work.
-Jar.
Most likely need cygwin. (Score:2)
Re:Windows and Exchange. (Score:5, Informative)
Q. Does it work with Windows?
A. v3.2 is the first to include a Windows build supplement, which includes the necessary Visual C++ project files and portage to compile the agent and tools under Windows. Check out the win32/ directory in the source tree for more information. Win32 support is still unofficial, but seems to work well. Of course getting it compiled is one thing, getting it integrated is another. It's probably best to build it under Cygwin using the general distribution.
Re:Windows and Exchange. (Score:2)
I downloaded version 3.6.0, but there seems to be nada
Linux Router (Score:4, Interesting)
How about getting it compiled into a Linksys WRT54G router firmware i.e Sveasoft firmware?
Re:Linux Router (Score:4, Informative)
Re:Linux Router (Score:2)
Re:Windows and Exchange. (Score:1)
Re:Windows and Exchange. (Score:2)
I owe you One (1) Beer.
-Jar.
SPAM (TM) (Score:1)
SPAM [spam.com] is a registered trademark of Hormel Foods Corporation, and DSPAM aren't the Monty Python [montypythonsspamalot.com].
Still getting on Hormel's nerves, I suppose (Score:1, Informative)
Too late (Score:3, Funny)
Re:hiding your address (Score:5, Insightful)
What kind of fuckwittery is this? No, plenty of languages can code a simple contact form handler, the platform you run it on is pretty irrelevant, and PHP is by no means "the most important language to learn in the universe". It's a pretty typical scripting language, not the magic you make it out to be.
Re:hiding your address (Score:2)
For just about everything else there is still perl (which is definately superior to php in every NON web task) and when perl fails there is C (
Re:hiding your address (Score:2)
Re:hiding your address (Score:2)
Re:hiding your address (Score:1)
Why not use Apache + mod_perl/mod_php, like the vast majority of souls in the known universe?
Re:hiding your address (Score:2)
Re:hiding your address (Score:2)
Re:hiding your address (Score:2)
Re:hiding your address (Score:2)
Re:hiding your address (Score:1)
As you know, comments on PHP vs. Other Scripting Languages are totally useless...
Re:php problems -- too specialized (Score:1)
I've been keeping a list of problems with PHP [toykeeper.net], if anyo
Re:hiding your address (Score:2)
You have to do more than that. You also have to not email anyone, and also not have an easy to guess username.
The problem is, you can never publish your email address anywhere - and someone else will gladly do it for you. All it takes is one person you have emailed to come down with an email virus, which then propogates your address all over the net.
Email address synthesis will also guarantee unless you have the m
Re:hiding your address (Score:4, Interesting)
> personal address anywhere on the internet.
Hiding your address does not work because some viruses collect addresses from your correspondents addressbook. Your address will percolate to spam lists, it is only a matter of time. If like me you have kept your adress for many years, you absolutely need some form of spam defense.
Re:hiding your address (Score:2)
Re:hiding your address (Score:1)
Re:hiding your address (Score:2)
Re:hiding your address (Score:2)
You still have to communicate with people, and many of them will have windows boxes which will get rooted at one time or another. It is made worse by people who innocently spam whole lists of people with documents or joke emails. Your address can get spread around that way.
Re: (Score:2)
Re:hiding your address (Score:1)
> or you can always use the html encoding for the characters in the email
These are no protection against a number of more advanced bots, and that number will increase over time.
Also, in many situations, like signing up for stuff online, an encoded email address won't be seen as valid input and will be rejected out of hand.
> or you can always just put the words inside an image.
This might work on your personal website, but is useless in most situ
Re:hiding your address (Score:1)
See, eg, here: http://www.nyphp.org/phundamentals/email_header_i
Re:hiding your address (Score:2)
A Romanian phishing gang found it, and tried to send over 2 million phishing emails by uploading a PHP script via the exploit. Fortunately, the way I have the email relay configured (the firewall blocks port 25 egress
Re:hiding your address (Score:2)
It's at least ten years too late for that for me, and I'll be damned if I'm going to give up my email address now just because of a few pesky spammers. Besides, the worst of the spam flood seems to be over. A year ago, I was getting hundreds of spam messages a day; now I might get ten, occasionally twenty a day. SpamAssassin + ClamAV identify the vast majority of those.
Re:hiding your address (Score:1)
For me, most spam (unwanted email not intended for me personally) I receive are either bounces or "confirmation" emails from other people's spam filters. Since spammers never send FROM their own address, they usually just pick a random address off their list and send from them (ie. Mine.) So bounces go to me.
These days, I've started clicking
Re:not "bulletproofly" reliable (Score:1)
Re:hiding your address (Score:1)
Re:hiding your address (Score:2)
Re:hiding your address (Score:2)
All you've done is swapped vigilence in maintaining anti-spam on your inbox to vigilence in protecting your contact form against spammers abusing your email form as a spam gateway. My contact form page gets an attempted hit every couple of days (usually a combination of MIME attachments in the comments field and injecting a BCC field to forward to the recipient) and thi
Mod parent into oblivion! (Score:1)
Re:hiding your address (Score:2, Funny)
Yeah yo, no bulletproofly reliable warez yo!
>
Make it convinient to root your server, yo! Yeah, yo! Bulletproofly warez, yo!
> Though this is only possibly with PHP...
Yeeeeaaaah, buddy! Warez, yo!
NOT!
Whatever TF this guy is smoking, you lemmings shouldn't mod it +4/Informative. It's a crap post.
Try DSPAM (Score:4, Informative)
As far as I'm concerned there are two tools for spam filtering: DSPAM and SpamAssassin. Try them both. See what fits your needs. My impression is that SpamAssassin provides more knobs and buttons and is more easily extended by the casual user, but DSPAM can be lighter weight. Both are highly accurate, with very low false positive rates.
Re:Try DSPAM (Score:2)
Re:Try DSPAM (Score:2)
Re:Try DSPAM (Score:3, Informative)
Re:Try DSPAM (Score:2)
The only thing I would critisize is shutting off auto-learn. If you want to be conservative, just lower the ham threshold and raise the spam threshold a bit. I tried to manually train for a while, and what I found was that I was actually lying to SA. auto-learn means that a view of yo
Re:Try DSPAM (Score:2)
I should make this more clear in my notes. Thanks for pointing it out.
Re:Try DSPAM (Score:2)
Training on everything is probably a mistake. Catching all of the edge conditions where that fails is going to be a very laborious task. Do all of your users do the same, or do you force their auto-learning off and have them use your bayes tokens? That has its own problems (you're not training on their mail), but at least would not leave an inattentive user in the horrible situation where they are constantly training incorrectly. That quickly leads to a broken classi
Re:Try DSPAM (Score:2)
Some explanation appears here [uwaterloo.ca].
In summary, auto-learn re-evaluates the message using only the static rules - not the bayes rules. Then, if the static rules give an extreme score that differs from the bayes score, and a couple of extra ad hoc conditions hold (number of "hits" exceeds some threshold) the bayes filter is trained.
You can adjust the "extremeness" of the score under which Bayes is trained but training will not be on what Spamassassin reports; only on
Re:Try DSPAM (Score:2)
Hrm... well, no.
First off "number of hits" is not an "extra ad hoc condition". Number of "hits" is exactly "score". There's no difference, just two pieces of terminology for the same thing. "Level" is another thin
Re:Try DSPAM (Score:2)
I wrote that paper, and the configuration I posted here is what was used in the best-scoring run.
For your conven
Re:Try DSPAM (Score:1)
I've found greylisting to be the best solution
Re:Try DSPAM (Score:3, Insightful)
Not true. First, if the user's mailbox is cluttered with spam, the user is more likely to overlook good mail. More likely than a good spam filter. Second, it is way easier to scan a list of predominantly spam for occasional good mails (and vice versa) than to have everything jumbled together. Third, spam filters are good enough that one does not need n
Re:social effects (Score:1)
It's better for the admin, too... When a non-interactive system makes a mistake, I find that the users complain -- either to the admin or to each other. But with dspam, they reclassify the missed message and continue working, happy to know th
Re:social effects (Score:2)
Re:social effects (Score:1)
I used SPAM Assassin quite happily for many years but found the effectiveness started dropping, there are some messages that just can't be caught, usually these are the worst kinds of messages (ie. a face full of spunk) almost always received by the people most likely to be offended (ie. 55 year old female administrative staff).
False positives seem to be more of a problem written in languages other than English. Pretty much all of our e-mail in Welsh language we receive through AOL has been tagged by AOL a
Solution (Score:2)
I use a three-outcome approach with SpamAssassin. Messages scored below 5 are delivered to the user's INBOX. Messages scored 5 or higher, but less than 10 go into the spam box. Messages scored 10 or higher are rejected during the SMTP session, with instructions on how to proceed.
I did this because, in practice,
Re:Try DSPAM (Score:1)
Accuracy... SpamAssassin generally offers higher accuracy with less effort, at first, but the accuracy degrades over time. DSPAM takes more effort initially, but offers higher, sustained accuracy over the long term. I see an average of about 99.5% long-term accuracy with dspam. I can't tell what the accuracy was with spamassassin, since it doesn't include a wa
Re:Try DSPAM (Score:2)
Re:Try DSPAM (Score:1)
I haven't tried dspam as a daemon yet, but intend to try it soon to see how it work
Re:Try DSPAM (Score:2)
Thanks, but... (Score:1)
Re:Thanks, but... (Score:1, Funny)
"So I let Google spam me in a targeted and personal manner via HTML rather than random people spamming me through SMTP."
I can understand why you're so proud.
Re:Thanks, but... (Score:1)
Re:Thanks, but... (Score:1)
Yeah, I bet at least 99% of gMail users know how to do that.
Re:Thanks, but... (Score:2)
So do I... and it could so easily be improved! (Score:2)
I really, really want an option for GMail to record the message-id of all messages I ever send through their server, and bounce any which are returned to me but which they haven't got on record as being sent by me.
I requested this ages ago, and it should be relatively straightforward. Does anyone else have this problem?
A 'chicken-and-egg' random thought (Score:2)
This is one of those things that makes me wonder...which "side" is pushing the technological envelope further and faster, the {spammers | malware slimers | virus breeders} or those who develop to defeat them?
Since it's generally agreed that history is written by the winners of a given conflict, I guess we won't have an answer to that until the war's over.
This comment generously brought to you by a severe lack of caffeine.
Re:A 'chicken-and-egg' random thought (Score:2)
Spammers used email to sell things whilst at the same time pissing everybody off. Eventually people hate spam so much that they are willing to pay for services that try and and eliminate spam.
It may not always be so but spammers have always been one step ahead, they have more incentive.
curious about MD (Score:1)
Geez from dealing with spammers to working with the crap DiamondTouch, Yerazunis is a real glutton for punishment
Re:curious about MD (Score:1)
A downside is that markovian is quite a lot more resource intensive than simple bayesian.
I used bogofilter (a fast bayesian filter) before CRM114. Even if it was harder to setup CRM114 than bogofilter and it used more resources, it was totally worth it.
Re:curious about MD (Score:2)
Re:curious about MD (Score:1)
Re:curious about MD (Score:2, Interesting)
Bayesian (burton)
TP: 785 TN: 1003 FN: 218 FP: 4 SC: 4 IC: 0
SR:
Re:curious about MD (Score:1)
this is one thing i'm struggling with, is how to compare the results of 2 filters on the same corpus.
we know FP's are substantially worse than any spam that gets through, but how much worse?
Re:curious about MD (Score:2, Insightful)
Re:curious about MD (Score:2)
He leavened it with appearances on Junkyard Wars [the-nerds.org].
OpenBSD port (Score:3, Informative)
Enhancement? (Score:1)
I thought that whitelisting had been a feature of every email reader/server since spam filtering began.
Re:Enhancement? (Score:1)
DSPAM's trusted sender whitelisting is automatic, based on who you converse with. It's not quite social networking, but is very useful, and requires no effort on the end-users part.
Not an advertisement... (Score:3, Interesting)
I replaced all those defenses with: DSPAM. And I'm seeing better results out of the box than I ever did with a multi-layered SA-based solution, even after a lot of time tweaking.
A quick anecdote: When I converted, I opened up a bunch of previously blocked spamtrap addresses, just to get some good training material for the filter. I've long since passed my initial training threshhold but haven't even bothered to block the spamtraps again because I never see the spam. At the risk of sounding like I'm bragging, I literally don't have a spam problem anymore, and DSPAM is entirely responsible for that.
Now, I'm not necessarily advocating that you give up all your custom defenses and switch to DSPAM. (I've turned off all my other filters, but I haven't removed them completely.) There's always a chance that an ingenious spammer will find a weakness in DSPAM setups, but I can testify to the fact that DSPAM is "scary good" as of right now. Training the filter is a simple matter of dropping misclassified messages (and there aren't many) into an IMAP folder.
If what you have is working for you, stick with it. But if you're looking for a low-maintenance, high accuracy filter, you should definitely give DSPAM a shot.
What is wrong with DSPAM? (Score:1)
Spamassassin is.
Bogofilter is.
Popfile is.
I thought it was the license, but seems that DSPAM is GPL.
So, can anyone comment? I'm not installing it
for my server if i can not apt-get it and have debian
security support for it.
Re:What is wrong with DSPAM? (Score:1)
There's been a lot of interest in this area but nobody's felt like taking it upon themselves to make a Debian distro AFAIK. Part of it may have had to do with the storage driver backend, which supports several different approaches, but required a recompile to switch from say Postgres to MySQL. In 3.6, the storage backend can be built dynamically making packaging much easier. Perhaps someone will pick 3.6 up now.