Distributed Spam Detection 304

Posted by CmdrTaco on Saturday December 01, 2001 @01:20PM from the interesting-ideas dept.

A reader writes "There's an interesting project at SourceForge called "Vipul's Razor" that uses a Gnutella-like system to let users exchange spam "signatures" to filter spam. I work at an ISP in Ottawa, and we have been using it for the last two weeks to stop the bulk of the spam coming to our POP3 accounts. More impressively, it hasn't tagged any valid mail as spam yet. Here's the scoop from its webpage: "Vipul's Razor is a distributed, collaborative, spam detection and filtering network. Razor establishes a distributed and constantly updating catalogue of spam in propagation. This catalogue is used by clients to filter out known spam. On receiving a spam, a Razor Reporting Agent (run by an end-user or a troll box) calculates and submits a 20-character unique identification of the spam (a SHA Digest) to its closest Razor Catalogue Server. The Catalogue Server echoes this signature to other trusted servers after storing it in its database. Prior to manual processing or transport-level reception, Razor Filtering Agents (end-users and MTAs) check their incoming mail against a Catalogue Server and filter out or deny transport in case of a signature match."" Cool idea. I'm up around 80% spam a day on my main mail account. Might be worth a try.
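
For readers who want to see the report/echo/check cycle from the quoted description in concrete terms, here is a minimal sketch. It is not Razor's actual code or protocol: the class `CatalogueServer`, the function `spam_signature`, and the in-memory peer list are all hypothetical, and it assumes the "SHA Digest" is a SHA-1 hash (20 bytes) taken over the raw message body.

```python
import hashlib


def spam_signature(message_body: bytes) -> bytes:
    """Return a 20-byte SHA-1 digest of the message body (assumed input)."""
    return hashlib.sha1(message_body).digest()


class CatalogueServer:
    """Toy in-memory stand-in for a catalogue server (hypothetical)."""

    def __init__(self):
        self.signatures = set()  # known spam signatures
        self.peers = []          # other trusted servers to echo to

    def report(self, signature: bytes) -> None:
        # Store first, then echo. Skipping known signatures stops the
        # echo from bouncing between peers forever.
        if signature in self.signatures:
            return
        self.signatures.add(signature)
        for peer in self.peers:
            peer.report(signature)

    def is_spam(self, signature: bytes) -> bool:
        return signature in self.signatures


# A reporting agent submits to one server; a filtering agent asks another.
server_a = CatalogueServer()
server_b = CatalogueServer()
server_a.peers.append(server_b)
server_b.peers.append(server_a)

server_a.report(spam_signature(b"Make money fast!"))
print(server_b.is_spam(spam_signature(b"Make money fast!")))
```

A filtering agent would call `is_spam` on each incoming message's signature before delivery; a real deployment would of course talk to the servers over the network rather than share objects in one process.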

Idiotic (Score:1, Insightful)

by Anonymous Coward writes: on Saturday December 01, 2001 @01:26PM (#2641278)

90% of spam I get has a subject like:

"New pill reduces debt! 513456"

So, a message digest won't work.

Anyone know where these people live? (Score:1, Insightful)

by oman_ ( 147713 ) writes: on Saturday December 01, 2001 @01:28PM (#2641282) Homepage

Just curious.. has anyone compiled a list of known spammers and their home addresses?

Great use of p2p (Score:5, Insightful)

by astrashe ( 7452 ) writes: on Saturday December 01, 2001 @01:29PM (#2641293) Journal

This is a great use of p2p -- something that doesn't involve piracy. I wish I had heard of it before.

Are there any other innovative non-piracy p2p apps out there that we should know about?

Authentication with servers? (Score:5, Insightful)

by GlassUser ( 190787 ) writes: <`slashdot' `at' `glassuser.net'> on Saturday December 01, 2001 @01:30PM (#2641296) Homepage Journal

I read some of the documentation, but I can't find details on a couple of questions. Do the servers authenticate with each other? It was implied, but how deep is it? Are the SHA signatures signed to the originating server (or client/trollbox) too? I think this kind of model is great, but if you don't have some nifty authentication/accountability, it can be wide open for abuse. I'm sure anyone reading Slashdot can imagine a vengeful spammer flooding the network with bogus or malicious hashes.
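
One way to picture the accountability being asked about: each report carries a tag that ties the spam signature to whoever submitted it, so a server can reject reports it cannot attribute. The sketch below is only an illustration and says nothing about what Razor really does. It assumes a shared secret per reporter and uses HMAC-SHA256; `sign_report` and `verify_report` are hypothetical names, and between mutually untrusted parties public-key signatures would be the more likely choice.

```python
import hashlib
import hmac


def sign_report(signature: bytes, reporter_key: bytes) -> bytes:
    """Tag a spam signature with the reporter's secret key (assumed shared)."""
    return hmac.new(reporter_key, signature, hashlib.sha256).digest()


def verify_report(signature: bytes, tag: bytes, reporter_key: bytes) -> bool:
    """Accept the report only if the tag matches the claimed reporter."""
    expected = sign_report(signature, reporter_key)
    # compare_digest avoids leaking information through timing differences.
    return hmac.compare_digest(expected, tag)


reporter_key = b"secret known to reporter and server"
spam_sig = hashlib.sha1(b"Make money fast!").digest()
tag = sign_report(spam_sig, reporter_key)

print(verify_report(spam_sig, tag, reporter_key))       # genuine report
print(verify_report(spam_sig, tag, b"some other key"))  # forged reporter
```

Attribution alone does not stop a registered reporter from submitting bogus hashes, but it does give the server someone to stop trusting.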

How about a server frontend approach? (Score:3, Insightful)

by serial frame ( 236591 ) writes: on Saturday December 01, 2001 @01:32PM (#2641306)

It would be very neat if this were provided as a free service that acts as a front-end to an existing POP3 account. Simply sign up, provide info like your username, POP3 host (but not password; that can be passed from the service to your POP3 server on log-in for safety reasons). Then, point your favourite mail client at the service's POP3 server, and...voila. Same e-mail, minus the spam.
Nothing truly insightful here, just speculation from a convenience freak.

idea won't work if reaches critical mass (Score:4, Insightful)

by intuition ( 74209 ) writes: on Saturday December 01, 2001 @01:37PM (#2641321) Homepage

Razor catalogs spam by hashing the entire text of the message. Later potential spam is "detected" by hashing entire texts of messages to see if the hash matches any of the existing hashes in the spam catalog.

To get around this, all a spammer has to do is change or add at least one character in each spam. This would make every hash unique, and no spam would be detected.
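
The point is easy to check. This small sketch hashes two copies of the subject line quoted earlier in the thread that differ only in the last digit; the helper `digest` is a hypothetical name, and SHA-1 is assumed as the digest.

```python
import hashlib


def digest(text: str) -> str:
    """Hex SHA-1 digest of a piece of text."""
    return hashlib.sha1(text.encode("utf-8")).hexdigest()


original = "New pill reduces debt! 513456"
variant = "New pill reduces debt! 513457"  # one character changed

# An exact-match catalogue that knows `original` would not match `variant`.
print(digest(original) == digest(variant))  # False
```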

Open for abuse? (Score:2, Insightful)

by robstah ( 537647 ) writes: <robsterNO@SPAMdebian.org> on Saturday December 01, 2001 @01:45PM (#2641345) Homepage

Although I marvel at the theory and the innovative use of peer-to-peer technology to achieve exemplary aims, I have some concerns about the possibilities for abuse. AFAIK the submission system for spam is not moderated in any way. In fact, only the hash is sent to the server and not a copy of the spam. I am therefore concerned that the system could be abused by someone submitting the hash of a legitimate mail, which would then prevent that email from being received by the other hosts. A malicious user could do this to block the circulation of Bugtraq items, for instance. And since everyone has a different personal opinion about spam and what constitutes it, I think a set of clear guidelines is required, and that when submissions are made a copy of the mail should be associated with each one and a human being should moderate the hashes being submitted. Although I have my doubts about the system, if these were put to rest I would have no hesitation in implementing a system like this.

Re: Distributed spam filter (Score:3, Insightful)

by blibbleblobble ( 526872 ) writes: on Saturday December 01, 2001 @01:45PM (#2641347)

It does seem like a remarkably sensible system, just getting email clients to talk to each other about the emails they get.

You can tell if the same email has been sent to hundreds of people (and if you use hashes, you can do that without revealing the email).

You can click a "this is spam" button when you read an email, and anyone who trusts you (i.e. has your public key in their "trusted filtering friends" list) can look for similar messages and filter them.

But, there do seem to be a load of problems:
- Personalised email, as someone already mentioned
- Privacy problems with letting others into the secrets of your mailbox
- If you have the original of a message, you can calculate the hash, then see who else got the message (i.e. works for personal mail as well as spam)
- Relatively easy for malicious users to wrongly label someone as a spammer

Well worth investigating, though...
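
A rough sketch of the "trusted filtering friends" idea, under some loud assumptions: reporters are identified by a plain string standing in for a public key, no signature checking is done, and `Mailbox`, `receive_flag`, and `message_hash` are hypothetical names. Only hashes are exchanged, never message text.

```python
import hashlib


def message_hash(raw: bytes) -> str:
    """Hash a message so it can be compared without sharing its content."""
    return hashlib.sha1(raw).hexdigest()


class Mailbox:
    def __init__(self, trusted_friends):
        self.trusted_friends = set(trusted_friends)  # stand-ins for public keys
        self.flagged = set()                         # hashes flagged as spam

    def receive_flag(self, reporter: str, msg_hash: str) -> None:
        # Flags from people outside the trusted list are ignored.
        if reporter in self.trusted_friends:
            self.flagged.add(msg_hash)

    def is_flagged(self, raw: bytes) -> bool:
        return message_hash(raw) in self.flagged


box = Mailbox(trusted_friends={"alice"})
spam = b"New pill reduces debt!"

box.receive_flag("mallory", message_hash(spam))  # untrusted, ignored
print(box.is_flagged(spam))
box.receive_flag("alice", message_hash(spam))    # trusted, accepted
print(box.is_flagged(spam))
```

The same `message_hash` call also shows the privacy problem listed above: anyone holding the original of a message can compute its hash and look for it in other people's flags.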

One way around potential abuse. (Score:5, Insightful)

by chris_7d0h ( 216090 ) writes: on Saturday December 01, 2001 @01:56PM (#2641381) Journal

To eliminate the situation where one person posts a lot of "incorrect" signatures, a ranking system could be applied.
The thought goes like this.
A person submits a signature of "identified" spam mail to a "supernode", for example, and the submission gets a ranking of 1. Each additional submission (by other users) increases the score by a number.

This way, there are several classifications which could be used to filter incoming mail. For the mail providers, they could opt for only removing mail matching signatures with a very high score (thus very likely these will be actual spam) or they could filter anything reported.

The purpose of allowing classifications is that it takes longer to reach higher scores, since more people have to report the specific spam mail. Some people wish to eliminate anything the least bit suspect, but mileage may vary.

Do you see a resemblance with the /. moderation?
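
The ranking scheme described in this comment could be sketched roughly as follows. Everything here is an assumption for illustration: the class `SignatureRanking`, the choice to add exactly 1 per distinct reporter, and the `threshold` parameter that a mail provider would tune.

```python
from collections import defaultdict


class SignatureRanking:
    """Score each spam signature by how many distinct users reported it."""

    def __init__(self):
        self._reporters = defaultdict(set)  # signature -> set of reporter ids

    def submit(self, signature: str, reporter_id: str) -> None:
        # A set means repeat submissions by the same user add nothing.
        self._reporters[signature].add(reporter_id)

    def score(self, signature: str) -> int:
        return len(self._reporters.get(signature, ()))

    def should_filter(self, signature: str, threshold: int = 1) -> bool:
        # threshold=1 filters anything reported; higher values wait for
        # more independent reports before acting.
        return self.score(signature) >= threshold


ranking = SignatureRanking()
ranking.submit("abc123", "user1")
ranking.submit("abc123", "user1")  # same user again, score unchanged
ranking.submit("abc123", "user2")

print(ranking.score("abc123"))                       # 2
print(ranking.should_filter("abc123", threshold=5))  # False
```

Counting distinct reporters rather than raw submissions is what keeps one person from inflating a score, although it still relies on reporter identities being hard to forge.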

Re:I've managed to filter most spam (Score:2, Insightful)

by LiteForce ( 102751 ) writes: on Saturday December 01, 2001 @03:22PM (#2641547) Homepage

This won't work if somebody has sent you a message by way of BCC (Blind Carbon Copy).

an other effective spam stopping method ? (Score:3, Insightful)

by Sarin ( 112173 ) writes: on Saturday December 01, 2001 @03:24PM (#2641551) Homepage Journal

I receive about 40 spam messages in my mail account each day and I run my own mail server (qmail). Someone told me about a very basic spam stopping method. Just remove the mail account for a couple of weeks and then reconnect it; you should get less or no spam after that period.

I receive too many real messages to try this out, and I think most spammers won't bother to actually remove an email address from their database if it doesn't exist. But has someone else tried this with any luck?

This p2p spam filter sounds really nice and I'm going to give it a try asap. I already "lost" another mail account in the flood of spam I got on it, so now it forwards all messages to msnbill@microsoft.com (microsoft domain billing address).

Re:So... (Score:4, Insightful)

by Greyfox ( 87712 ) writes: on Saturday December 01, 2001 @04:05PM (#2641639) Homepage Journal

Spammers themselves are generally interested in ways to disrupt those lines of defense. If this project grows in popularity and shows itself to effectively block spam, they'll start gunning for it. Considering potential holes in the system before that starts happening really isn't a bad idea.

Why not a histogram filter? (Score:1, Insightful)

by javaaddikt ( 385701 ) writes: on Saturday December 01, 2001 @04:47PM (#2641764)

The best option would be a word count histogram filter. Then the spammers would have to entirely alter their language or sales pitch, which isn't going to happen. Just like handwriting, it is hard to change unless you make a wholesale effort at changing it. They're too lazy, too.
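
To make the histogram idea concrete, here is one possible reading of it: count the words in each message and compare the counts of a new message against those of known spam. The comment does not say how histograms would be compared, so cosine similarity is an assumption here, as are the function names and the choice to ignore digits.

```python
import math
import re
from collections import Counter


def word_histogram(text: str) -> Counter:
    """Count lowercase words; digits and punctuation are ignored."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """1.0 for identical word distributions, 0.0 for no shared words."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(c * c for c in a.values())) * math.sqrt(
        sum(c * c for c in b.values())
    )
    return dot / norm if norm else 0.0


known_spam = word_histogram("New pill reduces debt! 513456")
new_spam = word_histogram("New pill reduces debt! 998877")
real_mail = word_histogram("Meeting moved to Tuesday, agenda attached")

print(cosine_similarity(known_spam, new_spam))   # same words, random suffix
print(cosine_similarity(known_spam, real_mail))  # no words in common
```

Unlike an exact digest, a comparison like this is not thrown off by a random number tacked onto the message, but it needs a similarity cutoff, and choosing one is where false positives would creep in.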

Re:So... (Score:4, Insightful)

by dev0n ( 313063 ) writes: on Saturday December 01, 2001 @04:51PM (#2641781) Homepage

Seems like everyone hates spam with a passion, except maybe the spammers themselves

well, i would have to disagree with you on this point.. i work at a web hosting company as the technical support manager, and handling abuse complaints falls into my realm of responsibility... and i have found that a significant number of first time spammers do not KNOW that spam is "wrong", and get quite upset that they were "taken" by companies that send bulk messages on their behalf. i had one gentleman send me an apology letter that actually made me feel sorry for him. he, and many other people on our network, have never been repeat spammers.

i know that there are many people out there who don't care, but we can't automatically assume that all spammers are evil. some of them are just ignorant.
