The Next Step in Fighting Spam: Greylisting 481

more effective

Open Relay: 1
Dialup Spam Source: 0
Confirmed Spam Source: 2
Smart Host: 0
Spamware Developer or Spamvertized site: 0
Unconfirmed Opt-In List Server: 0
Insecure formmail.cgi: 0
Open Proxy Server:8
The Next Step in the Spam Control War: Greylisting
By Evan Harris
Copyright 2003, all rights reserved.

Introduction
This paper proposes a new and currently very effective method of enhancing the abilities of mail systems to limit the amount of spam that they recieve and deliver to their users. For the purposes of this paper, we will call this new method "Greylisting". The reason for choosing this name should become obvious as we progress.

Greylisting has been designed from the start to satisfy certain criteria:

 1. Have minimal impact on users
 2. Limit spammers ability to circumvent the blocking
 3. Require minimal maintenance at both the user and administrator level

User-level spam blocking, while somewhat effective has a few key drawbacks that make its use in the continuing spam war undesirable. A few of these are:

 1. It provides no notice to the senders of legitimate email that is falsely identified as spam.
 2. It places most of the costs of processing the spam on the receivers side rather than the spammers side.
 3. It provides no real disincentive to spammers to stop wasting our time and resources.

As a result, Greylisting is designed to be implemented at the MTA level, where we can cause the spammers the most amount of grief.

For the purposes of evaluating and testing Greylisting, an example implementation has been written of a filter that runs at the MTA (Message Transfer Agent) level. The source for this example implementation is available as a link below, and as other implementations or additional utility code become available, they will also be linked.

Greylisting has been tested on a few small scale mail hosts (less than 100 users, though with a fairly diverse set of senders from all over the world, and volumes over 10,000 email attempts a day), however it is designed to be scalable, as well as low impact to both administrators and users, and should be acceptable for use on a wide range of systems, including those of very large scale. Of course, performance issues are very dependent on implementation details.

The Greylisting method proposed in this paper is a complimentary method to other existing and yet-to-be-designed spam control systems, and is not intended as a replacement for those other methods. In fact, it is expected that spammers will eventually try to minimise the effectiveness of this method of blocking, and Greylisting is designed to limit options available to the spammer when attempting to do so.

The great thing about Greylisting is that the only methods of circumventing it will only make other spam control techniques just that much more effective (primarily DNS and other methods of blacklisting based on IP address) even after this adaptation by the spammers has occurred.

The Greylisting Method
High Level Overview
Greylisting got it's name because it is kind of a cross between black- and white-listing, with mostly automatic maintenance. A key element of the Greylisting method is this automatic maintenance.

The Greylisting method is very simple. It only looks at three pieces of information (which we will refer to as a "triplet" from now on) about any particular mail delivery attempt:

 1. The IP address of the host attempting the delivery
 2. The envelope sender address
 3. The envelope recipient address

From this, we now have a unique triplet for identifying a mail "relationship". With this data, we simply follow a basic rule, which is:

  If we have never seen this triplet before, then refuse this delivery and any others that may come within a certain period of time with a temporary failure.

Since SMTP is considered an unreliable transport, the possibility of temporary failures is built into the core spec (see RFC 821). As such, any well behaved message transfer agent (MTA) should attempt retries if given an appropriate temporary failure code for a delivery attempt (see below for discussion of issues concerning non-conforming MTA's)

your first mistake (Score:4, Insightful)

security through obscurity, again? (Score:5, Insightful)

Re:security through obscurity, again? (Score:5, Interesting)

Re:security through obscurity, again? (Score:4, Insightful)

Re:security through obscurity, again? (Score:4, Insightful)

Re:security through obscurity, again? (Score:5, Insightful)

Re:security through obscurity, again? (Score:4, Interesting)

Re:security through obscurity, again? (Score:3, Informative)

Re:security through obscurity, again? (Score:5, Interesting)

Re:security through obscurity, again? (Score:3, Insightful)

Re:security through obscurity, again? (Score:4, Insightful)

That's a good point (Score:3, Interesting)

Re:your first mistake (Score:5, Insightful)

Re:your first mistake (Score:5, Funny)

Comment removed (Score:5, Funny)

Re:your first mistake (Score:5, Funny)

Re:your first mistake (Score:5, Funny)

I think not (Score:5, Interesting)

Re:I think not (Score:2)

Secret algorithms vs. secret keys (Score:3, Informative)

Re:your first mistake (Score:4, Informative)

Re:your first mistake (Score:5, Informative)

Re:your first mistake (Score:5, Interesting)

Re:your first mistake (Score:2, Interesting)

Spammers don't care about defeating the top 5%. (Score:3, Interesting)

Questions (Score:2, Insightful)

Re:Questions (Score:4, Insightful)

Re:Questions (Score:4, Informative)

Re:Questions (Score:2, Informative)

can't believe their numbers (Score:5, Informative)

Re:can't believe their numbers (Score:5, Informative)

Re:can't believe their numbers (Score:3, Interesting)

Poor use of statistics (Score:5, Insightful)

Re:here are the stats (Score:3, Interesting)

Open Relays a smaller problem? Viruses instead? (Score:2, Informative)

In case of /.'ing (Score:4, Informative)

Re:In case of /.'ing (Score:2)

Tempfailing is not new and unique (Score:5, Informative)

I am not sure what the spam filter is (Score:2)

Re:I am not sure what the spam filter is (Score:2)

Easy way to stop spam... (Score:3, Informative)

Re:Easy way to stop spam... (Score:3, Informative)

anyone@domain (Score:3, Informative)

Easy for end-users, sure. (Score:5, Insightful)

clever hack for WHOIS contact addresses (Score:5, Interesting)

1 false positive is not acceptable. (Score:3, Insightful)

Re:1 false positive is not acceptable. (Score:5, Interesting)

Re:1 false positive is not acceptable. (Score:4, Insightful)

Re:Reference for that paper (Score:3, Informative)

Time critical (Score:5, Insightful)

Re:Time critical (Score:5, Informative)

Re:Time critical (Score:2)

Re:Time critical (Score:2)

Re:Time critical (Score:3, Insightful)

Re:Time critical (Score:3, Insightful)

Re:Time critical (Score:3, Insightful)

Re:Time critical (Score:4, Insightful)

Re:Time critical (Score:2)

spam.....hrmmm (Score:5, Insightful)

How about Habeas' haiku method? (Score:4, Interesting)

Re:spam.....hrmmm (Score:2)

I'm not sure about this... (Score:3, Insightful)

Bayesian Filtering (Score:3, Interesting)

Re:Bayesian Filtering (Score:2)

Re:Bayesian Filtering (Score:2)

Re:Bayesian Filtering (Score:4, Insightful)

Re:Bayesian Filtering (Score:3)

I have my own algorithm (Score:2, Insightful)

Re:I have my own algorithm (Score:2, Insightful)

Copy of spam logged? (Score:2, Insightful)

spammesilly@gt.rr.com (Score:2)

RFC 3514 (Score:5, Funny)

Filtering out spam (Score:2)

Published a paper? (Score:4, Informative)

Re:Published a paper? (Score:2)

Re:Published a paper? (Score:5, Insightful)

Re:Published a paper? (Score:3, Insightful)

Waiting for Article Title (Score:5, Funny)

In many countries SPAM is illegal and... (Score:2)

Re:In many countries SPAM is illegal and... (Score:2)