Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

How to Prevent Form Spam Without Captchas

Posted by ScuttleMonkey on Wed Nov 08, 2006 12:30 PM
from the can't-beat-the-curte-power-of-kittenauth dept.
UnderAttack writes "Spam submitted to web contact forms and forums continues to be a huge problem. The standard way out is the use of captchas. However, captchas can be hard to read even for humans. And if implemented wrong, they will be read by the bots. The SANS Internet Storm Center covers a nice set of alternatives to captchas. For example, the use of style sheets to hide certain form fields from humans, but make them 'attractive' to bots. The idea of these methods is to increase the work a spammer has to do to spam the form without inconveniencing regular users."
+ -
story

Related Stories

[+] Developers: HTML Encoded Captchas 177 comments
rangeva writes to tell us about a twist he has developed on the common Captcha technique to discourage spam bots: HECs encode the Captcha image into HTML, thus presenting an unsolved challenge to the bots' programmers. From the writeup: "The Captcha is no longer an image and therefore not a resource they can download and process. The owner of the site can change the properties of the Captcha's HTML, making it unique,... add[ing] another layer of complication for the bot to crack." HECs are not exactly lightweight — the one on the linked page weighs in at 218K — but this GPL'd project seems like a nice advance on the state of the art.
[+] Carnegie Mellon CAPTCHA Digitization Project Now Underway 119 comments
tomandlu writes "The BBC is reporting that Carnegie Mellon University has found a novel use for CAPTCHAs — deciphering old texts. We've discussed this project before, but it was prior to it getting off the ground. Users Entering text acts as a sort of distributed computing project. Basically, the CAPTCHA is made up of two words — one of which is known to Carnegie, and one of which isn't. If the user correctly deciphers the known word, then the unknown word is assumed to be correct. Well, almost. Two different users must give the same answer to the same unknown CAPTCHA before it is taken off the list. 'Using the reCAPTCHA system von Ahn's team is digitizing documents and manuscripts as fast as the Internet Archive can supply them, and the good news for book lovers (and bad news for spammers) is that the supply of reCAPTCHAs is not likely to dry up any time soon.'"
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • And how... (Score:5, Interesting)

    by Creepy Crawler (680178) on Wednesday November 08 2006, @12:36PM (#16770691)
    Ok, so captchas and other email obfuscation mechanisms are used a lot. Fine, a web designer can choose to do this.

    Now, lets enter US law: American with Disabilities Act. Target [arstechnica.com] is currently being sued for NOT complying with this federal law. I can understand why businesses would be required for this, but where will the net-boundaries stop?

    For example, I have a US corp. I hire an offshore datacenter to handle web processing. Is my website have the compulsory ADA lawss upon it, or do they not apply due to international boundaries? Yipe.
    • Perhaps the vision impaired could get audio captchas?

      Click this button, listen to the sound, and then choose the selection what the sound was.

      Like birds chirping, babies crying, piano playing and maybe other familiar sound effects that you would choose from a multiple choice list.

      Of course if the user is deaf and blind, I'm not sure how they are using a computer to begin with.
    • So the government will get you when you don't completely comply with a regulation that affects a small group of people, but at the same time doesn't do anything to rid the world of the constant barrage of spam that annoy *everyone*, including the disabled?

      Well, that's nice then...

    • Re: (Score:3, Insightful)

      Now, lets enter US law: American with Disabilities Act.

      So? Just put a phone number on the site with a "If you are disabled and can't use our captcha, please call our tech support and we'll set up an account."
        • Re:And how... (Score:4, Insightful)

          by tomstdenis (446163) <tomstdenis@NoSpAM.gmail.com> on Wednesday November 08 2006, @12:59PM (#16771147) Homepage
          I think you'd find Slashdot very much more trivial and redundant if all non-Americans left.

          That said, ADA's can go fuck themselves. I can see making exceptions for EMPLOYEES but why would I have to go out of my way to help customers? What if it's simply not cost effective? If it costs millions to placate the handful of noisemakers is it worth the effort?

          Being blind really has to suck. And *I DO* wish that companies would help them out. I don't think we should force them though as it can lead to smaller companies who can't afford to deal with it going out of business.

          Sure, our websites would then be ADA compliant, but there would only be a handful of mega-corp websites at that point. So you're trading what little free market economy we have left to placate special interest groups.

          Frankly, if I were blind I'd make due and where I couldn't I'd rely on friends or family. No shame in asking a family member to order something from a website for you. Granted "disabled" folk want their independence, they also have to be practical about it....

          Tom
          • by heinousjay (683506) on Wednesday November 08 2006, @01:04PM (#16771271) Journal
            I think you'd find Slashdot very much more trivial and redundant if all non-Americans left.

            Indeed. I would miss the self-righteous off-the-mark diatribes about how we should run our country. I wouldn't be able to get my daily fill of hubris from people who think they are superior in every way. I don't know what I would do.
            • Well, in all fairness, at we least we furriners just give you an earful, whereas typical American hubris is usually delivered via shock and awe.

              Mods: go nuts! I have karma to burn, bitches.
            • Re:And how... (Score:5, Insightful)

              by tomstdenis (446163) <tomstdenis@NoSpAM.gmail.com> on Wednesday November 08 2006, @02:40PM (#16773319) Homepage
              You've obviously never ran a small business so you have no fucking clue whatsoever.

              Adding ADA compatible facilities and also making sure you're compliant costs money that most small companies don't have to spend. Given that it's to cater to a SMALLER market segment it's not good business sense to do it.

              And why should disabled people not expect to be 100% independent? Because majority rules. Sorry dude. Why should I cripple my business so you can read my literature? You don't have a right to be my customer. You have a right to employment, and to that end I'd have to at least accept the resumes of disabled folk. But i don't have to cater to the whims of every nancy out there with a problem.

              Not that catering is bad. I think if a company has the means and market it should attempt to go all ADA compliant. I think it's a good thing to get ramps, lifts, braille/etc. I just don't think it's a good idea to FORCE it upon people.

              Tom
              • Re:And how... (Score:5, Insightful)

                by headonfire (160408) on Wednesday November 08 2006, @03:47PM (#16774703)
                yeah, yeah I have. I helped run and manage a family antique shop for several years, then got out of the business to do more interesting things.

                It's not about being 100% independent, it's about being as independent as possible. It's about all the small shit that YOU take for granted. It's being able to take a hot bath without worrying if you're going to boil your nerveless legs off, get an infection, and die. it's being able to cook your own meals, at least once in a while; or get your own groceries, or buy the things that other people are buying. Why does a disabled person have to do without, or beg for help from someone? And what if there -isn't- anyone to help, an all too common situation? Shit, my buddy can't even leave the house without someone to help him right now. He's got a visiting nurse who is nice and brings him some fast food once in a while so he can have a bit of variety.

                If you're running a small enough shop, being ADA compliant isn't hard anyways, and can amount to a ramp and a handrail. Get some lumber, nails and a hammer and do it yourself! Shit, grants and tax incentives are even available for that shit! And offer assistance to the blind guy or girl, don't tell him/her to fuck off and learn to read. If you don't have regular blind customers, wait until someone asks before you spend the money on braille if you're gonna be cheap.

                Goddamn, it's not asking you to suck a dick and buy a ferrari for every cripple who walks or wheels into your storefront! Just let people do their thing, regardless of their physical abilities! It's not about making a ton of money, it's about DOING THE RIGHT THING FOR PEOPLE. And yeah, yeah I DO have a right to be your customer. You cannot deny me custom in your public shop because of my race, gender, religion, or physical ability. That's the law. You have the right not to sell and expose yourself to a lawsuit, but I do have the right to enter your shop until you tell me to leave.

                You know why it's law? because without the law, nobody would do it, because so many people are amoral cheapasses, particularly business owners. That's why we developed employee, child labor, and consumer protection laws - business owners weren't exactly chomping at the goddamn bit to be nice to people, not when it might cost a few dollars off the top.
              • Re:And how... (Score:4, Insightful)

                by SillyNickName4me (760022) <dotslash@bartsplace.net> on Wednesday November 08 2006, @09:32PM (#16779409) Homepage
                You've obviously never ran a small business so you have no fucking clue whatsoever.

                Maybe gp doesn't, but I do, and I also happen to be visually impaired (not blind, but bad enough to never be able to drive a car, not be able to read any signs that I can't get close to etc)

                Adding ADA compatible facilities and also making sure you're compliant costs money that most small companies don't have to spend. Given that it's to cater to a SMALLER market segment it's not good business sense to do it.

                It costs money in quite some cases, but this is to expand your market, not to cater to a smaller market.

                And why should disabled people not expect to be 100% independent? Because majority rules. Sorry dude.

                What you just described is tirany by the majority, not a democratic society. You may not have noticed, but the system in the USA has all kinds of provisions to try to prevent exactly that. Actually taking into account the needs of minorities, upto individuals, is a fundamental part of the system.

                Why should I expect to not be 100% dependent? because there is no reason why I should be. I am actually in a situation where I am not much more dependent on others then I would be without being visually impaired. That is for a substantial part a consequence of my own choices, and it is first of all my own responsibility to see to this. That said, I am hindered by many things that would not have costed money to prevent, will cost little to fix, and mostly happen out of ignorance, not because of it costing money. I don't see anything wrong with getting people to put a little thought into this, if needed by means of the law.

                Not to mention that when as many disabled people as possible can be as independent as reasonably possible, the outcome for society as a whole is surely better from a social point of view, and it is quite likely cheaper on the whole as well.

                Why should I cripple my business so you can read my literature?

                Expanding your potential market is not in itself crippling your business.

                You do have a point that it may not be worth it financially when you have to do things like install ramps, elevators etc, it may not fit into your specific building for cosmetical or whatever other reasons, and you can quite rightfully ask how far this should go anyway.

                Hence I don't think that there should be laws forcing this onto companies, rather, those who do try to be accessable to disabled people should get the possible cost compensated in the form of tax breaks for example.

                You don't have a right to be my customer.

                No, but depending on where exactly you live, you might not be allowed to discriminate against me based on disability.

                You have a right to employment,

                Again this depends on local law, this is different from state to state in the USA, and even more different between countries..

                and to that end I'd have to at least accept the resumes of disabled folk. But i don't have to cater to the whims of every nancy out there with a problem.

                Making sure you do not create obstacles for disabled people out of ignorace is not catering to the whims of everyone out there with a problem, it is being a decent human who tries to better the society he lives in. Being forced to incure cost for the sake of a better society however is not a good thing (because of the forced part of it), encouragement to do a bit extra in the form of compensation however seems like a worthwhile idea to me.

                Not that catering is bad. I think if a company has the means and market it should attempt to go all ADA compliant. I think it's a good thing to get ramps, lifts, braille/etc. I just don't think it's a good idea to FORCE it upon people.

                This I completely agree with, and since for all I can tell this was your real point also, maybe do yourself the favor to slow down a bit before posting such rants as the one in front of it, you have a reasonable and well defendable point of view I believe, but much of your post is going to prevent people from seeing that because it rather makes you look unreasonable and extremist.

  • by Thansal (999464) on Wednesday November 08 2006, @12:36PM (#16770695)
    Why is it so hard to make a captcha that a bot can't read but a human can?

    The slashdot captchas are among the easiest I have ever seen to read, however I still havn't seen any spam on slashdot. Is there something else goign on here? It can't be anything like IP banning or flood controlls as those don't stop botnets. Is it that spammers just don't target slashdot? or is it that captcha reading bots are not nearly that good at breaking them and we could tone down the level of those horrible tiwsted-doted-lined Captchas?
    • Try running the Slashdot front page through crm114 sometime and see if it really is better than a human (specifically, better than you) at distinguishing spam from legitimate content. ;)
    • by Anonymous Coward on Wednesday November 08 2006, @12:42PM (#16770829)
      Men's and Ladies Prestige Watches For all occasions! Perfect Christmas gifts!

      These replicas have all the presence and poise of the originals after whome they were designed at a fraction of the cost. The attention to detail is paramount and they are comparable to the originals in every way.

      To view our huge inventory visit our website now at:

      http://pwned31337.ku/ [pwned31337.ku]

      : Replicated to the smallest detail
      : 98% A+ Accuracy
      : Includes all Proper Markings
      : Wide selection and fast worldwide shipping
      : Authentic Weight
      : True-to-original self winding and quartz mechanisms
      : Guaranteed worldwide Christmas delivery
    • by everphilski (877346) on Wednesday November 08 2006, @12:43PM (#16770851) Journal
      Think about it ... the slashdot crowd is technical and informed and "knows better" ... why would someone spambot slashdot? It surely would not be effective...
    • Re: (Score:3, Insightful)

      I've always wondered why designers don't use something simpler such as showing a picture of an easily identifiable object and requiring the user to identify it. This would work in 99.9% of cases. Alternatively, for the screen reader crowd, the check could something like, "What is the fifth word in this sentence?" There's probably some obvious flaw with this technique that I'm not thinking of, or I imagine it would have been done already.
      • Re: (Score:3, Insightful)

        The obvious flaw is that you need to create each one, and they therefore are inherently more limited in number. Text-based chaptchas are generated by a computer- pictures of pandas and their associated word would have to be done by hand.
      • by Thansal (999464) on Wednesday November 08 2006, @12:54PM (#16771063)
        I actualy like the ones like that.

        instead of obfuscated images, just put in plain text questions.
        What is 2+2?
        What is the 3rd word in this sentance?
        What is the name of my blog?

        All of these can be answered by some one using a screen reader, and take less time then figguring out a captch. Sure it does not stop manual spamming, but what does?
        • Re: (Score:3, Informative)

          instead of obfuscated images, just put in plain text questions.

          That's been considered before. The problem with that approach is that, unlike image-based CAPTCHAs, there are a limited number of templates available for natural-language questions. The spammer just has to compile a list of the various patterns of questions and answers, a much easier task than designing an OCR program capable of extracting random, disconnected letters and numbers from a randomly distorted image. The problem is essentially on

        • by Trailer Trash (60756) on Wednesday November 08 2006, @02:05PM (#16772517) Homepage
          What is the 3rd word in this sentance?

          How about:

          Which word is spelled incorrectly in my sentance?

      • Re: (Score:3, Informative)

        These questions or pictures again need to be either automatically generated or generated by humans. If automatically generated, they would need to follow a pattern, and so the challenge would then be on the spammers to identify the pattern and train their bots to read the pattern and respond appropriately.

        If, on the other hand, they're generated by humans, it would be expensive to generate each one, and so they'd be limited in number. Therefore the spammers simply go about collecting each one, identifyin

      • Re: (Score:3, Insightful)

        Except that there is no such thing as a picture of an easily identifiable object, especially not if you don't want to block non-English speakers. People will come up with many different words for the same thing, people will misspell it, people will not know the English word for it, and people will just not know what it is.
        • Awesome, I'm sitting here during my lunch break at work, checking out that page, and what do I see under some of the sample captchas? Goatse, barely distorted.
    • Re: (Score:2, Informative)

      Why is it so hard to make a captcha that a bot can't read but a human can?

      Numerous times there is confusion between I and L. Since every site uses its own set of images and its own 'set of rules to obfuscate', the user has all the reasons to be confused. Then there is 3 coupled with something that makes it look like B etc.

      Ofcourse, you will fail one time only, as on next reload you will get a new image to read, but as the article says, user response drops. People want to help you and you are making it, kind

      • zero and "o" are my 2 problems genneraly.

        However, as I said, I have never failed a slashdot captcha, probably because they are all words....
    • In a word: accessibility. Blind readers can't see graphic-based captchas and screen readers won't read them. Audio-based captchas have been used, but they can be difficult for some people with disabilities as well, are often difficult even for abled people and may be easier to process by bots in many cases.

    • Re: (Score:3, Informative)

      What's worked surprisingly well for me is simple arithmetic. Adding a random math problem such as 2 + 5 = [ ] or 3 + 4 = [ ] has DRAMATICALLY decreased the amount of form spam two of my websites have received.
    • by Pichu0102 (916292) <pichu0102@gmail.com> on Wednesday November 08 2006, @12:52PM (#16771029) Journal
      The slashdot captchas are among the easiest I have ever seen to read, however I still havn't seen any spam on slashdot.

      You obviously don't browse the comments at -1.
    • Slashdot has a couple of extra things going for it:

      * A "lameness filter" which excludes certain posts (ill-defined and probably continually changing to keep up)

      * A 20-second rule which prevents you from blasting the board

      * Moderation, which puts anonymous posts in a place most people don't read anyway. They may be there and you don't see them.

      That's still not sufficient for some jackass not to at least try, especially since the audience is so large. It may not be worth the trouble, since Slashdotters are r
      • Re: (Score:3, Insightful)

        actualy, I browse at 0, as alot of ACs have some rather good posts. (infact I brwse at 0 Nested, so I see even more of these posts)

        I still have yet to see anything that was an ad, I have seen pleanty of trolls, but those are not bots. I forgot about the lameness filter, and I admit to being curious if that is catchign things....
    • by nuzak (959558) on Wednesday November 08 2006, @01:59PM (#16772359) Journal
      > Why is it so hard to make a captcha that a bot can't read but a human can?

      Because anything difficult to OCR can be a real pain for humans too. Still, it's not that spammers are mass-OCR'ing images, it's that they actually get humans to enter the captchas, sometimes providing porn as a reward, but it's sometimes also a paid operation with goldfarming-style sweatshops. In a way, this is fine, because it scales far worse than full scale automation, but it does keep captchas from being a panacea.

      It's the combination of the captcha, rate controls, and moderation that keeps spam out of here. All links here have rel="nofollow" as well, making them useless for google spamming, and the spammers know it. Basically it's a poor return on investment when you can spam a bunch of blogs that are wide open.
  • Javascript (Score:5, Interesting)

    by Aladrin (926209) on Wednesday November 08 2006, @12:39PM (#16770757)
    I hadn't read the article yet, and just the summary, and as soon as they said 'hidden fields' that are attractive to spambots, I thought "Why not hide the fields from the spambot instead?"

    It's easy, you just have the javascript create all or part of the form. Or modify the form in some way. It would happen before the user even sees the form, and the spambot would have to implement a javascript parser to get it. (Or a parser, that's unique to your site.)

    I would think AJAX would be a huge hamper to them as well.
    • Re: (Score:3, Informative)

      Please, read before you respond.

      "I hadn't read the article yet," is NOT the same as "I haven't read the article yet,"

      I've read it. You can stop posting the same 'rtfa' over and over. Jeez.
        • Re:Javascript (Score:4, Informative)

          by masterzora (871343) <masterzora@@@gmail...com> on Wednesday November 08 2006, @05:06PM (#16776195)
          The reason CSS doesn't cause the same issues is because the CSS method isn't dependent on the CSS working. If the CSS doesn't work, then, oh well, good thing we have this text telling the user not to use those forms. If the Javascript doesn't work, crap, the user can't even see the necessary forms. See the difference?
  • Blind users (Score:4, Insightful)

    by awtbfb (586638) on Wednesday November 08 2006, @12:42PM (#16770817)
    This is still somewhat problematic for blind users. If decoy field names are picked up when CSS is turned off, then there will be a lot of users exposed to the bogus fields.
    • Re: (Score:3, Insightful)

      It's fairly trivial to also hide a comment telling non-CSS-browser-users to leave a field blank.

      Blind people can't see. They aren't stupid. :P (Well, any more then anyone else.)
    • Use CSS' media types.

      Aural, braille, and embossed are all media types that would hide the fields for blind users if done correctly (i.e. used and the reader supports it, which you'd think they would want to). This technique is not the only reason why blind user's tools need to work differently based on mediate type in CSS.
  • Just shoot 'em on sight.

    KFG
  • Private Key encrypt the randomized field names and have a hidden Public Key field. That way, the fields foo, bar, and abacab have no sense of meaning to the bots, but will decrypt to subject, body, and spammer catcher.
  • by gorckat (960852) on Wednesday November 08 2006, @12:54PM (#16771057)
    ...can it be clearly labeld as bogus? Something like:

    Subject: _______{-enter your spam topic here if you want me to disregard your email

    Can the label/tag telling someone to leave a field blank be hidden form a bot but clearly visible to a live person?
  • Hiding things seems like a good way to get search engines to not like you.

  • My Method (Score:3, Interesting)

    by CastrTroy (595695) on Wednesday November 08 2006, @01:01PM (#16771211) Homepage
    My Method is to just disallow posting of html. I have a simple blog, and if they try to do anything like post too many HREFs or or something, then I just deny the post. That seemed to work for the most part. The bots usually tried to post URLs on my site, so if they posted something like <a href=.... then I would just display an error message, since html doesn't show up properly anyway, because I encode the < and > with &lt; and &gt;. They also try posting [link]...[/link] which also doesn't work on my blog, so I just display an error message and let the user fix it. You can still post straight URLs, but that's not too good for spammers, because they usually want a link. I also stop people from trying to post more than 5 URLs in a single post, since I noticed the bots like to do that. I recently upgraded by blog to use AJAX to submit the comments. Adds an extra layer of protection against the bots, but I really haven't needed any since I added in the filters mentioned above.
  • Related Story (Score:4, Informative)

    by Amazing Quantum Man (458715) on Wednesday November 08 2006, @01:13PM (#16771411) Homepage
    Since the editors didn't see fit to put this in related links:

    What Ways Can Sites Handle Spambot Attacks? [slashdot.org]

  • by liangzai (837960) on Wednesday November 08 2006, @01:41PM (#16771917) Homepage
    This will prevent 100% of the bots from even entering your page... ... plus a few IE users.
  • Vbulletin forums? (Score:3, Informative)

    by Shoeler (180797) * on Wednesday November 08 2006, @02:11PM (#16772657)
    I run two largish Vbulletin forums - and we get at least 1-2 spammers a day. I haven't found a way to prevent them yet, but I have found a way to stop em from getting any traffic or money for the unsuspecting idiot that clicks on them.

    I use an anti-spam e-mail technique: blacklist.

    Vbulletin has a censoring system where words you choose can be replaced with your choice of characters - by default it's an *. www.clickmeforspam.com, where I would use the "clickmeforspam.com" as the censored word, shows up as www.****************** .

    It's quite hilarious to see the humans behind the spam, who have registered, gotten through a human image trap, clicked on a link e-mailed to them, logged in and posted their spam re-post it like 2-3 times only to realize they got owned by my filter. They get all pissed off, and by that time a user has reported the post or we've seen it and banned them. It's very fun to make fun of them in their spam posts filled with ***s. :)
  • by sparkz (146432) on Wednesday November 08 2006, @04:07PM (#16775151) Homepage
    I've been doing a variation this for quite a while now on my phpBB forum [steve-parker.org]. There are bots which identify a phpBB forum and simply POST a user-account creation to the relevant page. This then adds their URL to the forum's memberlist page, improving their Google ranking.

    I won't stand for that, so the simple fix is to remove the "WEBSITE" input from the form. If "WEBSITE" gets POSTed along with the other data, I know it's a robot and post a message to kindly go away. Genuine users can edit their profile once the account is activated, if they want to plug their website.

  • use dnsbls (Score:4, Interesting)

    by joost (87285) on Wednesday November 08 2006, @05:30PM (#16776645) Homepage
    Shameless plug! I developed a plugin for Ruby on Rails that uses DNSBLs to combat form spam. (begin shameless self promotion)

    dnsbl_check rails plugin [spacebabies.nl]

    Basically what the plugin does is check clients against one or more DNSBLs. You might know them from mail servers. You see, it turns out that the forms are almost always abused by bots. These bots are quite well known. sbl-xbl from spamhaus catches 80% in my setup, spamcop catches the rest. You enable the plugin for key controllers and it really does work.

    (/end shameless self promotion) mod me down if you wish
    • Re:HTTP_REFERER (Score:5, Insightful)

      by Bogtha (906264) on Wednesday November 08 2006, @04:29PM (#16775545)

      A lot of legitimate users have the Referer header switched off or otherwise unavailable. Apart from the privacy factor, it's also common for "firewalls" [sic] to disable or change them.

      Of course, you wouldn't know this, because anybody who finds out is automatically banned from your website, so they don't have a chance to leave a comment or even find your email address letting you know about the problem.