Slashdot Log In
Evolution of the 'Captcha'
Posted by
CmdrTaco
on Mon Jun 11, 2007 07:36 AM
from the why-can't-i-even-read-them-half-the-time dept.
from the why-can't-i-even-read-them-half-the-time dept.
FireballX301 writes "The New York Times is running an article about the small word puzzles various sites use in order to defeat automated script registration while still letting humans through. It seems many people can't actually solve them anymore, so new alternatives (image recognition) are being created. This, of course, seems breakable as well — is there a feasible alternative to the captcha, or are we stuck jumping through more and more hoops to register at places?"
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
I am torn (Score:5, Funny)
Re:I am torn (Score:5, Funny)
Parent
Re: (Score:3, Funny)
Re: (Score:3, Interesting)
Th
Re:Unintelligent design (Score:4, Funny)
Is that like "Despite the fact that God created the Universe, people keep getting stupider"?
Or is it some sly jab at Windows?
Or maybe it's a scientific theory derived from studying governments!
Parent
Re:I am torn (Score:4, Funny)
That will sort the men from the bots.
Parent
Knowledge tests... (Score:3, Interesting)
'Germany is a country in Africa?'
Your duty to prove you were human was to change it to the proper continent and the question mark to a period. Seems pretty fool proof, especially if you combine it with things like "and make 'country' all capitals."
Re:Knowledge tests... (Score:5, Funny)
Parent
Re:Knowledge tests... (Score:4, Funny)
Parent
Re:Knowledge tests... (Score:5, Funny)
And then you voted for Bush, TWICE!!!!!!
Parent
Re: (Score:3, Funny)
Re:Knowledge tests... (Score:5, Funny)
Which is a food?
A) pink
B) car
C) Britney Spears
D) Hamburger
There is of course the possible registration by a disturbed and horny male who would say "Britney Spears" but you get the idea.
Parent
Re:Knowledge tests... (Score:5, Funny)
Which is a food?
A) pink
B) car
C) Britney Spears
D) Hamburger
There is of course the possible registration by a disturbed and horny male who would say "Britney Spears" but you get the idea.
Parent
Re:Knowledge tests... (Score:4, Funny)
Parent
Re:Knowledge tests... (Score:5, Funny)
Parent
Re:Knowledge tests... (Score:5, Insightful)
Parent
Re: (Score:3, Interesting)
Re: (Score:3, Informative)
Re: (Score:3, Informative)
Alternative? (Score:3, Insightful)
Re:Alternative? (Score:5, Insightful)
You give script your email address, it sends you an email and you follow a validation link within the email. Implementing this on my website where I had a captcha before got rid of 100% of the spam.
There are also other little dirty tricks you can do to ensure it's a human on the other end, one of my favorites is to check the referrer URL when accepting a comment... if it's not being referred from my entry forum then it just happily throws the request away. Even if it's not spam it's probably something malicious anyway.
Another thing I used to use that worked really well in conjunction with registration is "approving" any account in which the first post doesn't contain any links or any words on a "spam list". If the first post of the newly registered account contains any links or spam words at all, it's held for moderation and must be approved manually. A vast majority of the legit people leaving comments for the first time wont be including any links or talking about viagra on a tech site, no links or spam words means they've been validated as "not spam" and if they've included links it only takes a human a few seconds to qualify if the account should be canceled as spam or approved as a non-spam account. This one obviously takes some man power so it only really works on smaller sites. It might be easy for a spam bot to counteract this but the way it validates is not apparent, not to mention this is already after an email has been validated.
Parent
Re: (Score:3, Interesting)
Email validation requires people to give you something -- their email address -- that may consider more valuable that the ability to post on your forum. You'll lose all those people, who are probably rather more numerous than those who would be turned away by an annoying captcha.
In addition, email response is far more automatable than captchas. I am currently experimenting with an automated confirm-l
Re: (Score:3, Insightful)
Get over yourself.
If you're building a community forum where your visitors are likely to be repeat customers then IMO a more formal registration is appropriate.
How many people do you really think come to your website thinking, "Today I am going to join a community!"? Joining a community is not something people carefully plan out doing, it's something that happens if they try it out for a while an
Re:Alternative? (Score:4, Funny)
Parent
Re:Alternative? (Score:4, Interesting)
I read an advertisement recently -- apparently someone is collecting the URLs of web forum signup pages and then selling them to the botnets. I was thinking that maybe we could come up with a way of randomizing the signup page URL so that it would only work when the link is actually clicked on, but never got around to it. And let's be honest -- they'd figure that out too. *sigh*
Parent
Re:Alternative? (Score:5, Insightful)
Parent
Re: (Score:3, Informative)
(Or maybe I misinterpreted).
Re: (Score:3, Insightful)
Meanwhile, you have shut out all users who do not speak English well can can't figure out your instructions.
Re: (Score:3, Insightful)
Your search space wouldn't be large enough -- you can only have a limited number of photographs, since they have to be manually generated, and once the correct answers have been identified the captcha-breaking algorithm would reduce to "which image is closest to something in this set", a fairly trivial image-matching problem. This
Great idea (Score:3, Insightful)
Are they chosen for any good reason, or are they completely arbitrary? Are there letters that bots have trouble with? Fonts? Who knows?
The only thing that's sure is that every protection will eventually be broken.
What's more, maybe if you can't solve a simple word puzzle, I don't want you registering at my site...
Re:Great idea (Score:5, Insightful)
I have perfect vision and I struggle to tell if some S/5/Zs are one of the letters. The fonts and distortion is getting worse and worse to the point where it's usually 2 or 3 attempts before I can get one correctly, purely because letters are so distorted in them these days.
Parent
Re:Great idea (Score:5, Insightful)
Meanwhile, having an automated system feed them to Chinese people on $0.50 an hour can't be too hard, and they'll have at least as good a chance of getting the correct result as I do.
Parent
Re: (Score:3, Interesting)
Heh, I remember once having to enter some cryptic captcha string into a text field at rapidshare or some nameless file hosting service. I think the problem with it was there was no discrimination between O and zero, or something to that extent. Anyway, the captcha sucked so much I misread it three times, in which the site replied with "You are a bot!" and shut me out of the system. Funny way of showing appreciation and respect to customers.
By the way - since I started typing on this subject - I run a coup
Inverted problem (Score:5, Funny)
MAN, I feel clever some times.
See you in court? (Score:5, Funny)
Parent
Captcha too hard (Score:5, Insightful)
I am not sure if a picture is better, but it is defintively a step forward if I don't have to spend 5 time retrying.
Re:Captcha too hard (Score:5, Funny)
And dyslexic.
Parent
worst captchas ever (Score:3, Funny)
Stop testing the Humans, test the Robots (Score:5, Insightful)
We know that humans are more intelligent than scripts, so I always thought it should be easier to test the lack of intelligence in scripts than proving intelligence in humans.
For example just use a simple honeypot in a html form. Put a dummy input field in a form. You can hide the field with CSS/noscript tag or just mark it: "This field should be left intentionally blank" or something of that nature to make it more human friendly.
Seeing that all form fields are generally blank, the spambot/script will fill your dummy field. On server side check if the field has data, ignore the submission. It would be a VERY intelligent script that could COMPREHEND the purpose of any particular html input field.
my anonymous 2c
Re:Stop testing the Humans, test the Robots (Score:5, Interesting)
Block any IP submitting a non-blank "name" or "password" field.
Parent
Re:Stop testing the Humans, test the Robots (Score:5, Informative)
Parent
Re:Stop testing the Humans, test the Robots (Score:4, Informative)
Not really, considering that most of these scripts are targeted at large sites (yahoo, hotmail, etc) OR common site frameworks (PhpNuke, Drupal, Blogger, etc) where common hidden field input patterns would very quickly be tested and coded around by the script writers. The whole point of CAPTCHA in the first place was that it presented a random and dynamic test which was easy enough for users to solve (at least in theory) while hard enough to foil simple analysis by script. This might work on a small custom website where it is not worth the trouble of the script writers to code a version specifically for the hidden input pattern of your site, but this hidden field stuff was tried and failed on big sites even before CAPTCHA was in common use.
Parent
Digital Certificates are the answer (Score:3, Insightful)
Filtering by reputation (Score:3, Interesting)
Why don't sites band together, share data on abusive registrants, and require each new registrant to provide "references" in the form of their logins to 3-5 other sites. A person with a normal online life could easily demonstrate a pattern of nonspammy behavior. People with no prior history might be placed on probation (their posts are reviewed and may not contain any link-like data). If a registrant posts spam they temporarily (or permanently) lose their accounts on that site and all connected sites.
At some point in time, the only thing that will work is a system that tracks the identity behind the account, assigns a reputation and ostracizes miscreants.
Captcha effectiveness isn't related to difficulty (Score:5, Interesting)
Shamus Young (the creator of the "DM of the Rings") recently introduced a captcha on his site to deal with comment spam. In his post about using a captcha [shamusyoung.com] on his site, he notes that:
Emphasis mine. He's running a fairly popular site, and using a captcha based off of a single, unchanging, three-character phrase. Just the presence of the captcha was enough to effectively eliminate his spam problem. The indication seems to be that just the presence of a captcha is enough to keep spam off of even a moderately popular site.
Captcha wastes (human) time and frustrates users (Score:4, Interesting)
Take your average HTML form:
Rather than have 1 textbox for a field value, have 10. UserName1, UserName2, UserName3, etc.
Use javascript to randomly assign one of them as visible. The rest are hidden from the user.
On the server, watch to see which textbox is filled. Presumably, with decent enough javascript skills, and stupid enough bots, your humans will fill out what they see, which is the correct combination. The bots won't.
Granted, this method can be defeated if the bot checks for field level visibility after the page finishes loading, but even then, with decent enough javascript, you can continue to provide unobtrusive checks to ensure that your user is real -- e.g., unless the bot is running a macro through a web browser itself, your onblur events probably won't be tripped. And so on.
This puts a burden on the developers to come up with clever ways of defeating the bots, but in reality, that's where the battle is -- html application devs. vs spambot devs. Users shouldn't have to be dragged into the middle.
audio captcha (Score:3, Informative)
Especially with provisions of Section 508 [wikipedia.org] and the ADA [wikipedia.org] (and foreign counterparts) that ban discrimination against blind people, who use computers through screen readers that render text as speech or braille.
some sites are including an audio option.
examples are here [captcha.net] (under Guidelines > Accessibility) and here [accessibilityblog.com]
Bugmenot wants to join our b&. (Score:3, Insightful)
Re: (Score:3, Funny)
Re: (Score:3, Insightful)