Slashdot Log In
Web Users Angered by Anti-Spam 'Captcha'
Posted by
Zonk
on Thu Jun 01, 2006 10:36 AM
from the web-user-smash dept.
from the web-user-smash dept.
Carl Bialik from WSJ writes "Captchas -- the jumbles of letters that users must type to gain access to some websites -- are a growing irritation, the Wall Street Journal reports. But programmers hope to make new variations that are both easier to decipher and harder to crack. From the article: 'Some captchas have been solved with more than 90% accuracy by scientists specializing in computer vision research at the University of California, Berkeley, and elsewhere. Hobbyists also regularly write code to solve captchas on commercial sites with a high degree of accuracy. ... Henry Baird, a professor of computer science at Lehigh University who studies PC users' responses to the codes, has been working with colleagues to develop new generations of captchas that are designed to be easier on humans but baffling for computers.'"
Related Stories
[+]
Defeating Captcha 430 comments
An anonymous reader pointed us at PWNtcha, a package that breaks various on-line captcha algorithms. The site provides numerous examples of easy (Paypal, and an older version of Slashdot make the list) and hard Captcha. It also links various sources explaining why Captcha is a bad idea.
[+]
Ask Slashdot: How Would You Design a Captcha for the Deaf-Blind? 99 comments
kesuki asks: "Right now, the state of the art captcha only works for the visually-abled. Some people are trying to start a grass roots opposition to catcha using existing anti-discrimination laws. However, without any captcha at all, spammers would have a field day. Audio captcha would work for the blind, of course, but they still leave out the deaf-blind using brail interpreters to use their computers and navigate the web. What system of captcha can you dream up that would work for the deaf-blind?"
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
What? (Score:5, Funny)
Re:What? (Score:4, Informative)
Parent
To read this comment enter the text (Score:5, Funny)
I prefer kitten auth [kittenauth.com].
Re:To read this comment enter the text (Score:4, Interesting)
That still leaves things like manually capturing every possible unique base kitten image, then doing a pixel-by-pixel comparison and marking everything mostly matching as a kitten. It can be slowed down by changing the brightness or tint of the overall image slightly, but too much would make the image unrecognizable.
It would be more interesting to combine several ideas. Rather than "click on the kitten" have each picture marked with a random letter, and "enter the letters of the pictures with kittens". Or maybe change it up, pick brown kittens or black kittens or white kittens, kittens playing with a ball, etc.
Parent
Image Key Sets & Dynamic Captchas (Score:5, Informative)
In order to use the p0rn site he ran, you had to either pay money or spend time identifying captchas. He would then store them in a database and match it up with a checksum of the image. When he had completed a site's captcha key set, he would sell these lookup tables to anyone with money.
All they then had to do was write their program to do a checksum of the image (or the image itself if he had stored it) and then plug the word from the database into the page for verification.
With the introduction of splashers that spatter the statically stored images with lines or dots, the image is stored and a something like an edit distance is applied to it to find the closest match. Once that is accomplished, it references the keyword out of the database. You turn up the splasher and you risk the user not being able to figure out the word.
It seems that evil always finds a way. This is why captchas should always be dynamically generated on the fly from a very large dictionary! Check out Securimage for PHP [hotscripts.com].
Re:Image Key Sets & Dynamic Captchas (Score:3, Interesting)
I spent some time working on an alternative to captcha, I call AOMIS. http://aomis.net./ [aomis.net.] I haven't had a chance to work on it for a while, but the basic idea was, provide a piece of media, the user must identify the content.
In most cases, it would be an image. So, I might show you a picture of an elephant, and to submit the form, the user would have to enter 'elephant' into the box. Each image would have a number of correct answers to account for common spelling mistakes, and the most common correct r
Re:Image Key Sets & Dynamic Captchas (Score:5, Interesting)
The second approach was simply to set up captcha solving sweatshops somewhere in Asia with cheap labor, with people paid a few cents an hour to sit and solve captchas all day. This brought the cost of a new email address up to something like 1/3 cent, which for many spammers is still a viable price. The cost does limit this approach, though, so the captcha still helps.
The interesting thing about both of these strategies is that they use humans to solve a problem that is difficult for computers, which is von Ahn's research area - he's also one of those behind The ESP Game [espgame.org] (caution - this can be shockingly addictive). There's essentially nothing that can be done to defeat either approach without also making a system a huge pain in the ass for legitimate users. From this point of view, spending time trying to come up with more advanced captchas is kind of pointless.
Parent
90% accuracy? Not bad. (Score:5, Funny)
Hell, that's better than my average. They are getting so cryptic, it seems I get them wrong about 25% of the time these days.
-josh
Re:90% accuracy? Not bad. (Score:5, Funny)
Parent
I often fail those Turing tests (Score:4, Funny)
Different method entirely (Score:5, Interesting)
Which of these is a number: A 2 R P?
Seems that regardless of what they come up with there's going to be some part of the population that won't figure it out anyway, and if the whole point is to confuse auto-registerers, then I'd think it'd be harder for those to account for every possible question and answer set.
(Yea, it's in TFA, but mentioned like an aside...)
Re:Different method entirely (Score:5, Funny)
Or, even better, put it to music - and add a time limit!
"One of these things is not like the others,
one of these things just doesn't belong.
Can you tell me which thing is not like the others,
before I finish this song?"
Parent
captchas discriminate against the blind (Score:5, Interesting)
Re:captchas discriminate against the blind (Score:5, Funny)
Parent
captcha isn't that bad.... (Score:5, Insightful)
And even if you aren't blind, I've run into many a captcha that I couldn't decipher. Poorly designed sites may delete the entire content of your post if you fail the captcha, but I guess that's a design issue for another topic.
Re:captcha isn't that bad.... (Score:3, Interesting)
Sites should have alternate means, but even the ones that claim to have alternate means never really follow up on anyone.
News for Nerds? (Score:4, Informative)
- There are things called 'Captchas'
- People don't like them
- Computers are getting better at cracking them
- Some boffins are trying to make new ones which people like and computers don't
Really, that's all there is.Re:News for Nerds? (Score:5, Interesting)
What's wrong with an article being a spark for more in-depth discussion? How else are things rarely discussed in the media and never in depth (like most tech topics) going to be discussed on slashdot?
Sure, I know this post (and the parent) are off-topic, but it bugs me when people think that the purpose of slashdot is just to accumulate articles... that's what RSS feeds are for.
The discussion is what keeps me coming back, and typically, no matter how moronic the article is, there are several posts that give the kind of information that I wish was included in the article (but isn't). At the very least, people provide links to more comprehensive information and/or discussion of the issues concerned.
Parent
Not the point (Score:3, Interesting)
Re:Not the point (Score:5, Insightful)
The paradox is, if a site has one that works really well for them, other sites will want to use it as well. As other sites use similar or identical systems, it becomes exponentially more beneficial for crackers to crack. So, as soon as something's good enough to use, it becomes good enough to crack.
Parent
The human factor (Score:5, Funny)
If I wanted to be really sadistic, I could instead present site readers with a sentence, in which they have to fill in either "their," "there," or "they're."
Re:The human factor (Score:5, Funny)
Your a looser for even sugesting such a thing!
Parent
20% error rate (Score:3, Informative)
That's amazingly high. 1 in 5 CAPTCHA's are incorrectly entered by humans doing their best to do the right thing.
No wonder people get mad at them.
John.
Server in the Middle (Score:5, Interesting)
This is v1.0 of the Matrix, where human brains are harnessed to solve problems by a more powerful and wise, though less "intelligent" computer network.
Captcha is a nice idea but... (Score:5, Insightful)
HOWEVER. A short and simple multiple-choice or true-false quiz might determine with some level of accuracy if the poster is a person or not. Simple stuff like a random image of a sheep, a lion, a bear or a whale with a radio button selection below it. It's easy to run through, it shouldn't require much skill from the user and has the potential to confuse interpreting software a lot more.
This approach could also even be ENTERTAINING to the user in that funny pictures could be used in the image interpretation drill. Such questions could be "Is this person having a good day?" and you can put all manner of interesting images in there for a true-false scenario. Being an entertaining method will definitely win fans. Being tedius, stressful and mistakable will lose fans.