ReCAPTCHA.net Now Vulnerable to Algorithmic Attack

Catch up on stories from the past week (and beyond) at the Slashdot story archive

ReCAPTCHA.net Now Vulnerable to Algorithmic Attack 251

Posted by timothy on Thursday August 05, 2010 @04:59PM from the bless-you! dept.

n3ond4x writes "reCAPTCHA.net algorithms have been developed to solve the current CAPTCHA at an efficacy of 30%. The algorithms were disclosed at DEFCON 18 over the weekend and have since been made available online. Also available is a video demonstration of random reCAPTCHA.net CAPTCHAs being subjected to the algorithms." There's probably an excellent Firefox plugin to render this page's color scheme more bearable. Note: the PowerPoint presentation linked opens fine in OpenOffice, and the video speaks for itself.

This discussion has been archived. No new comments can be posted.

ReCAPTCHA.net Now Vulnerable to Algorithmic Attack

Load All Comments

Search 251 Comments Log In/Create an Account

Comments Filter:

colours (Score:2, Funny)

by orange47 ( 1519059 ) writes:

"There's probably an excellent Firefox plugin to render this page's color scheme more bearable."
just select all page, its better.
- Re:colours (Score:5, Informative)
  
  by electrostatic ( 1185487 ) writes: on Thursday August 05, 2010 @05:17PM (#33154916)
  
  "...an excellent Firefox plugin to render this page's color scheme more bearable."
  
  Yep. Color Toggle
  
  https://addons.mozilla.org/en-US/firefox/addon/9408/ [mozilla.org]
  
  I have it set so Ctl-Shift-Z set light yellow background, black text, and blue links.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by commodoresloat ( 172735 ) writes:
    
    sweet! Perfect for reading slashdot discussions in this IT color scheme! It's hilarious that a slashdot summary in this section is complaining about color schemes on other pages... glass houses and all
  - Re: (Score:2)
    
    by SkyDude ( 919251 ) writes:
    
    Just hit Ctrl-a and all the text shows up just fine.
Human Success? (Score:5, Insightful)

by Anonymous Coward writes: on Thursday August 05, 2010 @05:03PM (#33154776)

So what is the average human success rate? I think mine is only about 50%

Share
twitter facebook
- Re: (Score:2, Informative)
  
  by Anonymous Coward writes:
  
  Mine is 100%. Recaptcha is probably one of the easiest captcha I've ever had to deal with; something is wrong with you, sorry.
  - Re:Human Success? (Score:4, Insightful)
    
    by Kalriath ( 849904 ) writes: on Thursday August 05, 2010 @11:41PM (#33157972)
    
    Yeah, I agree with this. Recaptcha is one of the easiest out there.
    Admittedly though, I have around about 3% success rate with vBulletin captchas. Hear that forum owners? I'm not joining your forum because I can't read your captcha!
    
    Parent Share
    twitter facebook
- Re: (Score:2)
  
  by artg ( 24127 ) writes:
  
  So, is there a firefox plugin that fills in captchas for me ?
My eyes! (Score:3, Funny)

by Yvan256 ( 722131 ) writes: on Thursday August 05, 2010 @05:04PM (#33154782) Homepage Journal

The goggles, they do nothing!

Share
twitter facebook
- Re: (Score:2)
  
  by billstewart ( 78916 ) writes:
  
  Dude, if you're getting old enough to need reading glasses, just get them....
  There are some really bad CAPTCHAs out there - recapcha is one of the more human-readable ones, but sometimes just magnification isn't enough.
- - Re:My eyes! (Score:5, Funny)
    
    by SomeJoel ( 1061138 ) writes: on Thursday August 05, 2010 @05:40PM (#33155208)
    
    Did you not learn when I explained this yesterday? The quote is: "My eyes! The goggles do nothing!". There is no "they", nor is there any bad pronunciation. Indeed, it is correctly articulated and enunciated, with an accent.
    Easy there champ, nobody appreciates a Family Guy nerd correcting everyone's quotes.
    
    Parent Share
    twitter facebook
    - Re:My eyes! (Score:5, Funny)
      
      by SomeJoel ( 1061138 ) writes: on Thursday August 05, 2010 @07:53PM (#33156304)
      
      Judging from the other replies, meta-humor is a little hard for you guys...
      
      It works wonders though. For instance, the next time someone is talking about "the force" or jedis and such, tell them "Get a life, Star Trek sucks!". You'll find the reaction much more interesting than if you correctly identify the franchise.
      
      Parent Share
      twitter facebook
    - - Re: (Score:2)
        
        by sexconker ( 1179573 ) writes:
        
        Except those guys are simply bastardizing the Simpsons quote.
OCR improvements? (Score:3, Interesting)

by Anonymous Coward writes: on Thursday August 05, 2010 @05:05PM (#33154788)

Can these attack algorithms actually increase the accuracy of normal OCR programs?

Share
twitter facebook
- Re: (Score:2)
  
  by Chad Birch ( 1222564 ) writes:
  
  I haven't RTFA, but that's unlikely. With a captcha, you receive a response indicating whether you were correct or not. When using OCR, there isn't really any automated way to be sure if you've gotten it right.
- Re: (Score:2)
  
  by nizo ( 81281 ) * writes:
  
  Better living through spam!
- Re: (Score:2)
  
  by ottothecow ( 600101 ) writes:
  
  recaptcha was created to increase the accuracy of normal OCR programs...
  so technically the bots solving them would also be helping proof Project Gutenberg texts so long as they are getting both the test word and the book word correct.
  - Re: (Score:3, Informative)
    
    by AusIV ( 950840 ) writes:
    
    They're not. I saw the presentation these guys gave at DefCon (their presentation was about as painful as their website), and they're only getting the test word correct with about 30% accuracy. They're not completely sure about their success rates on book words, but they believe it to be considerably lower than the test words.
    - - Re:OCR improvements? (Score:4, Insightful)
        
        by Sparr0 ( 451780 ) writes: <sparr0@gmail.com> on Friday August 06, 2010 @03:44AM (#33158838) Homepage Journal
        
        The problem is that since you are *probably* solving the verification words with higher accuracy to begin with, you are actually poisoning the data being gathered regarding the book words. So, while a book word becoming a verification word based on your "solutions" will keep your solution rate constant, it actually damages the system when it comes time for humans to solve the CAPTCHA, or worse when the solutions are used as OCR corrections.
        To clarify, given a classically OCR-able "foo" and a non-OCR-able-but-human-readable "bar", a human is expected to recognize the slightly-deformed-by-reCAPTCHA "foo" and is trusted to get "bar" right more often than OCR would. This attack only defeats the deformation applied by reCAPTCHA, it doesn't actually improve the OCR on the non-deformed words, which means you are going to submit an answer of "foo ban" every time this pair is encounted (or "blah ban" for a different scenario), and the reCAPTCHA system is eventually going to decide that the book word really is "ban".
        
        Parent Share
        twitter facebook
  - Re: (Score:2)
    
    by Garble Snarky ( 715674 ) writes:
    
    Unless of course, all the bots use the exact same algorithm, and they all make the same mistake on the book words. Recaptcha uses consensus, right?
    - Re: (Score:2)
      
      by Peach Rings ( 1782482 ) writes:
      
      IIRC, as part of the marblecake time magazine vote thing, people submitted thousands of PENISes as the book word to try to get it inserted randomly into ebooks. The recaptcha people said they've anticipated such an attack and that it's not possible to influence final book word results.
Pretty cool stuff (Score:2)

by Monkeedude1212 ( 1560403 ) writes:

But that just means more spambots, right?
- - Re: (Score:2)
    
    by fyrewulff ( 702920 ) writes:
    
    On the other hand, it'll be easier to block the known spammers because fewer of them will be able to afford the hardware/sweatshop/botnet setups once the computational brute force needed increases.
    - - Re: (Score:2)
        
        by veganboyjosh ( 896761 ) writes:
        
        Useless to humans, maybe.
        
        Maybe not so much to Skynet.
        
        Re: (Score:2)
        
        by John Hasler ( 414242 ) writes:
        
        > Maybe not so much to Skynet.
        Then we just have to hope the spammers piss off Skynet.
      - Re: (Score:2)
        
        by Peach Rings ( 1782482 ) writes:
        
        Obviously the evolution of CAPTCHA science will be toward text (or audio) that is more easily recognized by humans but not by bots. It's not just a matter of making captchas really really hard.
Speaking about re-captcha (Score:4, Informative)

by imsabbel ( 611519 ) writes: on Thursday August 05, 2010 @05:08PM (#33154816)

I recently went to their homepage and looked _really_ hard for any statistics about which books are transcriped. I read their Science paper. Tried all sections.
Its all about the captcha part, and _nothing_ about the RE.
The way they state how it works ("We are using 100.000 unique words") sounds like they have given up on that part long ago and just recycle their old database again and again...

Share
twitter facebook
- Re:Speaking about re-captcha (Score:5, Informative)
  
  by icebraining ( 1313345 ) writes: on Thursday August 05, 2010 @05:14PM (#33154892) Homepage
  
  Currently, we are helping to digitize old editions of the New York Times and books from Google Books.
  http://www.google.com/recaptcha/learnmore [google.com]
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by MozeeToby ( 1163751 ) writes:
    
    You don't even need that, the attacker has access to everything, remember? They can just look at the file directly if it's predownloaded on the page or send the page the mouse over event for that element. I highly doubt that the people doing these algorithms are using a full web browser to pull and post data.
  - Re:Speaking about re-captcha (Score:5, Interesting)
    
    by imsabbel ( 611519 ) writes: on Thursday August 05, 2010 @05:49PM (#33155292)
    
    Hm.
    So its for-profit work for the biggest advertising firm in the world.
    Sort of expected project gutenberg or something.
    Too bad.
    
    Parent Share
    twitter facebook
    - Re: (Score:2)
      
      by Cyberax ( 705495 ) writes:
      
      As far as I understand, this data will be publicly available on Google Books.
      - Re: (Score:3, Insightful)
        
        by martin-boundary ( 547041 ) writes:
        
        Google books isn't really public, though. You can only view a small number of pages of each book, which is pretty useless from the point of view of public uses that come to mind.
        
        Re: (Score:2)
        
        by Cyberax ( 705495 ) writes:
        
        Google Books allows you to view and download entire books, if they are in the public domain.
        Example:
        http://books.google.com/books?id=Q_rLGDGlQz0C&printsec=frontcover&dq=Mark+Twain&hl=en&ei=HlNbTOrmKtWN4gb-q9j3AQ&sa=X&oi=book_result&ct=result&resnum=2&ved=0CCsQ6AEwAQ#v=thumbnail&q&f=false [google.com]
    - Re: (Score:3, Insightful)
      
      by bill_mcgonigle ( 4333 ) * writes:
      
      So its for-profit work for the biggest advertising firm in the world.
      Sort of expected project gutenberg or something.
      Google's digitizing hundreds of thousands of historic books from some of the great university libraries. What's the problem here, that they won't lose money on the effort?
      The NYT archive has been done for at least a year, it made reCAPTCHA a feasible company.
- Re: (Score:2)
  
  by Mashiki ( 184564 ) writes:
  
  Dunno. I've been seeing a lot of unique stuff recently like hebrew, chinese, japanese, and vertical lettering.
- Re: (Score:2)
  
  by al0ha ( 1262684 ) writes:
  
  >> The way they state how it works ("We are using 100.000 unique words") sounds like they have given up on that part long ago and just recycle their old database again and again...
  
  I think they recycle the database of words that are known, not the ones also proffered up that are unknown to the API, so the book building continues.
  
  Entering the known word gets you past the gate, and once you've done that they assume that you also correctly answered the word they don't know. As far as I understand t
Can the mouse cursor be positioned by a script? (Score:2)

by master_p ( 608214 ) writes:

If not, then the captcha should only be visible when the mouse cursor is over it.
The key to a successful captcha is to make it accessible only by a user sitting in front of the screen.
- Re: (Score:2)
  
  by Lehk228 ( 705449 ) writes:
  
  even if it couldn't be done normally, a hostile client could say the cursor is over the script just as easilly as it could place the cursor there.
- Re: (Score:2)
  
  by AusIV ( 950840 ) writes:
  
  As a couple of ACs have pointed out, the people breaking CAPTCHAs aren't using browsers, they're using scripts. They don't care if a DOM element is hidden, or if they have to make an extra ajax request of some sort. The scripts will be tailored to the CAPTCHA they're trying to break, and you can't keep a script from getting a hold of something that you plan to show a human.
- Re: (Score:3, Insightful)
  
  by IBBoard ( 1128019 ) writes:
  
  Remember, iPads and touch-screens can't do hover. Plus there's the whole disability accessibility aspect as well ;)
I'm a computer, apparently (Score:3, Funny)

by El_Muerte_TDS ( 592157 ) writes: on Thursday August 05, 2010 @05:11PM (#33154844) Homepage

It looks like that tool is better at deciphering the captchas than I am.

Share
twitter facebook
far from it (Score:4, Informative)

by MagicM ( 85041 ) writes: on Thursday August 05, 2010 @05:12PM (#33154858)

I'm watching the video, and the end result is "b:1/78 1.28% s:27/78 34.62%" indicating that out of 78 tests of two words per test it got a single word right 35% of the time, and both words right only once or 1% of the time.
Since both words need to be correct "solve the current CAPTCHA at an efficacy of 1%" would be closer to the truth.

Share
twitter facebook
- Re: (Score:3, Informative)
  
  by NegativeK ( 547688 ) writes:
  
  35% * 35% ~ 12%. And that ignores that one word is a known control, while the other is a word they're trying to OCR.
  - Re: (Score:3, Informative)
    
    by rm999 ( 775449 ) writes:
    
    You are right, there is no need to get both words right.
    But, your 35% * 35% calculation assumes the recognition difficulty of the words is independent, which is a bad assumption in this case; the OCR word is one that is known to be hard to guess. It is probably more like 35% * 5% or something.
    - Re: (Score:3, Insightful)
      
      by retchdog ( 1319261 ) writes:
      
      Interesting. If this is true as stated, and one knew/modeled OCR performance, you could use this information in some cases to pick out the plum and boost the crack...
      - Re: (Score:2)
        
        by retchdog ( 1319261 ) writes:
        
        meh. never mind. it'd only take twice as long at most, to just do your best on both. duh.
        i guess if there were a limited number of attempts you might use this to decide which ones to attempt vs. reload.
    - Re: (Score:2)
      
      by petermgreen ( 876956 ) writes:
      
      I seem to remember recapatcha claiming that if they think they are being screwed with they switch to sending two known words rather than one known and one unknown
- Re: (Score:3, Informative)
  
  by BarryJacobsen ( 526926 ) writes:
  
  I'm watching the video, and the end result is "b:1/78 1.28% s:27/78 34.62%" indicating that out of 78 tests of two words per test it got a single word right 35% of the time, and both words right only once or 1% of the time.
  Since both words need to be correct "solve the current CAPTCHA at an efficacy of 1%" would be closer to the truth.
  My understanding is that only one of the words needs to be correct, but it has to be the "right" one (reCAPTCHA presents two words one it's very certain it knows what it is and one it's less certain, you have to get the one that it's very certain of in order to pass).
- Re: (Score:2)
  
  by sexconker ( 1179573 ) writes:
  
  Only ONE word needs to be correct for recaptcha.
  There is a known word you are tested against, and an unknown word pulled from a database of shit they scanned.
  Solving the known word correctly means you probably also got the unknown word correct. They then pool the "correct" submissions for the unknown words and see what the most common ones are.
  I don't know if this is completely automated or if they have an intern monkey clicking "yes" or "no" for unknown words and probable solutions, but the whole "crowd s
- Re: (Score:2)
  
  by IICV ( 652597 ) writes:
  
  Not necessarily; I'm not sure exactly how reCAPTCHA works, but in theory they don't know one of the words - in fact, that other word may very well be unknowable, due to smearing or just not being a word (that happened to me the other day actually, I got one word and one thing that looked like a Farsi character). Thus, if you successfully guess the correct thing for the "known" word, it doesn't really matter what you guess for the "unknown" word as long as it's close or at least something a human might type.
- Re:far from it (Score:5, Informative)
  
  by hydrofix ( 1253498 ) writes: on Thursday August 05, 2010 @05:53PM (#33155330)
  
  Since both words need to be correct "solve the current CAPTCHA at an efficacy of 1%" would be closer to the truth.
  Actually, that is incorrect. The other word is already positively known by the OCR, and serves as a control, while the other is the one that the OCR could not read. It will of course only check the one that it knowns, and assumes the other one is then correct as well. So, if you get one of the words correct AND this is the same word that as their OCR identified correctly (which is very likely the case), then you pass, but most of the time (99%) give a bad answer for the harder, non-OCR word. Sadly, this leads to pollution of their database in the long run.
  
  Parent Share
  twitter facebook
- Re:far from it (Score:4, Informative)
  
  by Jorl17 ( 1716772 ) writes: on Thursday August 05, 2010 @09:20PM (#33157152)
  
  This is not informative. As many have said. If You read: http://www.google.com/recaptcha/learnmore [google.com] , you'll get it.
  
  Here is the deal: reCAPTCHA presents two words. One is picked by it and is previously known. The other one is a word from a book that has been scanned. Said word is unknown to the reCAPTCHA system. When the user enters both words, reCAPTCHA checks to see if the known word has been properly recognized. If that is the case, then reCAPTCHA can assume that a human is answering. Given that a human is answering, then the second unknown word given by the human is most likely correct, because he/she will be able to recognize it as well. Using this system, reCAPTCHA works as a CAPTCHA (spam prevention) mechanism and also helps transforming old books/papers into digital format, such as the New York Times.
  
  So, in practice, only one word has to be correct -- the word that reCAPTCHA knows. What's sad is that bots may contribute incorrect second words...
  
  Next time, get informed before going all crazy.
  
  And here is the relevant info, quoted from the aforementioned website:
  
  reCAPTCHA improves the process of digitizing books by sending words that cannot be read by computers to the Web in the form of CAPTCHAs for humans to decipher. More specifically, each word that cannot be read correctly by OCR is placed on an image and used as a CAPTCHA. This is possible because most OCR programs alert you when a word cannot be read correctly. But if a computer can't read such a CAPTCHA, how does the system know the correct answer to the puzzle? Here's how: Each new word that cannot be read correctly by OCR is given to a user in conjunction with another word for which the answer is already known. The user is then asked to read both words. If they solve the one for which the answer is known, the system assumes their answer is correct for the new one. The system then gives the new image to a number of other people to determine, with higher confidence, whether the original answer was correct.
  
  Parent Share
  twitter facebook
- - Re: (Score:2)
    
    by MagicM ( 85041 ) writes:
    
    All I'm saying is that just because the algorithm got 30% of the words right doesn't mean that it can "solve the current CAPTCHA at an efficacy of 30%".
    - Re: (Score:2)
      
      by MagicM ( 85041 ) writes:
      
      Actually I guess that's not what I'm saying, because I said "1%" which was wrong. You may consider my face egged.
    - Re: (Score:2)
      
      by sexconker ( 1179573 ) writes:
      
      All I'm saying is that just because the algorithm got 30% of the words right doesn't mean that it can "solve the current CAPTCHA at an efficacy of 30%".
      Yes, yes it fucking does.
      "Solving" a captcha - to an attacker or a legitimate user - means getting past the damned popup and creating your account, posting your /. obama poop copypasta troll, etc.
      Being correct with regards to the OCR means nothing.
Plugin not needed... (Score:4, Informative)

by knarf ( 34928 ) writes: on Thursday August 05, 2010 @05:13PM (#33154874)

There's probably an excellent Firefox plugin to render this page's color scheme more bearable
No plugin needed:
View->Use Style->None
That is what it looks like in Seamonkey, Firefox will be similar. This more or less always works.

Share
twitter facebook
- Re: (Score:2)
  
  by interval1066 ( 668936 ) writes:
  
  Or if you're using ff 3.6....; View->Page Style->No Style.
- Re: (Score:2)
  
  by c++0xFF ( 1758032 ) writes:
  
  This more or less always works.
  You're not joking: it even makes the Time Cube site somewhat readable!
Hmm (Score:5, Funny)

by Tailhook ( 98486 ) writes: on Thursday August 05, 2010 @05:15PM (#33154906)

Should I run the DEFCON presenter's giant SWF or not?
o_O

Share
twitter facebook
- Re: (Score:2, Funny)
  
  by machxor ( 1226486 ) writes:
  
  Why not. You run Firefox right? If yes then you have no worries because it's not full of hole like IE is...
  - Re: (Score:2)
    
    by Chad Birch ( 1222564 ) writes:
    
    You are disturbingly misinformed.
- Re: (Score:3, Insightful)
  
  by Monkeedude1212 ( 1560403 ) writes:
  
  I'm glad YOUR common sense kicked in before hundreds of others.
Bad Hacking (Score:5, Insightful)

by pz ( 113803 ) writes: on Thursday August 05, 2010 @05:16PM (#33154910) Journal

Why would anyone want to do this? It's like attacking the UN peace keeping troops or the Red Cross. reCAPTCHA is doing good work, digitizing scanned printed books so that the the text can be made available for online searching. Breaking reCAPTCHA is like defecating in the village well, ensuring that everyone suffers. No one benefits from reCAPTCHA being broken. No one.

Share
twitter facebook
- Re: (Score:3, Informative)
  
  by kyrio ( 1091003 ) writes:
  
  4chan already broke it.
  - Re: (Score:2)
    
    by cant_get_a_good_nick ( 172131 ) writes:
    
    4chan didn't quite break it, more like they broke time's form implementation. They did a lot of 'hacks' but most was on how Time handled the poll - they didn't use any CAPTCHA at the beginning, then took the form offline, but not the voting script, so 4chan voted well past the cut off time, will millions of monkeys voting.
    see reCaptcha blog [recaptcha.net] and this well written article [musicmachinery.com]
    - Re: (Score:2)
      
      by shird ( 566377 ) writes:
      
      No the OP is pretty much right. 4chan has now implemented reCaptcha, yet is still getting hammered with spam. Thus some spammer using 4chan has managed to find a way around it with a pretty good success rate.
- Re:Bad Hacking (Score:5, Insightful)
  
  by Dhalka226 ( 559740 ) writes: on Thursday August 05, 2010 @05:31PM (#33155094)
  
  No one benefits from reCAPTCHA being broken. No one.
  Spammers.
  
  Parent Share
  twitter facebook
- Re:Bad Hacking (Score:5, Insightful)
  
  by maxume ( 22995 ) writes: on Thursday August 05, 2010 @05:32PM (#33155106)
  
  Actually, it could be of use to reCAPTCHA, they can just pass their test words through this system before they make them public and then use the output to help prevent similar attacks.
  
  Parent Share
  twitter facebook
- Re: (Score:2)
  
  by Purity Of Essence ( 1007601 ) writes:
  
  Advertisers benefit. Or rather, people who sell advertising and SEO services and work automated lead/sales referral systems. Their clients are probably hurt by all the forum spam done in their name. Look around you. Wherever there is money being made, there are assholes joining in.
- Re:Bad Hacking (Score:4, Insightful)
  
  by Flyne ( 1082975 ) writes: on Thursday August 05, 2010 @05:42PM (#33155220)
  
  The problem of breaking reCAPTHCA is precisely the same problem as increasing computer OCR abilities, since reCAPTCHA by design uses words which current OCR abilities are inadequate for. This is a good thing for AI and computer vision and text digitization.
  
  Parent Share
  twitter facebook
  - Re:Bad Hacking (Score:4, Insightful)
    
    by Timmmm ( 636430 ) writes: on Thursday August 05, 2010 @07:10PM (#33155906)
    
    The problem of breaking reCAPTHCA is precisely the same problem as increasing computer OCR abilities
    No it isn't. Well, not unless you read books with wavy crossed-out words and don't mind 30% accuracy.
    
    Parent Share
    twitter facebook
- Re:Bad Hacking (Score:5, Insightful)
  
  by sbayless ( 1310131 ) writes: on Thursday August 05, 2010 @05:58PM (#33155364)
  
  No one benefits from reCAPTCHA being broken. No one
  You couldn't be more wrong. Sure, breaking reCAPTCHA would create a headache for website admins (including me, for example), but in order to break reCAPTCHA someone has to devise a better text recognition program. And that's great news! This is an example of a general side effect of the cat and mouse game that are captchas. Captcha's are a simple form of Turing Test, where website admins are trying to determine who is a computer and who is a real human being. Every time a captcha gets broken, we get a sophisticated new algorithm for doing something that previously only humans could do (or only humans could do well, at least).
  
  Parent Share
  twitter facebook
  - Re: (Score:2, Insightful)
    
    by mysidia ( 191772 ) writes:
    
    Except the algorithm doesn't really do that... to defeat the captcha, it only needs to get it right about 10 or 20% of the time, to give the malicious script a "good enough guess" to brute-force the Captcha with 5 or 6 retries.
    As long as the number retries are less than those the a fair percentage of humans require....
- Re: (Score:2)
  
  by cant_get_a_good_nick ( 172131 ) writes:
  
  It's not about breaking reCaptcha, it's about avoiding the reCaptcha hurdle on all the sites that use it. If a site put up a captcha, there's some resource it's protecting that other people want. This is a way to get it in a bulk way, therefore economically cheaper.
  And you think that a person who can benefit with a fat check will care about some abstraction that they're polluting the village well? For money, people sell drugs that kill people. This is nothing compared to that.
- Re:Bad Hacking vs Penetration Testing (Score:2)
  
  by billstewart ( 78916 ) writes:
  
  If reCAPTCHA's too easily breakable, then Bad Guys will figure out how, and will start exploiting sites that use reCAPTCHA for protection.
  So we need to know how vulnerable it is, and the reCAPTCHA folks need to figure out how to fix it. It's an arms race, always has been, probably always will be.
- - Re: (Score:2, Insightful)
    
    by mysidia ( 191772 ) writes:
    
    reCaptcha, and indeed all Captchas have a fundamental flaw.... advances in computer vision will eventually render them all obsolete.
    Most of the CS knowledge is already around to totally defeat captchas of this sort... it's only an Engineering question. They will most likely get broken when sufficiently unethical engineers are hired by sufficiently wealthy spammers.
    It's basically a known fact, that spammers will eventually break conventional captchas totally, by developing algorithms to guess captcha
Readability (Score:2)

by pgn674 ( 995941 ) writes:

There's probably an excellent Firefox plugin to render this page's color scheme more bearable.
I like using a Readability bookmarklet in my bookmarks bar: Readability - An Arc90 Lab Experiment [arc90.com]
Is this related? (Score:5, Interesting)

by Khyber ( 864651 ) writes: <techkitsune@gmail.com> on Thursday August 05, 2010 @05:48PM (#33155290) Homepage Journal

Anybody that pays attention to 4chan recently knows they had to implement captcha due to a massive spamflood of infected morons. recaptcha got busted thanks to someone in /g/ who leaked the vulnerability in the sound system for reCAPTCHA, and the whole site was again inundated with spam, though not to the degree as the original spam attack.

Share
twitter facebook
- Re: (Score:2)
  
  by Dice ( 109560 ) writes:
  
  The audio vulnerability is unrelated, and more effective than the algorithm presented in TFA.
- Re: (Score:2)
  
  by Monkeedude1212 ( 1560403 ) writes:
  
  Anybody that pays attention to 4chan
  What, all 200 of them?
How is this 30% accurate??? (Score:4, Insightful)

by mwvdlee ( 775178 ) writes: on Thursday August 05, 2010 @06:02PM (#33155406) Homepage

When it is claimed to be 30% accurate, I'd expect some 30% of all captchas being correcly guessed. Watching the video, I noticed the algorithm gives itself 30-40% scores for getting just one of the two words right or sometimes even for getting the right length and a few correct letters. Didn't watch it to the end, but in the few minutes I watched, ZERO entire captcha's were solved. So that's ZERO% acurate in my book. For instance, actual captcha text "ware readiness", guessed captcha "votarry rehabbed", reported accuracy 38.24%... how the hell is that over 38% accurate? If you had that level of accuracy when trying to get past a captcha (which is pretty much the definition of it being vulnerable, right?), you wouldn't get past a single captcha. it's 30% accurate if it correcly guessed about 3 out of every 10 captcha's, not if it fails every single captcha.

Share
twitter facebook
- Re: (Score:2)
  
  by Timmmm ( 636430 ) writes:
  
  You only need to get one (specific) word right.
- - Re: (Score:2)
    
    by wdavies ( 163941 ) writes:
    
    mod this up. I hadn't gotten the implication of the exploit either until now.
%30 seems about right (Score:2)

by dilvish_the_damned ( 167205 ) writes:

since thats about the accuracy of a human
New Human Verification Scheme (Score:4, Interesting)

by BlueMonk ( 101716 ) writes: <BlueMonkMN@gmail.com> on Thursday August 05, 2010 @08:39PM (#33156812) Homepage

Seeing this article gave me an idea to come up with a new human verification process. I created a C# program in about an hour that loads images from Google images based on searching for 3 of 2000+ nouns. It shows 3 examples of each noun and asks the user to pick the correct noun from a list of 6. This program is just a proof of concept of course. Could this become useful? (Binary and source code included.)
http://enigmadream.com/misc/HumanVerification.zip [enigmadream.com]

Share
twitter facebook
- Multiple choice doesn't work for CAPTCHAs (Score:3, Insightful)
  
  by mrnobo1024 ( 464702 ) writes:
  
  The spammers can just choose a random option until they get in. All that will do is slow them down a bit.
- Re: (Score:3, Interesting)
  
  by KahabutDieDrake ( 1515139 ) writes:
  
  If you used something that wasn't a public resource based around text strings, then yes.
  
  Better still... show a bank of images, ask which one has a happy little girl in it. (all images contain a girl, only one obviously happy). Randomize the backend with a cryptographic routine (so the file names don't give anything away) and you are set for a while. Computers are terrible at such things, people are pretty good at it.
Let's hope they hit 100% (Score:2, Interesting)

by drinkypoo ( 153816 ) writes:

Then we can just put reCAPTCHA on all pages being used for spam, and get transcription services for free.
- Re: (Score:2)
  
  by MobileTatsu-NJG ( 946591 ) writes:
  
  A Firefox extension is not the same thing as a plugin.
  B.F.D.
- - Re:Offtopic (Score:4, Informative)
    
    by Anonymous Coward writes: on Thursday August 05, 2010 @05:29PM (#33155078)
    
    No, Firefox addons used to be called extensions, plugins are still plugins.
    
    Parent Share
    twitter facebook
  - Re: (Score:3, Informative)
    
    by Cougar Town ( 1669754 ) writes:
    
    Wrong. Plugins have been around since Netscape and are still called plugins. They have a different function than an extension (and an extension is what we would want in this case to fix the site's colours).
    Both plugins and extensions, along with themes, are collectively referred to as "addons." "Plugin" is the wrong word in the summary. "Extension" or "addon" would have been acceptable.
  - Re: (Score:2, Funny)
    
    by CarpetShark ( 865376 ) writes:
    
    A Firefox extension is not the same thing as a plugin.
    Firefox plugins ***used*** to be called Firefox extensions. You must just be too young to know this.
    It's a bit like watching scary women fight over who was married to some guy first on Ricky Lake.
- - Re: (Score:2)
    
    by Peach Rings ( 1782482 ) writes:
    
    reCAPTCHA isn't bad, but Google's captchas are so hard they're probably more easily solved by learning algorithms than actual human beings.
- Re: (Score:2)
  
  by sugarmotor ( 621907 ) writes:
  
  You wrote, "There is more than enough written and audio samples that the world would love to see OCR'ed." -- Where do you get those?
- Re:So many better ways than recaptcha (Score:4, Informative)
  
  by JesseMcDonald ( 536341 ) writes: on Thursday August 05, 2010 @05:49PM (#33155304) Homepage
  
  There is ZERO reason to use worthless tests like these as opposed to using real identification. That is instead of using computer generated difficult test, use actual pictures of actual 'difficult text' that an OCR agent failed to identify. Each person is given one alread tested sample and one unknown sample. If you get the already tested sample, then your answer is accepted as 'probable' correct for the unknown sample.
  Congratulations, you've just described ReCAPTCHA! This is exactly how the current system works.
  
  Parent Share
  twitter facebook
- Re: (Score:2)
  
  by Lunix Nutcase ( 1092239 ) writes:
  
  In other words.... use reCAPTCHA?
- Re: (Score:2)
  
  by sexconker ( 1179573 ) writes:
  
  The percentages shown are a running total of all the captchas tested against in that run.
  b is the % of cases where BOTH words were correctly recognized
  s is the % of cases where AT LEAST ONE word was correctly recognized
  You only need to know ONE word to pass a recaptcha captcha. Though it has to be the CORRECT word, and I don't know if the developers of this program knew which word was known, or if they took that into account when displaying the percentages.
  The worst case scenario is that they can solve it
- Re:My eye's... (Score:5, Funny)
  
  by Peach Rings ( 1782482 ) writes: on Thursday August 05, 2010 @08:52PM (#33156914) Homepage
  
  You know a hacker is hard core when his site is monochrome in a monospace font, and he saves his files as straight up docx.
  
  Parent Share
  twitter facebook
  - Re: (Score:3, Insightful)
    
    by Peach Rings ( 1782482 ) writes:
    
    By the way, that wasn't just a facetious comment. TFA isn't a serious paper. It's not even typeset, just typed into Microsoft Word. And god knows why I'm being warned about VBScript macros when I try to open it.
    And this isn't a case where the little guy is making real scientific progress right under the nose of the obsolete establishment. The author doesn't even have a freshman understanding of big-O notation, it's completely juvenile.
- Comment removed (Score:5, Funny)
  
  by account_deleted ( 4530225 ) writes: on Thursday August 05, 2010 @09:41PM (#33157298)
  
  Comment removed based on user account deletion
  
  Parent Share
  twitter facebook

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

colours (Score:2, Funny)

Re:colours (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Human Success? (Score:5, Insightful)

Re: (Score:2, Informative)

Re:Human Success? (Score:4, Insightful)

Re: (Score:2)

My eyes! (Score:3, Funny)

Re: (Score:2)

Re:My eyes! (Score:5, Funny)

Re:My eyes! (Score:5, Funny)

Re: (Score:2)

OCR improvements? (Score:3, Interesting)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3, Informative)

Re:OCR improvements? (Score:4, Insightful)

Re: (Score:2)

Re: (Score:2)

Pretty cool stuff (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Speaking about re-captcha (Score:4, Informative)

Re:Speaking about re-captcha (Score:5, Informative)

Re: (Score:2)

Re:Speaking about re-captcha (Score:5, Interesting)

Re: (Score:2)

Re: (Score:3, Insightful)

Re: (Score:2)

Re: (Score:3, Insightful)

Re: (Score:2)

Re: (Score:2)

Can the mouse cursor be positioned by a script? (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3, Insightful)

I'm a computer, apparently (Score:3, Funny)

far from it (Score:4, Informative)

Re: (Score:3, Informative)

Re: (Score:3, Informative)

Re: (Score:3, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3, Informative)

Re: (Score:2)

Re: (Score:2)

Re:far from it (Score:5, Informative)

Re:far from it (Score:4, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Plugin not needed... (Score:4, Informative)

Re: (Score:2)

Re: (Score:2)

Hmm (Score:5, Funny)

Re: (Score:2, Funny)

Re: (Score:2)

Re: (Score:3, Insightful)

Bad Hacking (Score:5, Insightful)

Re: (Score:3, Informative)

Re: (Score:2)

Re: (Score:2)

Re:Bad Hacking (Score:5, Insightful)

Re:Bad Hacking (Score:5, Insightful)

Re: (Score:2)

Re:Bad Hacking (Score:4, Insightful)

Re:Bad Hacking (Score:4, Insightful)

Re:Bad Hacking (Score:5, Insightful)

Re: (Score:2, Insightful)

Re: (Score:2)

Re:Bad Hacking vs Penetration Testing (Score:2)

Re: (Score:2, Insightful)

Readability (Score:2)

Is this related? (Score:5, Interesting)

Re: (Score:2)

Re: (Score:2)