Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Spam Businesses Google The Internet

Has Google Broken JavaScript Spam Munging? 288

Baxil writes "For years now, Javascript munging has been a useful tool to share email addresses on the Web without exposing them to spammers. However, Google is now apparently evaluating Javascript when assembling summary text for web pages' listings, and publishing the un-munged email addresses to the world; and spammers have started to take advantage of this kind service." Anyone else seen this affecting their carefully protected email addresses?
This discussion has been archived. No new comments can be posted.

Has Google Broken JavaScript Spam Munging?

Comments Filter:
  • *rolleyes* (Score:5, Insightful)

    by Anonymous Coward on Tuesday June 23, 2009 @02:35PM (#28442663)

    Seriously, queue the obfuscation != security thing. If your email address is carefully protected, it is not displayed on a web page, obfuscated or not.

  • Really.... (Score:5, Insightful)

    by Darkness404 ( 1287218 ) on Tuesday June 23, 2009 @02:37PM (#28442713)
    Really with the development of better OCR technologies and such comes the elimination of e-mail security by obscurity. If you don't want spam either A) have a decent spam filter (I don't think I've had a single piece of spam pass through G-mails filter and only one false positive) or B) don't share your e-mail address. Those are the only two ways to prevent spam that will continue to work.
  • by RichardDeVries ( 961583 ) on Tuesday June 23, 2009 @02:40PM (#28442749) Journal
    That should be the title. That is, if it were newsworthy. Which it isn't.
  • by Null Nihils ( 965047 ) on Tuesday June 23, 2009 @02:41PM (#28442773) Journal

    This can easily be fixed, and should be right away. If Google is turning JavaScript into text output, they can easily parse that output (just like the spammers currently are) and see if the text contains an e-mail address. And if it does, they should omit it from search results (unless the address was originally plain text and not obfuscated, in which case they can assume the author wants it searchable).

  • by Bazman ( 4849 ) on Tuesday June 23, 2009 @02:45PM (#28442833) Journal

    So much content on the web these days is spat out by document.write(), I'm not surprised at all that google evaluates certain javascripts in order to get any content to index.

    Even done a "View Source" on a google mail or google maps page? The web is now javascript.

  • by Punto ( 100573 ) <puntob@gmai l . com> on Tuesday June 23, 2009 @02:47PM (#28442881) Homepage

    nowadays, half of the pages I try to visit don't render at all without javascript. Somtimes the main content is missing (you just get the headline, the links that go on the sides, and the ads), somtimes it's just a blank page. It seems like all these traditional news organizations just _have_ to be "web 2.0" to appear relevant again.

    Google needs to index the page, they don't have much choice.

  • Yes, but . . . (Score:2, Insightful)

    by Art3x ( 973401 ) on Tuesday June 23, 2009 @02:48PM (#28442891)

    Your email address will almost certainly get out. If not by a spambot then through an unscrupulous merchant.

    That's why spam filtering is better than email hiding. Gmail's spam filter, for example, is very good. I get spam in my Inbox about once a quarter.

    Google's job is to turn human-readable pages into machine-searchable pages. So it will always seek to expand what it can read: images, Flash, JavaScript, etc.

    It's best not to hide in the direction that technology is advancing.

  • by BlitzTech ( 1386589 ) on Tuesday June 23, 2009 @03:09PM (#28443255)
    AJAX is a great technology that has vastly improved the usefulness of the web. However, like every other fad, it gets significantly overused in places where it just IS NOT reasonable. I wish more developers would come to the realization that AJAX != 'Web 2.0-ifying your page' and move back to using the right technology for a given problem. AJAX everywhere just reeks of the same kind of software bloat that makes modern computers run slow compared to 5-10 year old equipment.

    When all you have is a hammer...
  • Re:Really.... (Score:5, Insightful)

    by buchner.johannes ( 1139593 ) on Tuesday June 23, 2009 @03:25PM (#28443511) Homepage Journal

    No it is not. If you increase the time used per website, you can not process that many websites anymore. JS obfuscated emails were protected because spammers didn't take effort.
    You might say computers got faster, but unfortunately the web didn't get smaller.

    Anyway, I understand the need to post email addresses on a website. How else should people contact you the first time? Personally, I don't like contact forms. Would you advocate for a CAPTCHA or requiring a POST request to obtain the real email address? You could still cry "security by obscurity".

    But you can't take away the option of posting email addresses on websites from users, as it is very useful to contact people by email. Reminds me of people saying "Flash is proprietary, and too fancy for my taste anyway, so nobody must use it. Use Javascript.".

    Maybe one should make swf files with the email in them. Muhahaha

  • by bheer ( 633842 ) <rbheer AT gmail DOT com> on Tuesday June 23, 2009 @03:26PM (#28443539)

    I don't think the spammers got his email address from Google. I mean, to do that they'd have to send a fairly narrow query to Google -- something like 'chibi jesus' -- and then scrape the results ... just scraping the cached page wouldn't help -- that contains JS, not the email address. Plus, I imagine Google would notice if a bot started sending lots of search queries its way.

    It's far more likely that spammer bots are now actively processing JS. As others on this thread have pointed out, it ain't hard to do.

  • by Todd Knarr ( 15451 ) on Tuesday June 23, 2009 @03:28PM (#28443561) Homepage

    Seconded. You don't need Javascript to do a simple hyperlink. You don't need a scrolling text-box to display your page, the browser can scroll the page just fine thankyouveddymuch. You don't need to dynamically replace elements to change content while maintaining a navigation header or sidebar when appropriate (note: appropriate) use of frames will accomplish exactly what you want.

    The two sins of engineering: making it more complicated than it needs to be, and making it simpler than it needs to be. Avoid them.

  • Re:robots.txt (Score:5, Insightful)

    by RajivSLK ( 398494 ) on Tuesday June 23, 2009 @03:40PM (#28443807)

    What would be nice is if google created a new tag in the lines of rel="nofollow" which would be an in-line way to keep the engine from seeing content.

    That would be exploited by spammers to the extreme. Imagine clicking on a listing for disney kids fun house only to have a hidden ad for an online Viagra dispensary dominate the page.

  • Re:Really.... (Score:2, Insightful)

    by mshieh ( 222547 ) on Tuesday June 23, 2009 @03:44PM (#28443867)

    I don't think I've had a single piece of spam pass through G-mails filter and only one false positive

    You mean you've only noticed one false positive. I'm sure it's been mentioned in half of the comments in this thread, but security by obscurity is effective because there is value in stopping half of the spam, unlike traditional security where having your data stolen and sold once is not a big gain over having it done many times. There are many reasons why obscurity works towards this goal of reduction rather than elimination.

  • by Asmor ( 775910 ) on Tuesday June 23, 2009 @03:56PM (#28444093) Homepage

    I publically list my email whenever I need to. If I want someone to email me something, I say, "Send it to itoltz@gmail.com". In fact, if HTML is allowed where ever I'm writing that, I'll even be so kind as make it a mailto link (i.e. <a href='mailto:itoltz@gmail.com'>itoltz@gmail.com</a>).

    And you know what? I almost never get spam in my inbox. I'd say a piece squeaks through Gmail's filters every few months (though when it does, I usually seem to get 2-3 similar spams over the course of a day or two).

    Granted, not everyone has the option of using gmail, and for those who do not everyone is comfortable with the idea of using it. That's fine. But the point is, if gmail is that good at filtering out spam, anyone else can be too.

  • Re:*rolleyes* (Score:3, Insightful)

    by Chabil Ha' ( 875116 ) on Tuesday June 23, 2009 @04:03PM (#28444239)

    To add:

    Relying on the expected behavior (Google not processing JS) of something over which you have no control for your security is pretty silly as well.

  • by Cajun Hell ( 725246 ) on Tuesday June 23, 2009 @04:26PM (#28444605) Homepage Journal

    For example, make a "Phone Number" field and set the CSS display attribute to none. Normal users won't see this field and won't fill it out. Spam-bots will see it and attempt to fill it out.

    This only works for as long as spammers don't care about it. I think anyone who can figure out the HTML resulting from javascript, can also figure out the style of an element.

    What's really funny about this problem is that we used to talk about using captchas to tell the robots apart from the meatbags, so that you could discriminate against robots. But now people want the robots to make sense of their page (so that they get referrals from Google) but they don't want the robots to make sense of their page (so that their email box doesn't get referrals from spambot). You're on the web or you're not. Choose.

  • Re:Mung (Score:2, Insightful)

    by Anonymous Coward on Tuesday June 23, 2009 @04:51PM (#28445043)

    I believe a 'WHOOSH' is in order.

  • Re:Mung (Score:3, Insightful)

    by Midnight Thunder ( 17205 ) on Tuesday June 23, 2009 @04:55PM (#28445095) Homepage Journal

    Actually proper English indicates that you double consonant when adding 'ing' if it ends with one, or drop the 'e' if it ends with one:
        hop -> hopping
        hope -> hoping

    so:
        munge -> munging
        mung -> mungging

  • Comment removed (Score:4, Insightful)

    by account_deleted ( 4530225 ) on Tuesday June 23, 2009 @05:30PM (#28445693)
    Comment removed based on user account deletion
  • Re:*rolleyes* (Score:3, Insightful)

    by interkin3tic ( 1469267 ) on Tuesday June 23, 2009 @07:22PM (#28447103)

    Do you really think whipping up a perl script is beyond the abilities of somebody who has the ability to run a spamming "business"?

    Maybe you mistook that for a rhetorical question, sorry for that misleading question, it was semi-honest. I really don't know how much effort goes into a spamming buisiness. Never met anyone who identified themselves as a spammer, so I don't know if they're as dumb as they seem. For that matter, I've never written a perl script.

    Just seems to me like if you have a decent head on your shoulders you'd be doing more than the equivalent of agressively begging for change on the sidewalk.

  • Re:*rolleyes* (Score:3, Insightful)

    by enoz ( 1181117 ) on Tuesday June 23, 2009 @07:53PM (#28447383)

    You miss the point.

    The Javascript obfuscation method allows you to make a mailto: url that was accessible to users yet difficult for spammers.

    Sticking your email in an image is probably worse then simply asking users to solve a captcha before giving them your email.

  • Re:Mung (Score:4, Insightful)

    by SausageOfDoom ( 930370 ) on Tuesday June 23, 2009 @09:12PM (#28447981)

    It has been happening for quite some time.

    I have always said that the only way to keep your e-mail address safe from spammers is to not give it out at all. Although Google may be doing it now, it's been perfectly possible for as long as computing power has been available cheaply to the spammers (ie botnets).

    About 4 years ago I conducted an experiment with anti-spam techniques for the comments on my blog. One of the things I tried was a bit of javascript which added a validation field to the form. The spammers kept on as if it wasn't there, which meant they had to be evaluating javascript.

    And the thing is, once your obsfucation measures are broken by the spammers, because of places like archive.org the internet never forgets - so you can't claw it back. You can update your obsfucation code on your site, but there's nothing stopping the spammers from simply trawling the archives and mirrors to find it there.

    The only way to protect your e-mail address is to never send it client-side - always put it behind a form and a server-side mailing script.

"I've seen it. It's rubbish." -- Marvin the Paranoid Android

Working...