Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Recovering Blurred Text Using Photoshop and JavaScript

Posted by timothy on Wed Oct 08, 2008 05:31 PM
from the careful-how-you-hide-stuff dept.
An anonymous reader writes "There's been a lot of talk about recovering blurred or pixelated text, but here's an actual implementation using nothing but Photoshop and a little JavaScript. Includes a Hollywood-esque video showing the uncovered letters slowly appearing."
+ -
story

Related Stories

[+] Blurring Images Not So Secure 166 comments
An anonymous reader writes "Dheera Venkatraman explains in a webpage how an attacker might be able to extract personal information such as check or credit card numbers, from images blurred with a mosaic effect, potentially exposing the data behind hundreds of images of blurred checks found online, and provides a ficticious example. While much needs to be developed to apply such an algorithm to real photographic images, he offers a simple, yet obvious solution: cover up the sensitive information, don't blur it."
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • From TFA:

    The most important feature is the JavaScript scripting environment built into Photoshop, which is far more powerful than the AppleScript environment (and a much nicer language, in my opinion).

    Hey Timothy, are you trying to get the mac fans riled up?

    I'd try this myself, but I stopped pirating PS at version 6. Be interesting to see what other Slashdotters make of this.

  • Oblig (Score:5, Funny)

    by Anonymous Coward on Wednesday October 08 2008, @05:34PM (#25306651)
    Now for all of the other pixellated stuff...

    Bwahahahaha.
    • Re:Oblig (Score:5, Funny)

      by Anonymous Coward on Wednesday October 08 2008, @05:54PM (#25306853)

      I'm off to ebay to try it on all those XP serial numbers.

  • by Anonymous Coward

    20x zoom. Mr. Data, what does that look like to you?

  • Just ovveride? (Score:5, Insightful)

    by Dreen (1349993) <dreen1@@@gmail...com> on Wednesday October 08 2008, @05:43PM (#25306737)
    I never understood why people use pixel mis-mashing when they want to obfuscate something in an image.

    drawing a big black rectangle is 10x faster and there is no way you can de-obfuscate that
    • Re:Just ovveride? (Score:5, Interesting)

      by fbjon (692006) on Wednesday October 08 2008, @05:49PM (#25306811) Homepage Journal
      It may stand out too much in the layout. What would be interesting (maybe even useful) is developing a formula that determines the minimum mosaic size for a given font style/complexity and size that makes reversing it produce ambiguous results, one extreme being a black rectangle, the other no mosaic at all.
      • Re:Just ovveride? (Score:5, Interesting)

        by Zadaz (950521) on Wednesday October 08 2008, @06:03PM (#25306943)

        Why not make a "secure" mosaic filter that does one or more of the following:

        - Randomizes the pixel data before applying the mosaic (Keeping the original colors so the mosaic looks natural.)

        - Applies noise to the area before applying the mosaic (Could use intensity noise rather than hue to retain the color scheme that is being obfuscated.)

        - Requires the user to drag the smudge tool across the area by a pre-determined amount to randomize the data before the mosaic.

        - Apply noise to the mosaic pixels, (1:1 with the mosaic size) after it has been applied.

        These would all retain the look of the mosaic but would defeat the reverse engineering tactics displayed here.

        Heck, forget the plugin, these would be pretty simple to automate these within Photoshop.

        • by larry bagina (561269) on Wednesday October 08 2008, @06:44PM (#25307323) Journal

          Heck, forget the plugin, these would be pretty simple to automate these within Photoshop.

          This is gimp country. On a quiet night, you can hear adobe squeal like a pig.

        • If you randomize pixel in any way before applying mosaic, adding noise or just randomly permute the pixels around, then what you get in the mosaic is going to be the same colored blocks that look like a solid colored strip. That's because mosaic computes the average of the pixel values that fall under the block, which would be the same for all blocks if randomness is evenly distributed. However, if you apply mosaic first and then randomize the blocks after, then the result looks much more like mosaic and y
    • Absolutely agree.

      Plus TFA says the original image was blurred using the Photoshop "mosaic" filter. So this approach, while interesting for text blurred using that exact filter, is probably useless in most real-world approaches, such as trying to recover text obliterated with the rubber-stamp tool, or like you suggest, a black box.

    • Re:Just ovveride? (Score:5, Informative)

      by Tumbleweed (3706) * on Wednesday October 08 2008, @06:10PM (#25307017) Homepage

      drawing a big black rectangle is 10x faster and there is no way you can de-obfuscate that

      Just make sure you're not saving in a file format that has a preview, where the preview doesn't have the obfuscation updated. :)

    • by IdahoEv (195056) on Wednesday October 08 2008, @07:05PM (#25307485) Homepage

      drawing a big black rectangle is 10x faster and there is no way you can de-obfuscate that

      That's not entirely true. There was an article a couple years back about a technique for recovering redacted text with pretty high reliability.

      It used the fact that most standard fonts have variable spacing, and that once you've determined the font you can model that only certain combinations of letters will actually fit in the space of the redacted word or words. Combined with a dictionary and bayesian matching based on nearby words, you can often figure out what words would have fit into a redacted rectangle. Or at least limit it to a fairly small pool of possibilities.

      They demonstrated it on a redacted government document, and pulled out some places where the redacted words had to be "Iran" and "Ahmedinejad" etc., because nothing else both fit and made sense. If it's a monospaced font, you know the exact number of letters of the redacted text.

      I can't find the original link, but here's a paper [lehigh.edu] that describes some of the techniques available for "cracking" blackout redaction. (some apply only to magic-marker-type redaction, but others apply even to electronic black-rectangle redaction).

  • by Empiric (675968) on Wednesday October 08 2008, @05:50PM (#25306827) Homepage

    Is it not the case that the reason this works is because you're running the -same blur algorithm- with the -same input- (the unblurred letters/pixels) and simply iterating through the letters and looking for equivalent output?

    Presumably, the blur algorithm output could resolve such that multiple unblurred letters resolve to the same blurred pixels, but even if it is not this trivial to map the input state to output state, it still wouldn't seem to me to approach solving the general case of letters "blurred" by any arbitrary means, which is the real-world capability implied by the article.

    What am I missing here?

    • by sexconker (1179573) on Wednesday October 08 2008, @06:23PM (#25307129)

      You're not missing anything.
      This guy tried to read some blurred text on his friend's site, so he decided to mess around in photoshop.

      He got over zealous and did some javascript stuff in photoshop, based on blurring known text and attempting to reblur and match that text 1 letter at a time.

      He was then disappointed that he couldn't use the thing to unblur assumed text of unknown font, font settings, color, language, character set, and blurring algorithm after unknown layers of image alteration after the text was rendered, and after unknown compression.

        • The ????? is finally revealed!
          1. - Get frustrated at blurred text
          2. - Fail to write unblur filter
          3. - Post to slashdot
          4. - PROFIT!!!!
  • Ideal conditions (Score:5, Interesting)

    by Itninja (937614) on Wednesday October 08 2008, @05:56PM (#25306871) Homepage
    The article mentions the authors 'cheating a little' by de-blurring the image under 'ideal conditions'. From what I can gather, he is using a source Photoshop file (PSD) as the sample. If he already had access to the source PSD file, wouldn't it be easier to just click undo a few dozen times? Can this be reproduced to a raster image at all?
    • Re: (Score:3, Insightful)

      I imagine the "cheating" aspect of using a PSD file is that you are working with a lossless image. There's no JPEG/MPEG compression going on etc. Working with another lossless format like BMP, or PNG, would probably work just as well.

      • Re: (Score:3, Interesting)

        What is more significant is knowing exactly which font and which blur filter was used. Also having the filter applied evenly is going to make things far easier than if someone grabs the blur tool and scribbles over the area with it.

    • Re: (Score:2, Informative)

      Once I hit "save" in photoshop and exit I can't undo. Although in a layered image the rasterized text (must be rasterized before we can liquify/blur it) would probably be on a separate layer, but it's generally much more incriminating to replace text and not unblur text and it's probably not layered like that.

  • Old tech (Score:5, Funny)

    by Glith (7368) on Wednesday October 08 2008, @06:03PM (#25306947)

    Jeez. Hasn't anyone seen CSI?

  • Failure (Score:5, Insightful)

    by sexconker (1179573) on Wednesday October 08 2008, @06:13PM (#25307051)

    "While my original goal of recovering the censored text on my friendâ(TM)s page was never achieved, the project was a success."

    I wouldn't call that a success...

    Good execution of a basic concept, but the fact remains that this shit is infeasible in practice. You have all the font issues (the typeface, the spacing, the color, the size, etc.), and you've got all the source issues - Are you sure that's text? Is it English? Was it obfuscated in other ways? Has the image been altered after the text was rendered? How has compression affected it?

    The biggest fucking issue, of course, is that you're assuming the text was obfuscated using photoshop, or at least very similar blurring/pixelating algorithms.

    It's a great project in terms of using javascript and photoshop to do something neat but basic in concept (essentially brute forcing, as the author says).

    But unless you have inside info about how the text was rendered and obfuscated, you're better off taking a step back and squinting.

    I think I see a duck.

    • Re:Failure (Score:4, Funny)

      by finiteSet (834891) on Wednesday October 08 2008, @06:53PM (#25307389)

      But unless you have inside info about how the text was rendered and obfuscated, you're better off taking a step back and squinting.

      Little Girl: "Wow! It's a schooner!"
      Willam: "You dumb bastard! That's not a schooner, it's a sailboat!"
      Little Boy: "A schooner is a sailboat, stupidhead!"
      Willam: "Well you know what? THERE IS NO EASTER BUNNY! OVER THERE, THAT'S JUST A GUY IN A SUIT!!"

  • Coral Link (Score:3, Informative)

    by mrbene (1380531) on Wednesday October 08 2008, @06:17PM (#25307075)
    I'm getting a 500 status connecting to the original, but it seems that Coral CDN has a decent cache [nyud.net].
  • "The easy way to read numbers"
  • Short version (Score:5, Interesting)

    by rabtech (223758) <slashdot_sez&boneville,net> on Wednesday October 08 2008, @06:28PM (#25307173)

    If you know the blur algorithm, you can run each character through it to produce the blurred output, then compare that result to the image you are trying to unscramble and pick the closest match.

    This assumes the blurring doesn't cause pixels to overlap their neighbors too much, that the algorithm produces deterministic output (isn't random), and that there are few possible inputs resulting in the same blurred output.

    If the letters overlap because the blur blends with its neighbors then it just becomes a computational complexity problem where you have to try words instead of letters. A lot harder, but not totally impossible.

    A blurring algorithm that used some large mosaic effect prior to bluring or used randomized input would produce a similar looking blur effect, but without disclosing much about the input.

    Personally, I'd prefer examining the blur area for the predominant background color and create a gradient/mosaic around that color to fill the area. Then there is absolutely no chance of recovering information, but the effect on video wouldn't be too horribly jarring (as a black box might be).

  • by vodevil (856500) on Wednesday October 08 2008, @07:36PM (#25307725)
    I think people would be more interested if this removed the blur from nipples.
  • javascript dock? (Score:3, Interesting)

    by Eil (82413) on Wednesday October 08 2008, @10:07PM (#25308829) Homepage Journal

    Holy cow, I thought I had seen all possible ways to crapify a web page via Javascript, but today I have learned of another: Adding a partially transparent Mac OS dock to it. So that when you scroll down (as you normally do with web pages), the thing jumps around like a monkey on amphetamines. And you presume that it doesn't get any worse than that, wouldn't you? You would, but then you'd be wrong because it also does the magnification thing and it does it about as well as you'd expect Javascript to do it.

    Now if you'll excuse me, I'm heading out to abuse my 2nd Amendment rights.

    • Wouldn't it be cool if you could use this technique to clear up the fuzzies on censored Japanese pr0n?!

      • by moderatorrater (1095745) on Wednesday October 08 2008, @06:04PM (#25306955)
        You're on the wrong forum for that line of reasoning. You'll want to find a site that's either anti-censorship or pro-porn. Good luck finding something like that on the internet.
      • Wouldn't it be cool if you could use this technique to clear up the fuzzies on censored Japanese pr0n?!

        Uh, those "fuzzies" are what are known as pubic hair. You might be unfamiliar with it as American porn stars have razor rash down there instead :)

        <sarcasm>Wasn't it ingenious of them to grow hair down there to cover up all that unsightly stubble?</sarcasm>

    • You can do the exact same thing in GIMP rather easily.

      Get some image with blurred text.
      Try blurring some letters to match up with the blurred text.
      Fail.

      See? You've already done what this guy did with javascript, photoshop, and who know how much wasted time.