Compressed VoIP Calls Vulnerable To Bugging

Posted by kdawson on Friday June 13, 2008 @12:20PM from the say-that-again-slowly dept.

holy_calamity writes "Security researchers at Johns Hopkins report that a variable bit-rate compression scheme being rolled out on VoIP systems leaves encrypted calls vulnerable to bugging. Simpler syllables are squeezed into smaller data packets, with more complex ones taking up more space; the researchers built software that uses this to spot phrases of interest in encrypted calls simply by measuring packet size."
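
To make the summary concrete, here is a toy sketch of the idea in Python. It is not the researchers' software and makes no claim to reproduce their method; it simply slides a made-up "template" of packet sizes over an observed sequence of encrypted packet sizes and reports the closest window. The function name `best_match` and every number below are hypothetical.

```python
def best_match(observed, template):
    """Return (offset, distance) of the window in `observed` closest to `template`.

    Distance is the sum of absolute differences between packet sizes.
    """
    if len(template) > len(observed):
        raise ValueError("template longer than observed sequence")
    best_offset, best_distance = 0, float("inf")
    for offset in range(len(observed) - len(template) + 1):
        window = observed[offset:offset + len(template)]
        distance = sum(abs(a - b) for a, b in zip(window, template))
        if distance < best_distance:
            best_offset, best_distance = offset, distance
    return best_offset, best_distance


# Hypothetical encrypted payload sizes in bytes, one per codec frame.
observed = [20, 20, 38, 46, 62, 38, 20, 28, 46, 62, 40, 20]
# Hypothetical size profile recorded for a phrase of interest.
template = [46, 62, 38]

offset, distance = best_match(observed, template)
print(offset, distance)
```

The eavesdropper never decrypts anything here; the only input is the list of sizes, which encryption leaves visible.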

This discussion has been archived. No new comments can be posted.


Easy Solution: (Score:5, Insightful)

by dintech ( 998802 ) writes: on Friday June 13, 2008 @12:23PM (#23780065)

Easy Solution. Music in the background.

- Re:Easy Solution: (Score:5, Insightful)
  
  by Anonymous Coward writes: on Friday June 13, 2008 @12:26PM (#23780127)
  
  Better solution: Fix the stupid, broken protocol.
  
  For instance, the concept of RSA blinding had to be invented because people discovered that certain bits of the SSL private key can be determined simply by measuring the time it takes to encode messages. This was due to some implementation details inside SSLeay where it switched from one multiplication algorithm to a different one depending on the size of certain numbers in the algorithm.
  
  OAEP had to be invented for similar reasons
  
  "Music in the background" is not a security solution. In fact, that's a freaking joke.
  
  - Re:Easy Solution: (Score:5, Funny)
    
    by Daimanta ( 1140543 ) writes: on Friday June 13, 2008 @12:44PM (#23780489) Journal
    
    ""Music in the background" is not a security solution. In fact, that's a freaking joke."
    
    Yes, but a joke you can dance to.
    
    - Re: (Score:2)
      
      by ehrichweiss ( 706417 ) writes:
      
      VOIP: It's got a good beat and I can bug out to it.
    - Re: (Score:3, Funny)
      
      by martin_henry ( 1032656 ) writes:
      
      Awesome....a VOIP dance party.
  - Protocol isn't broken - it's badly mixed (Score:5, Insightful)
    
    by billstewart ( 78916 ) writes: on Friday June 13, 2008 @01:59PM (#23782057) Journal
    
    This isn't a simple case of a broken protocol - it's an effect of mixing different protocols in ways that don't work together.
    Voice codecs are designed to support a given level of audio quality subject to bit rate and computational complexity limitations. Most codecs are fixed-rate, or fixed-rate with silence suppression. Encryption isn't part of their design; it's somebody else's problem, and many VOIP systems aren't encrypted anyway (for instance, connections between an office phone and a PBX usually aren't.) Variable bit rate codecs are sometimes a good choice, depending on the kind of sounds you're trying to compress and the networks you're transmitting them on, and they're at least an alternative to the usual fixed-rate codecs.
    Encryption systems usually aren't designed to deal with real-time message streams or timing attacks. Typically VOIP encryption protocols are designed for constant bit rate codec output, which is what most codecs provide, and the codecs usually package up 10, 20, or 30ms audio samples into a data packet for transmission over IP.
    The problem occurs when you're choosing your codec and encryption separately, and you take a crypto system designed for fixed-rate codecs and use a variable-bit-rate codec instead. It's difficult to keep people from doing that sort of thing, especially if they're using huge-overhead approaches like VOIP inside IPSEC as opposed to VOIP systems with the crypto built in. It's also difficult to prevent people from making bad choices like that when they're using open-source software applications, as opposed to proprietary phones that only have the small set of codecs the manufacturer built in (typically uncompressed G.711, or G.729 or a GSM codec, all of which are fixed-rate except for silence suppression.)
    
  - Re: (Score:3, Informative)
    
    by jonaskoelker ( 922170 ) writes:
    
    OAEP had to be invented for similar reasons
    Not true: OAEP fixes problems with the math, which by its declarative nature is timing-independent.
    
    The problem fixed by OAEP is this: suppose you want to send a message from a small set (say, a single bit, or "attack" versus "retreat"); assume for convenience the set of messages is contained in [0, n-1], where n = pq is part of the RSA public key.
    
    If you just do plain RSA encryption (c = m^e % n), then the eavesdropper can encrypt all the values from the small set in almost no time, and see which of the encrypti
  - Re: (Score:2)
    
    by lpq ( 583377 ) writes:
    
    The music in the background isn't so ridiculous...though maybe it would be 'random' noise generated by the sending phone that is based on a key-negotiation sent at call start & maybe changed periodically throughout the call. But it may not be possible to remove the "scarcity" of information in the data-stream (all the small words have been compressed, so few bits are used) from the real-time nature of VoIP -- and the fact that people might not be saying much, using "little words", or whatever. It seem
  - Not a joke. Another layer on the onion, friend. (Score:2)
    
    by aphor ( 99965 ) writes:
    
    Hey AC: don't be an asshuile. We are all on the same team here no? You are right, but the irony is that because you are right, you are frustrated that people don't get it, and you react in a way that reduces the fraction of readers that will get it.
    
    It's worth noting that the wrongest part of the dintech post you're criticizing has nothing to do with music. It's "Easy Solution"... as if. So is it going to be "give a man a fish" or "teach a man to fish?"
- Re: (Score:3, Funny)
  
  by gstoddart ( 321705 ) writes:
  
  Easy Solution. Music in the background.
  
  Oh, sure, give the RIAA reason to get involved in encrypted phone calls.
  
  They'll try to make sure you're not using unlicensed music to mask your conversations. We'll be seeing John Doe subpoenas to get access to what music you were playing. :-P
  
  I'm only half joking.
  
  Cheers
- Not really... (Score:4, Informative)
  
  by msauve ( 701917 ) writes: on Friday June 13, 2008 @01:42PM (#23781761)
  
  First, the paper was testing the Speex codec [speex.org], and is based in principle on looking at codecs which use variable bit-rate CELP, a compression scheme which is tailored to speech, not music (music sounds terrible through one of these codecs, because their dictionaries are filled with speech sounds). Having music in the background is only likely to confuse the codec, making the speech sound terrible too, possibly to the point of unintelligibility.
  
  The conclusions do not apply to more standardized codecs like G.711 and G.729a, which use fixed size packets.
  
  The paper itself can be downloaded from here [jhu.edu]. Get it quick, before the IEEE figures this out and makes the author remove it so they can extort their fee.
  
- Re: (Score:2)
  
  by billcopc ( 196330 ) writes:
  
  Easier solution: don't use voice.
  
  I've always found VoIP rather humorous, since you're taking a digital channel, and shoving voice through it - you know, like the reverse of a dial-up modem.
  
  If you're dealing with sensitive stuff that you don't want eavesdropped, do it in a secure IM session or encrypted email. Talk is overrated!
Do what my grandparents do (Score:5, Interesting)

by phorm ( 591458 ) writes: on Friday June 13, 2008 @12:23PM (#23780075) Journal

Anyone wanting to avoid detection could just follow what my German-speaking grandparents do when they don't want us kids listening in on the conversation: randomly switch languages on different topics (though I think that this is sometimes also because some concepts are easier to express in a given language).

Random switches between languages would probably confuse the heck out of filters guessing compressed data. That or you could just learn Russian... I don't think they *have* any simple-syllable words in Russian :-)

- Re:Do what my grandparents do (Score:5, Funny)
  
  by smitty97 ( 995791 ) writes: on Friday June 13, 2008 @12:26PM (#23780115)
  
  That or you could just learn Russian... I don't think they *have* any simple-syllable words in Russian :-)
  In Soviet Russia, VoIP bugs you!
  
- Re:Do what my grandparents do (Score:5, Funny)
  
  by markana ( 152984 ) writes: on Friday June 13, 2008 @12:47PM (#23780565)
  
  >That or you could just learn Russian... I don't think they *have* any simple-syllable words in Russian :-)
  
  Da!
  
- Re:Do what my grandparents do (Score:4, Funny)
  
  by mlwmohawk ( 801821 ) writes: on Friday June 13, 2008 @12:48PM (#23780577)
  
  Just speak Arabic!! We already know the FBI and CIA don't have enough translators.
  
  - Re:Do what my grandparents do (Score:4, Interesting)
    
    by MindStalker ( 22827 ) writes: <mindstalker@gmail.cDEGASom minus painter> on Friday June 13, 2008 @01:35PM (#23781623) Journal
    
    Depends upon how you define "translators." One of my best friends just got out of the Army. He is a really good linguist and knows several languages, but he flunked out of the Arabic program because it's not just hard to learn; you have to learn hundreds of dialects even for Iraq. He could understand it well enough, but to be able to go out on the street and translate you have to be certain you won't accidentally offend with a mistranslation. Apparently virtually no non-native Arabic speakers ever make it through this program. Anyway, he got reassigned to listen to and interpret radio broadcasts and other incoming information. Not officially a translator. The point to this story?? I don't know..
    
    - Re: (Score:2, Informative)
      
      by wyohman ( 737898 ) writes:
      
      Apparently virtually no non-native Arabic speakers ever make it through this program.
      
      Not likely spanky. Were this true, the language school at Monterey would have to answer a lot of very tough questions. Like my wise grandfather used to say, "Believe half of what you see and none of what you hear."
      
      Cheers.
  - Re: (Score:2, Funny)
    
    by Samizdata ( 1093963 ) writes:
    
    I worked for a set of commodity trader brothers back in the 80's. One of them, who worked as their corporate attorney, was in a Club Fed for tax issues.
    
    I saw more than one threat from the Bureau of Prisons warning them to stop using Latvian (their native tongue) during phone calls to the incarcerated.
    - Re: (Score:2)
      
      by mlwmohawk ( 801821 ) writes:
      
      Latvian? That's funny.
      
      Security through obscurity indeed.
- Re: (Score:3, Insightful)
  
  by JeffAMcGee ( 950264 ) writes:
  
  Going from one language to two would only make the process of breaking the message a bit more complex, and by that I mean precisely one bit more complex, because there would be about twice as many phrases to look for. This is not a solution. The solution is to not use variable bit rate compression if security is important.
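  
  The "one bit" figure can be spelled out under the assumption, made here only for illustration, that search cost is measured as the base-2 logarithm of the number of candidate phrases and that each language contributes N of them:
  
  ```latex
  \log_2(2N) = \log_2 2 + \log_2 N = 1 + \log_2 N
  ```
  
  Doubling the phrase list adds exactly one bit to that measure, whatever N is.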
  - - Re: (Score:2)
      
      by mpe ( 36238 ) writes:
      
      It wouldn't be a matter of just having twice as many phrases, because a phrase in one language could be recognised as a completely different phrase in English. Your point also doesn't take into account mid-sentence language switches ie. simply substituting words into otherwise English sentences. Also, your point is not valid unless the people analysing the traffic have prior knowledge of which two languages would be spoken, and therefore the system would have to cover the most common languages (at least 8 I
- Re: (Score:3, Funny)
  
  by hummassa ( 157160 ) writes:
  
  That or you could just learn Russian...
  Which would give you an advantage, if you ever have to pilot a bleeding-edge mind-controlled Russian jet fighter.
- Re: (Score:2)
  
  by SBacks ( 1286786 ) writes:
  
  If you're going to go through all this trouble, you might as well start from scratch. Make a language only you and people you wish to secretly communicate with know.
  - Re: (Score:2)
    
    by Anarke_Incarnate ( 733529 ) writes:
    
    Nah, Esperanto for the win :)
  - Re: (Score:2)
    
    by gnick ( 1211984 ) writes:
    
    If you're going to go through all this trouble, you might as well start from scratch. Make a language only you and people you wish to secretly communicate with know.
    Ix! Gorfat blutell pragmew ig jounty crein moxin fout. Im odin reax trelli poin zor trillo daster zub? Unt jo.
- Re: (Score:2)
  
  by caluml ( 551744 ) writes:
  
  I don't think they *have* any simple-syllable words in Russian :-)
  Da?
Randomize the packets slightly (Score:2)

by BasharTeg ( 71923 ) writes:

I would think that a very slight randomization of the packets with filler would add a trivial amount of data to the packet and would tend to interfere with their analysis. I'm sure after a certain point of added bytes and randomization, you would change their margin of error such that the process wasn't useful or effective anymore.
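
A minimal sketch of what such filler could look like, in Python. The names `pad_frame` and `unpad_frame`, the 2-byte length prefix, and the 16-byte maximum are assumptions for illustration, not part of any VoIP standard; the padding would have to be applied before encryption so that the receiver can strip it after decrypting.

```python
import secrets
import struct


def pad_frame(frame: bytes, max_pad: int = 16) -> bytes:
    """Prefix the real length (2 bytes, big-endian) and append 0..max_pad filler bytes."""
    pad_len = secrets.randbelow(max_pad + 1)
    return struct.pack("!H", len(frame)) + frame + secrets.token_bytes(pad_len)


def unpad_frame(padded: bytes) -> bytes:
    """Read the length prefix and return only the original frame bytes."""
    (length,) = struct.unpack("!H", padded[:2])
    return padded[2:2 + length]


frame = b"\x01\x02\x03" * 7          # hypothetical 21-byte codec frame
padded = pad_frame(frame)
print(len(frame), len(padded))       # padded length varies from run to run
print(unpad_frame(padded) == frame)
```

Random filler only blurs the size signal rather than removing it; padding every packet to one fixed size hides it entirely, at the cost of more bandwidth.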
- Re:Randomize the packets slightly (Score:4, Informative)
  
  by pclminion ( 145572 ) writes: on Friday June 13, 2008 @12:33PM (#23780289)
  
  Time/space attacks are well known. Somebody who actually, hmm, UNDERSTOOD cryptographic security would never have designed the protocol this way in the first place.
  
  The people suggesting that we should just inject noise or background patterns are being ridiculous. Why sacrifice communication quality when there are BETTER ways to fix it? DO IT RIGHT.
  
  - Re: (Score:3, Interesting)
    
    by Creepy Crawler ( 680178 ) writes:
    
    ---The people suggesting that we should just inject noise or background patterns are being ridiculous. Why sacrifice communication quality when there are BETTER ways to fix it? DO IT RIGHT.
    
    Injecting "noise" makes sense for me. Why so?
    
    We use a salt for our hashes, don't we? The "noise" would be the same thing. Consider this: during negotiation, we have chaotic noise formulas in which we propagate the variables so that each side knows the noise transform. We then add the noise after digitization but before e
    - Re: (Score:2)
      
      by pclminion ( 145572 ) writes:
      
      Sure, but this sort of defeats the purpose of VBR, since the resulting audio stream is random and thus VBR can't really do it's "thing."
      - Re: (Score:2)
        
        by pclminion ( 145572 ) writes:
        
        Damn. I apologize for the apostrophe abuse.
Evasive, ummm, technology (Score:5, Funny)

by martyb ( 196687 ) writes: on Friday June 13, 2008 @12:30PM (#23780223)

FTFA
In tests on example conversations, the software correctly identified phrases with an average accuracy of about 50%. But that jumped to 90% for longer, more complicated words. Wright thinks these phrases may be the most important. "I think the attack is much more of a threat to calls with some sort of professional jargon where you have lots of big words that string together to make long, relatively predictable phrases," he says. "Informal conversational speech would be tougher because it's so much more random."

So, ummm, what we should do to, umm, well, protect ourselves from, ummm, yaknow, eavesdroppers, heh-heh, is well, make sure there's enough, ummmmmmm, yaknow, like extra noise, like, mixed in, dude.

- Re:Evasive, ummm, technology (Score:5, Funny)
  
  by gstoddart ( 321705 ) writes: on Friday June 13, 2008 @01:18PM (#23781273) Homepage
  
  So, ummm, what we should do to, umm, well, protect ourselves from, ummm, yaknow, eavesdroppers, heh-heh, is well, make sure there's enough, ummmmmmm, yaknow, like extra noise, like, mixed in, dude.
  
  Oh my god, thats like, totally, like, a great idea, yaknow. I mean, like, they'll never figure out what we're, like, saying, yaknow?
  
  Oryoucouldspeakreallyfastwithoutpausesbetweenwords. Thatwaythey'llneverknowwhatyousaid =)
  
  Or. We. Could. All. Speak. Like. Shatner. Random. Long. Pauses. Genius.
  
  Cheers
  
  - Re: (Score:2, Funny)
    
    by Anonymous Coward writes:
    
    Or. We. Could. All. Speak.
    Like. Shatner. Random. Long.
    Pauses. Genius.
    
    You're missing one syllable in the middle line...
    - Re: (Score:2)
      
      by gstoddart ( 321705 ) writes:
      
      You're missing one syllable in the middle line...
      
      He doesn't have a full pause between every syllable. Sometimes, he'll put a couple of words together in one go as well "Spock. How. Areweto. Know. Whattodo?" or "Nurse. Chapel. Let's. Gotomyroom". ;-)
      
      Cheers
  - Re: (Score:2)
    
    by CodeBuster ( 516420 ) writes:
    
    Brilliant!
- Re: (Score:2)
  
  by Lisandro ( 799651 ) writes:
  
  Lumbergh? [blogger.com] Is that you?
- - - Re: (Score:2)
      
      by SBacks ( 1286786 ) writes:
      
      Yes, but you can click that little box that says "Post Anonymously" when you are logged in.
It's easy to encrypt your conversations (Score:3, Interesting)

by muellerr1 ( 868578 ) writes: on Friday June 13, 2008 @12:32PM (#23780265) Homepage

Just st-st-stuh-stutter when you talk. And use a lot of, uh, you know, um, non-word sounds between, uh, like, your phrases. And don't use any complexificated words without Bushifying them first. Better yet, only speak in Klingon.

Or maybe you shouldn't say anything on VoIP that you don't want anyone else to hear.

- Re: (Score:3)
  
  by Ephemeriis ( 315124 ) writes:
  
  Or maybe you shouldn't say anything on VoIP that you don't want anyone else to hear.
  A couple honest questions...
  
  1) Why do I see so much about wiretapping/bugging VoIP lately? I guess I've always assumed that VoIP was just as vulnerable to bugging as POTS - maybe even more so. Was I wrong? Was VoIP previously un-buggable and this just recently changed? Or is it just because VoIP is the new, cool thing?
  
  2) Why would anyone think that compressed VoIP would be any more or less secure than uncompressed? As
  - Re: (Score:3, Informative)
    
    by mmkkbb ( 816035 ) writes:
    
    From the article summary above: "a variable bit-rate compression scheme being rolled out on VoIP systems leaves encrypted calls vulnerable to bugging" and "spot phrases of interest in encrypted calls simply by measuring packet size."
    
    Emphasis mine.
  - This is compressed, encrypted VOIP (Score:3, Informative)
    
    by billstewart ( 78916 ) writes:
    
    1) You're seeing lots about bugging and wiretapping VOIP because VOIP use is increasing, and because the buggers in government are getting really aggressive about wanting to wiretap people. VOIP is potentially less secure than POTS, because there are more ways to tap the Internet than traditional phones (where you either use alligator clips on the wire or go to the phone company office), and it's also potentially much more secure than POTS, because the end users can do their own encryption without needing
    - Re: (Score:2)
      
      by FLEB ( 312391 ) writes:
      
      I think another factor adding to the possibility of VOIP tapping is that your conversation is liable to be sent over a range of different midpoints and hardware, owned by a variety of people. Plus, tapping in and copying the information stream has very little chance of creating noticeable interference.
    - Re: (Score:3, Funny)
      
      by 6Yankee ( 597075 ) writes:
      
      the buggers in government
      
      Oooh. Well played, Sir, well played.
    - Re: (Score:2)
      
      by AnyoneEB ( 574727 ) writes:
      
      an 8kbps voice codec typically takes 24-28kbps of IP if you don't encrypt it, and maybe double if you do.
      As I understand modern encryption, it adds overhead to creating the connection because encryption keys have to be shared before encrypted data can be sent, but the actual encryption is done with a cipher such that it takes up the same amount of space encrypted as decrypted, so it does not cause a size overhead on the main data transfer. Then again, if it changes keys often or is doing something else special, then maybe it would cause that much overhead.
      - Re: (Score:2)
        
        by billstewart ( 78916 ) writes:
        
        That's not actually the problem here - as you say, the encryption itself doesn't cause a size increase except for an initial key exchange. The problem is that you're taking a raw data packet of two or three 10-byte compressed-voice samples (each 10ms at 8kbps), wrapping them in RTP/UDP headers (20 bytes) and IP headers (another 20 bytes), and then if you're doing a typical VPN with tunnel-mode IPSEC, you're adding another layer of IP headers (which are slightly larger because they've got IPSEC options adde
- Re: (Score:2)
  
  by gstoddart ( 321705 ) writes:
  
  Or maybe you shouldn't say anything on VoIP that you don't want anyone else to hear.
  
  Well, then what's the point in saying it at all then? ;-)
  
  Oh, you mean someone other than the person you're talking to. My bad. :-P
  
  Cheers
Bad science (Score:4, Insightful)

by DrYak ( 748999 ) writes: on Friday June 13, 2008 @12:53PM (#23780695) Homepage

First, the article mixes things up:
Vowels are actually simpler to compress than consonants (because of spectral complexity - consonants use many more different frequencies. They are mostly noise and have a more "random"-like waveform, making them harder to compress). They got it completely in reverse.

Then TFA doesn't show a method to magically guess what is being said over an encrypted channel only by looking at the bitrates; it only says that it finds some predetermined pattern in a given set of samples to test against. The whole thing would only be able to answer some very simple questions like "did the words XYZ appear in the conversation? Or did ABC appear in the conversation?" - with a rather bad success rate if those words are long and complex enough - which hardly makes it enough to obtain personal information or otherwise efficiently spy on someone.

Then the whole system has a lot of shortcomings:
- As said before, it assumes that the spy knows exactly that some phrase has to be said - if the spy doesn't guess exactly what words he must search for, the attack fails (the users may be speaking in a foreign language to begin with).
- It assumes that the speech-generator-made needle they are looking for in the haystack will be close to what is actually said. The users may have an accent and pronounce words differently (cf. aluminum vs. aluminium, etc.).
- And worst of all, it assumes that the granularity of the packets will be small enough that the phonemes will have an influence on the bit rate. In reality, short packets have a big bandwidth overhead, while longer packets increase the latency. But lots of VoIP users are happy with a 500ms latency because it really diminishes the overhead. At 500ms you can have a couple of words in a single packet. The whole packet will tend to have a bandwidth close to the average (there will be small differences between phonemes, but these will all be packed into the same packet and will average out).
- It fails to take into account an interleaved video stream. Video conferencing is really popular, and its own bandwidth will completely dwarf the bandwidth used by audio. So unless the VoIP system uses 2 separate streams (some VoIP systems do), and only encrypts at the stream level, and the transmission is happening over an unencrypted channel (no sane person should do that), this method will fail epically.
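
The averaging argument in the third point can be illustrated with a small Python sketch. The frame sizes below are invented, and the helper names `packet_sizes` and `relative_spread` are hypothetical; the sketch only shows that summing several variable-size frames into one packet shrinks the relative spread of packet sizes for this made-up data.

```python
from statistics import mean, pstdev


def packet_sizes(frame_sizes, frames_per_packet):
    """Sum consecutive frame sizes into packets of `frames_per_packet` frames."""
    return [
        sum(frame_sizes[i:i + frames_per_packet])
        for i in range(0, len(frame_sizes), frames_per_packet)
    ]


def relative_spread(sizes):
    """Population standard deviation divided by the mean."""
    return pstdev(sizes) / mean(sizes)


# Hypothetical per-frame sizes in bytes from a variable bit-rate codec.
frames = [10, 42, 20, 38, 15, 46, 28, 33, 12, 40, 22, 36, 18, 44, 26, 30]

for n in (1, 4, 8):
    sizes = packet_sizes(frames, n)
    print(n, sizes, round(relative_spread(sizes), 3))
```

Real speech is correlated over time, so the smoothing in practice may be weaker than with these hand-picked numbers.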

- Re: (Score:2)
  
  by pclminion ( 145572 ) writes:
  
  Then TFA doesn't show a method to magically guess what is being said over an encrypted channel only by looking at the bitrates; it only says that it finds some predetermined pattern in a given set of samples to test against.
  
  I don't care. Good cryptosystems should be absolutely impenetrable. Even the smallest flaw is like a crack in a dike. Maybe it will expand and blow the dike, maybe it won't. But it's simply UNACCEPTABLE to have cracks in the dike, and it's UNACCEPTABLE to have known weaknesses. Whether
  - It's not a flaw in the cryptosystem. (Score:2)
    
    by argent ( 18001 ) writes:
    
    It's a traffic monitoring problem.
    
    It doesn't matter how good the cryptosystem you use to call the Pizza Hut nearest the Pentagon is, if you just need to count the trucks leaving the Pizza Hut to tell when there's a burst of late night activity so you can tell the invasion is about to start.
  - Counter measure. (Score:2)
    
    by DrYak ( 748999 ) writes:
    
    I don't care. Good cryptosystems should be absolutely impenetrable. Even the smallest flaw is like a crack in a dike. Maybe it will expand and blow the dike, maybe it won't. But it's simply UNACCEPTABLE to have cracks in the dike, and it's UNACCEPTABLE to have known weaknesses.
    As I said, the countermeasure is bloody simple:
    Use longer packets.
    - It saves your bandwidth because of less overhead (that's why people are *already* doing it).
    - It has only a small impact upon latency.
    - With long enough packets, the differences between sounds average out and nothing can be eavesdropped based on phoneme compression ratios.
  - - Re: (Score:2)
      
      by pclminion ( 145572 ) writes:
      
      Ummm, no. Every encryption algorithm is guaranteed to be vulnerable to brute force - trying every possible key value.
      
      Okay, wise guy -- leaving aside brute force.
- Re: (Score:2)
  
  by drew ( 2081 ) writes:
  
  The whole thing would only be able to answer to some very simple questions like "did the words XYZ appear in the conversation ? or did ABC appear in the conversation ?" - with a rather bad success rate if those words are long and complex enough - which hardly makes it enough to obtain personal information or otherwise efficiently spy on someone.
  Yes but presumably if you know that a certain phrase appears in an encrypted conversation, and when it occurs, that could allow an attacker to use a known-plaintext
  - Fuzzy Logic vs. Binary Information (Score:2)
    
    by DrYak ( 748999 ) writes:
    
    Yes but presumably if you know that a certain phrase appears in an encrypted conversation, and when it occurs, that could allow an attacker to use a known-plaintext attack to discover the rest of your conversation.
    No you can't. Because you don't have the plain text. What you have is a probability that some piece of the encrypted transmission sounds somewhat similar to another piece of audio you have.
    
    You didn't actually get the clear binary data (the original wave form or the original non-crypted compressed stream).
    
    By comparing the two pieces you have matched together you can't infer the key that was used to encrypt into another, because they actually AREN'T the crypted version of the other. They AREN'T the same data
- - It's only rhythm based (Score:2)
    
    by DrYak ( 748999 ) writes:
    
    The eavesdropping is only based on the rhythm of speech, derived from the difference in phoneme compression.
    
    If we admit that the conditions are good enough for the trick to work (short packets, no background noise, no additional data interweaved with the voice stream) ...
    ahh, but that can be enough for overzealous eavesdroppers to come a knocking. Let's say the words and phrases "Commies" "Americans" and "Kill-em-all" were found in a convo. Depending on which side you are on, and who you are directing it at, you could be either extremely patriotic, or a "terrorist", care to guess which way our overlords will assume?
    Then you send to Gitmo a poor schmuck whose girlfriend happens to be named "Connie", who complained about "drinking a merry can of vodka" the night before and now needs "some tylenol to kill the pain".
    
    Using strong understatement, TFA were basica
ode-cay (Score:4, Funny)

by fahrbot-bot ( 874524 ) writes: on Friday June 13, 2008 @01:37PM (#23781661)

Ust-jay eak-spay in ode-cay.

Why change the packet size? (Score:3, Interesting)

by argent ( 18001 ) writes: <peter@noSpAm.slashdot.2006.taronga.com> on Friday June 13, 2008 @02:18PM (#23782351) Homepage Journal

Send fixed size packets, splitting longer syllables into more packets and packing multiple short syllables into single packets.
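
One way this suggestion could be sketched in Python is shown below. The 32-byte packet size, the 2-byte length prefix, and the names `pack_frames` and `unpack_frames` are all assumptions for illustration; frames are assumed to be non-empty, and packet loss is ignored entirely.

```python
import struct

PACKET_SIZE = 32  # hypothetical fixed payload size in bytes


def pack_frames(frames, packet_size=PACKET_SIZE):
    """Concatenate length-prefixed frames and cut the stream into equal-size packets."""
    stream = b"".join(struct.pack("!H", len(f)) + f for f in frames)
    # Zero-pad the tail so every packet has exactly packet_size bytes.
    stream += b"\x00" * (-len(stream) % packet_size)
    return [stream[i:i + packet_size] for i in range(0, len(stream), packet_size)]


def unpack_frames(packets):
    """Recover the original frames; a zero length prefix marks padding."""
    stream = b"".join(packets)
    frames, pos = [], 0
    while pos + 2 <= len(stream):
        (length,) = struct.unpack("!H", stream[pos:pos + 2])
        if length == 0:
            break
        frames.append(stream[pos + 2:pos + 2 + length])
        pos += 2 + length
    return frames


# Hypothetical variable-size codec frames.
frames = [b"a" * 10, b"b" * 45, b"c" * 7, b"d" * 60]
packets = pack_frames(frames)
print([len(p) for p in packets])
print(unpack_frames(packets) == frames)
```

This hides only the sizes. If packets are sent as soon as they fill up, their timing still follows the bit rate, so a fixed send schedule would probably be needed as well, and a lost packet would break the frame boundaries in this simple layout.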

- Re: (Score:3, Informative)
  
  by rantingkitten ( 938138 ) writes:
  
  Then you'd be losing the point of compression, in which case you could bypass the problem entirely since the attack relies on examining the compression. :)
  
  In fact, you might be making it worse at that point, since now it's not compressed and you're splitting things into more packets than you were before, which could compound any latency-related issues that may be present.
  - Re: (Score:2)
    
    by argent ( 18001 ) writes:
    
    I don't think you actually understood my suggestion, because I'm not suggesting anything that would reduce the effectiveness of compression, nor am I suggesting splitting things into more packets, on average.
    
    It would probably increase latency, but given the existing variation in latency I've seen streaming over the public internet I doubt you'd notice any increment from this.
Much like traffic analysis attacks on SSH (Score:2, Informative)

by runexe ( 24089 ) writes:

This is very similar to traffic analysis attacks on SSH (like this one [acm.org]) where packet sizes and inter-arrival times can indicate which keys you are typing.
Effective, practical counter-measures against good traffic analysis techniques are very difficult - especially if the attacker has enough traffic to work with (i.e. many conversations, many sessions, etc.).
- Re: (Score:3)
  
  by pclminion ( 145572 ) writes:
  
  Hahaha! Compressing encrypted data?! My sides are splitting!
  
  In case you can't figure it out: good encryption makes data look completely random. Do you know of any algorithms which compress PURELY RANDOM data? I sure as hell don't.
  - Re:Here's a thought (Score:5, Insightful)
    
    by blueg3 ( 192743 ) writes: on Friday June 13, 2008 @12:45PM (#23780531)
    
    There's a reason for that. With a good encryption mechanism, the ciphertext will have maximum entropy (one bit of entropy per bit of ciphertext). Random data also has maximum entropy.
    
    The point of compression is to take data that's expressed in a way that doesn't maximize entropy and reexpress it in a way that is higher-entropy (more information per bit). As such, maximum-entropy data is, by its nature, incompressible.
    
  - Re: (Score:2)
    
    by mhall119 ( 1035984 ) writes:
    
    It's funny, because if I take any random file, make 10 copies of it, encrypt all 10 with the same key, then compress them all together, I actually do get some amount of compression.
    
    Unless you plan on not duplicating sounds or sound sequences throughout your conversation, or using really big packets, chances are that you'll be repeating some of the same chunks of data, which will result in the same chunks of encrypted data, which would allow for compression. A generously lossy encoding of the original data
    - Re: (Score:2)
      
      by mstahl ( 701501 ) writes:
      
      Of course you get some compression. You probably get a file that's ever-so-slightly bigger than just one of the encrypted files. The encrypted files—assuming pretty effective encryption—have close to their maximum entropy (ideal encryption, like a well-chosen one-time-pad, would have entropy equivalent to the length of the message in bytes, making it indistinguishable from random data). Repeating them reduces the overall entropy of the message as with each identical packet no additional informat
      - Re: (Score:2)
        
        by mhall119 ( 1035984 ) writes:
        
        The two ends could build a map, where instead of transmitting a duplicate chunk you just send "It's the same as chunk #123". Of course that table would get big very fast, so you'd either need lots of memory or trim the index every so often, so that only the most commonly used chunks stay in memory.
        
        Seriously it's not that simple (Score:2)
        
        by mstahl ( 701501 ) writes:
        
        Totally not even how VoIP works. You're making the assumption that chunk #123 actually got there. There are no ACK packets in VoIP; if a packet is received out of sequence it's dropped. That's that "jitter" that happens when the line breaks up a bit every now and again. It's your packets not all taking the same route and getting to the destination device out of order.
        
        You have to remember: VoIP is a real-time protocol, and keeping up with real time is the paramount concern, not necessarily absolute accuracy.
        
        Re: (Score:2)
        
        by pclminion ( 145572 ) writes:
        
        The two ends could build a map, where instead of transmitting a duplicate chunk you just send "It's the same as chunk #123".
        
        Great -- now, if an attacker manages to decrypt "chunk #123" they now know the contents of ALL chunks labelled "chunk #123." I can't see how that's good.
    - Re: (Score:3, Insightful)
      
      by marcansoft ( 727665 ) writes:
      
      Even a one bit change in the input totally changes the output of data after encryption (with secure encryption algorithms anyway). So unless you feed a deterministic voice synthesizer to the VoIP compressor and adjust the timing to exactly match that of the packets, no, you aren't going to get any compressible chunks in the output data after encryption. At all. Besides, if the encryption is any good it'll use a random IV for every packet, because encrypting the same plaintext to the same ciphertext [wikipedia.org] itself c
  - Re: (Score:3, Funny)
    
    by gstoddart ( 321705 ) writes:
    
    In case you can't figure it out: good encryption makes data look completely random. Do you know of any algorithms which compress PURELY RANDOM data? I sure as hell don't.
    
    Sure, drop every other byte. It'll be half as big. ;-)
    
    Cheers
  - Voice codecs are lossy compression (Score:3, Informative)
    
    by billstewart ( 78916 ) writes:
    
    Voice codecs are lossy, so they'll happily compress your encryption data to something smaller, treating it as if it were audio samples from a human vocal tract. Unfortunately, you won't get all the bits back when you uncompress it, so decrypting the data isn't going to reconstruct anything resembling the original voice stream :-)
  - - Re: (Score:2)
      
      by gnick ( 1211984 ) writes:
      
      Obviously you compress first then encrypt the compressed data not the other way around.
      That's what they're doing now. That's why checking packet size is yielding exploitable information. Are you trying to be funny or do you really not grasp the topic here?
  - - Let's try better than "information theory FTW" (Score:3, Informative)
      
      by mstahl ( 701501 ) writes:
      
      The entropy for a perfectly random coin toss will always be one bit. The formula, if I'm remembering right, is -sum(p_i * log2(p_i)) where the p's are the probabilities of the various possible outcomes. In the case of a fair coin toss, these are both 0.5 and the result is 1, or 1 bit.
      
      If the stream you're compressing has patterns in it, it is purely by coincidence and overall, the average entropy of any number of these streams will turn out to be 1 if you sample enough of them. Furthermore, if you do have a
      - Re: (Score:2)
        
        by robo_mojo ( 997193 ) writes:
        
        Getting a smaller file out of that is more improbable than being attacked by a shark while being struck by lightning while you're holding a winning lottery ticket.
        That's how my brother died you insensitive clod.
        
        Re: (Score:2)
        
        by mstahl ( 701501 ) writes:
        
        Omg I didn't know... I am so sorry.... ;-P
      - Re: (Score:2)
        
        by ResidntGeek ( 772730 ) writes:
        
        Getting a smaller file out of that is more improbable than being attacked by a shark while being struck by lightning while you're holding a winning lottery ticket.
        I don't know about gzip, but for a perfect encryption algorithm the chance of getting a smaller file out of that should be 0.5.
        
        You might want to re-think that (Score:2)
        
        by mstahl ( 701501 ) writes:
        
        Seriously try what I said to try if you've got a linux/unix/mac system to try it on. You'll come up just slightly larger than the original file pretty much every single time.
        
        Considering the output of a coin toss to be a random variable, and the string of bits to be a randomly variant process of probability 0.5, the probability of any given pattern is 2^(-n) where n is the length of the pattern in bits. Square it to give the probability of that pattern repeating. In order to come up with a file that's small
- Re: (Score:2)
  
  by kbonin ( 58917 ) writes:
  
  The output of any decent encryption algorithm should be indistinguishable from random noise for any non-trivial size sample, which breaks compression. In practice, most compressions of encrypted data are slightly larger than the original, as they generally are comprised of a header stating "use this directly", then the original data.
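
This is easy to check with a small Python sketch. The assumptions: `os.urandom` stands in for the output of a decent cipher, zlib (DEFLATE) stands in for "most compressions", and `compressed_size` is a helper named here for illustration.

```python
import os
import zlib

def compressed_size(data: bytes) -> int:
    """Size of data after zlib (DEFLATE) compression at the highest level."""
    return len(zlib.compress(data, 9))

ciphertext_like = os.urandom(100_000)             # stand-in for good ciphertext
plaintext_like = b"the quick brown fox " * 5_000  # redundant input, same length

print(len(ciphertext_like), "->", compressed_size(ciphertext_like))
print(len(plaintext_like), "->", compressed_size(plaintext_like))
```

The random input should come out a few dozen bytes larger than it went in (container header, checksum and per-block overhead), while the redundant input of the same length shrinks dramatically.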
  - Re: (Score:2)
    
    by GuldKalle ( 1065310 ) writes:
    
    But how about compressing the data before encryption?
    - Re: (Score:2)
      
      by GuldKalle ( 1065310 ) writes:
      
      Oops, I didn't see the GP, he was modded down. Nevermind what I said
- Re:Here's a thought (Score:5, Insightful)
  
  by corsec67 ( 627446 ) writes: on Friday June 13, 2008 @12:39PM (#23780385) Homepage Journal
  
  Except that might not help here.
  The issue is that VoIP is an application that needs low latency. You have to send the data you have within a specific amount of time (0.1 seconds? something small), and can't wait for the buffer to fill before sending it, compressed, encrypted or not. Thus you get packets of different sizes.
  
  This isn't sending the whole conversation at once, this is a constant stream of data with specific requirements on latency.
  
  A solution would be to make each packet the same size by padding it with random data that the other side will discard. But that eliminates some of the benefit of compression.
  
  Maybe just use a fixed bit rate, as opposed to a VBR encoding?
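
The padding idea could look something like the following Python sketch. Everything named here is an assumption for illustration: the 64-byte `FRAME_SIZE`, the 2-byte length prefix, and the `pad_frame` / `unpad_frame` helpers are not taken from any real VoIP stack. The padding would have to be applied before encryption, so that the length prefix is hidden along with the audio.

```python
import os
import struct

FRAME_SIZE = 64  # assumed fixed payload size, chosen only for illustration

def pad_frame(frame: bytes, size: int = FRAME_SIZE) -> bytes:
    """Prefix the real length (2 bytes) and fill the rest with random bytes."""
    if len(frame) > size - 2:
        raise ValueError("frame too large for the fixed packet size")
    filler = os.urandom(size - 2 - len(frame))
    return struct.pack("!H", len(frame)) + frame + filler

def unpad_frame(packet: bytes) -> bytes:
    """Recover the original frame and discard the filler."""
    (length,) = struct.unpack("!H", packet[:2])
    return packet[2:2 + length]

# Hypothetical VBR frames of different sizes all leave at the same size.
frames = [b"a" * 12, b"b" * 31, b"c" * 18]
packets = [pad_frame(f) for f in frames]
print([len(p) for p in packets])
```

The cost is the one noted above: every packet is as big as the largest frame allowed, so most of the bandwidth saved by VBR is given back.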
  
  - Re: (Score:2, Insightful)
    
    by rml1997 ( 929311 ) writes:
    
    GSM already performs some pretty nifty compression involving regenerating missing packets. By enhancing this, it should be possible to just send the encrypted message text and a voice profile and have the receiving phone talk in your voice. I'll get right on it... Actually, part of the problem with the encryption could be the GSM (or other codec) compression itself. It looks for similar packets and tells the receiver to use a previous packet instead of sending the new one. This would obviously be a much sh
- Re: (Score:2)
  
  by your_mother_sews_soc ( 528221 ) writes:
  
  Some of the codecs, at least those that were developed a few years back when internet telephony first became legitimized, compress first for a good reason. They model their encoding on the "physiological" aspects of speech. The audio is analyzed for things like the noise component and formants, and simplified considerably. It is this analysis, I'm guessing, that is the "compression."
- Re: (Score:3, Insightful)
  
  by Anonymous Coward writes:
  
  What idiot modded this up? Encrypted data is (pretty much by definition) uncompressable. Encryption works by hiding information and removing redundancy. Compression works by identifying and removing redundancy. The two concepts simply CANNOT BE APPLIED IN THAT ORDER. Go back to school -- both the OP, and whatever moron was moderating.
  
  "Just stutter when you talk!" "Just play music in the background!" "Just switch languages in mid-sentence!" God help us. You must be the idiots who designed this protocol in
  - Re:Here's a thought (Score:4, Funny)
    
    by oodaloop ( 1229816 ) writes: on Friday June 13, 2008 @01:06PM (#23781005)
    
    Voice data just CAN'T be securely encrypted.
    Really? I have a Top Secret phone on my desk, and I can assure you it's pretty secure. (And no, it's not a shoe.)
    
    - Re: (Score:3, Interesting)
      
      by pclminion ( 145572 ) writes:
      
      I bet you that phone is not packet based, not compressed, and runs over a physically secure line. BIG fucking difference.
      - Re: (Score:2)
        
        by oodaloop ( 1229816 ) writes:
        
        Yup, you hit the nail on the head with that one, land lines all the way. Especially the ones on ship.
      - Re: (Score:2)
        
        by blueg3 ( 192743 ) writes:
        
        Even if all three were true, it's still voice data. If you recall, you claimed it's impossible to securely encrypt "voice data".
        
        The third, running over a physically secure line, is certainly not always the case. High-security phones encrypt data for transmission over radio waves for communication to points that landlines don't cover (ships, teams in the field).
  - Re:Here's a thought (Score:5, Interesting)
    
    by wolrahnaes ( 632574 ) writes: <seanNO@SPAMseanharlow.info> on Friday June 13, 2008 @01:06PM (#23781009) Homepage Journal
    
    Voice data just CAN'T be securely encrypted. That's because the spacetime information HAS to be there because we inherently interpret voice data according to these characteristics. Either you reveal this information in the stream, or you must increase the latency to the point that communication is impossible. If you want security, don't speak, WRITE, and use a cryptosystem that isn't a piece of shit.
    
    I disagree. The problem pointed at in this article can be easily solved on many SIP endpoints. I spend all day working on VoIP phones from vendors such as Linksys, Polycom, Aastra, Cisco, and if I really have to snom. Most of these have an option where it'll just send blank full bitrate audio rather than the usual "put silence here" instructions on G.711 calls. In fact that is the default behavior on some, since it makes the latency a bit more predictable to have a constant-rate data stream. If you want to use a VBR codec, of course this is a problem, but don't act like it's impossible or even hard to solve. If you are concerned enough to encrypt your conversations, use a CBR codec. 64 kbit/sec is not hard to free up.
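
As a back-of-the-envelope check on why a CBR codec closes this leak, here is a tiny Python sketch. The 20 ms packet interval is an assumption (a common setting, not something stated above), and `payload_bytes` is a helper named for illustration. With a constant bit rate, every packet carries the same number of payload bytes regardless of what was said.

```python
def payload_bytes(bitrate_bps: int, packet_ms: int) -> int:
    """Bytes of codec payload per packet for a constant-bit-rate stream."""
    return bitrate_bps * packet_ms // 8000  # bits/s * ms / (8 bits * 1000 ms)

# G.711 at 64 kbit/sec with an assumed 20 ms of audio per packet.
print(payload_bytes(64_000, 20))
```

Silence and speech then look identical on the wire, provided the endpoint really does send full-rate audio during silence as described.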
    
    - Re: (Score:2)
      
      by Quattro Vezina ( 714892 ) writes:
      
      I love that you put "and if I really have to" before you mentioned Snom :D
      
      Their phones are a pain in the arse to configure. You pretty much have to use their web interface. Not to mention the buttons require so much effort to push, you're pretty much guaranteed to typo anything you try to enter. Bleh.
      
      On the other hand, I love working with Cisco IP Phones and Avaya one-X 96xx phones. The Cisco IP Phone 7970 is particularly awesome.
      
      Polycom and Aastra phones look pretty sweet, but I've not had much of a chance
      - Re: (Score:2)
        
        by rantingkitten ( 938138 ) writes:
        
        haha, what? You like Cisco phones over the others? Ciscos are a pain and a half to deal with. Just getting SIP firmware loaded on them is an undertaking that never, ever goes according to Cisco's documentation, and only certain versions are upgradable to certain other versions, but not the ones that are in the documents, and so on.
        
        To configure the damn thing you either have to set up a tftp server with the config files, and then pray that it works (because it won't half the time), or enter SIP inform
        
        Re: (Score:2)
        
        by Quattro Vezina ( 714892 ) writes:
        
        To be fair, I've only ever dealt with Cisco phones in a Skinny environment. I've no experience with Cisco SIP (which I understand is a highly non-standard SIP implementation...).
        
        Never had any TFTP issues--usually CCM takes care of that, and we use a proxy on top of that (not Cisco Phone Proxy). I'm not a big fan of web interfaces, and I like Cisco's phone interface. That big screen is really helpful. Yeah, it'd probably suck to try and configure something small like a 7911 using the phone keypad, but luckil
        
        Re: (Score:2)
        
        by rantingkitten ( 938138 ) writes:
        
        Their SIP implementation is a horrible mishmash of nonstandard garbage. :) The other annoying thing, at least for someone like me who works in a SIP environment, is that Cisco's SIP stack is basically an afterthought. It'll function as a bare-bones SIP client but it won't do all the fancy crap people buy Skinny Cisco systems for, and then the users get all pissy, and then they whine at me, and then I die a little inside.
        
        I am not a huge fan of web interfaces (hello, commandline!) but they're leaps and bo
    - Re: (Score:2)
      
      by TooMuchToDo ( 882796 ) writes:
      
      I wish IP phones would, by default, stream the full audio instead of "put silence here". I understand you would be using more bandwidth, but the "comfort noise" to know the line is still up and running (when it's silent, you sometimes ask the person if they're still there) is a nice thing.
      - Re: (Score:2)
        
        by Quattro Vezina ( 714892 ) writes:
        
        Not only that, but media silence can be considered a media anomaly, and possible evidence of malicious use. If you're just sending signalling messages with no media, that can trip certain security features. You want media going through at all times, even if it's blank.
  - Re: (Score:2)
    
    by Quattro Vezina ( 714892 ) writes:
    
    Wow, I bet you're a lot of fun at parties.
- Re: (Score:2)
  
  by srmalloy ( 263556 ) writes:
  
  What I find amusing about this announcement is that the VOIP encryption methods are reported to be vulnerable to "bugging", while Homeland Security et al. is blatting on about how it is vital to national security that they be allowed to require all ISPs to install back doors in their VOIP setups to allow them to tap and monitor VOIP calls. I guess Homeland Security can't afford to pay for decent IT security people.
  - Re: (Score:2)
    
    by InlawBiker ( 1124825 ) writes:
    
    Yeah but there's a big difference between having to actually decrypt the packets, even with a crib and a known weakness in the encryption, and having the keys to the kingdom.
- - Re: (Score:2)
    
    by Vellmont ( 569020 ) writes:
    
    It sounds like they are using whatever ciphers in ECB mode.
    
    No. This type of attack relies on leaking information through a variable bitrate sound compression algorithm. The problem is there's more information in certain phonemes (individual speech parts) than in others, so they compress at different rates. That means you can look at the amount of information going across the wire as a function of time and guess at some of the phonemes. If there's enough of them, you can guess at some of the words. The
- Re: (Score:2)
  
  by Bob of Dole ( 453013 ) writes:
  
  "These guys have a 75% chance (based on statistical analysis of voip packet sizes) of talking about terrorism! Better bug them for real, so the next time we can listen in."
