DNA Encryption 102
brn writes "There is this BBC article that talks about hiding information inside the DNA. Very interesting and well worth a read, apparently information hidden this way is virtually undetectable.
" It's the espionage agencies that first got interested, but the notion of "watermarking" is another that is brought up-and you thought PIII ids were bad.
Re:DNA Storage Capacity (Score:1)
Incidentally, it might also be possible to embed information through other means:
When creating a DNA fingerprint, enzymes are used to "slice" a strand of DNA whenever a certain sequence occurs. The result is a whole load of smaller DNA stands, each with different lengths-- the exact number of strands and their lengths is unique to each person/animal/whatever. A small electric current is then used to pull the miniature stands across a special material. The smaller strands travel more quickly across this material than the large ones, causing a unique fingerprint that looks something like:
|| ||| | | | || ||
If an organization (say, maybe, the NSA) were to sequence a strand that were to be cut at certain points by the right enzymes, they could (in theory) create a strand of DNA whose fingerprint represents a bar code that holds data.
Any other ideas for data hiding in DNA?
I like the BBC (Score:1)
I think European media in general is so much better because they are sponsored by the state. The European states have checks and balances to make sure the media is not a propaganda tool. Our American media does not have the same safeguards to prevent commercial interests from twisting the news.
Re:Prety scary stuff.. (Score:1)
In some sense, this brings new meaning to the old joke about BTP (bipedal transfer protocol)...
Tampering with human DNA, though, isn't as bad as it sounds. The demonstration in the article doesn't involve any tampering with DNA in an organism -- a letter that you write and send (yes, some people still do
As for humans: it's not like that one Far Side where a kid is sitting on an airliner, and his armrest contains a "Wings Stay On/Wings Fall Off" switch. The human body would interpret a little random junk in the occasional DNA strand as just that -- random junk. It won't kill anyone, make them grow a third arm, or even turn them into Steve Gutenberg (halleluia!) Keep in mind: (1) They're only affecting one out of every 30 million (something like that) DNA strands, and (2) they're not going to give you a pill that transforms every DNA strand in your body.
I'd imagine that, if a human were to be used as a carrier, they'd doctor up their DNA appropriately and then inject a small amount just under the skin (like a tuberculosis test). The solution would stay for a few days before dissipating throughout the body, so there's no effect on the "packet" (if you will), and it's easy for the intended reciever to extract and decode the message (whereas an enemy would have no idea where to begin, and couldn't do anything nasty to find out before the solution dissipates). So, even though it sounds a bit scary, it's not so bad. (How much worse is it than high-strength encryption?)
BTW - I'm not sure about capacity. Something like 90% of our raw DNA is thrown away and not used to make proteins anyway: if the data were placed in those portions, we could be talking about data capacity in the tens of thousands of base pairs. A base pair translates to two bits, (assuming you have a scheme to tell which backbone is the one you want), so if we use 10,000 base pairs, we can encode 20,000 bits (or 2.5 KB) of data.
Exactly what we need (Score:1)
hmm (Score:1)
DNA Storage Capacity (Score:1)
That requires a central repository (Score:1)
People don't crash or erase their own DNA. Not to mention the space that would be required to store that volume of information in a database.
Re:Nothing earth-shattering here (Score:1)
watermark your biological cpu (Score:1)
Who am I?
Why am here?
Where is the chocolate?
Yes....but... (Score:1)
Who am I?
Why am here?
Where is the chocolate?
Johnny Mnemonic?! (Score:1)
What's next?
Re:Scary (Score:1)
It could be placed in the interons. With sufficient care it would do no biological harm there.
Risks? Sure, plenty of them. But since when have intelligence agencies been concerned about risk, especially when it's someone else at risk.
More specifically, the risk is that the interon sequence may end up active if an unlucky data stream is coded there.
I'm not too worried about that since there's not much (if any) advantage to embedding the coded sequences into an organism's genetics vs. simply placing the DNA on something. For example, simply spraying the coded DNA on a courrier (possably an unwitting courrier) and collecting a sample of that courrier's dead skin later.
Example, sequence your message. Suspend the encoded DNA in a liquid carrier in an atomizer. Go to the airport and spray a suitable tourist. Tourist goes and has a nice vacation. Agents at the destination collect DNA from bedsheets at the hotel. Tourist is none the wiser, and not in any danger unless enemy agency finds out they're carrying a DNA code sequence.
Re:Scary (Score:1)
The genetic material you already carry is sufficient to uniquely identify you. There's no need to add a watermark.
Re:Playing with DNA... (Score:1)
http://strangehaze.freeservers.com/index.html
"The number of suckers born each minute doubles every 18 months."
-jafac's law
Re:Yes....but... (Score:1)
One of the best variations on this (IMHO) was The Andromeda Strain, where aliens had sent out millions of microscopic space "capsules", with a genetically engineered germ living on them, designed to live off the material of the "capsule", and radiation in space. It would be cheaper to mass produce billioins of these capsules with the germ, and send them out into space, than to attempt to broadcast radio waves into space to contact other civilizations. Since these capsules would normally burn-up on entry to an atmosphere, a non-spacefaring race would not be able to receive this "signal", but in the story, Earth sent up satellites to scoop micrometiorites out of orbital space for research. The craft that intercepted the capsule crashed in Piedmont, Arizona, and the germ mutated, and destroyed the entire population. The USAF then took the capsule to a secret underground lab where they picked it apart and discoverd the secret. (but not before some tense moments when the thing mutated again, and started eating all the rubber gaskets, and leaked into the lab, and the automatic nuclear self-destruct was triggered - just as the scientists figured out that this thing thrived on gamma radiation.
"The number of suckers born each minute doubles every 18 months."
-jafac's law
Re:Scary (Score:1)
"The number of suckers born each minute doubles every 18 months."
-jafac's law
SF (Score:1)
Or get this, for a way to pass a message undetected:
Incorporate a secret message into the DNA of a highly contagious (but non-fatal) flu virus, infect yourself. Spread the disease around by copughing in the subway, etc.. Disease spreads around the world, and is picked up by destination lab, sequenced, decoded and message is delivered. It's a tradeoff; slow delivery time vs. a untraceable delivery path (i.e. no radio or message handoff).
Yes, you can patent it (Score:1)
No, the technique itself can't be patented. However, bizarre though it sounds (at least, it does to me), individual DNA sequences can be (and routinely are) patented.
Quality news and government support (Score:1)
However, privitization of the BBC is always talked about. These days, people in the UK, like here in the US, have access to satellite/cable television. Since there exists a good market for quality programming, there will be a channel with that programming. Competition is hurting the BBC, especially with regards to news. The news.bbc.co.uk web site is outstanding, and it is a result of severe competition in the news industry...huge investments were made to make it possible.
The thing is, the British, in general, have a higher demand for good quality news than Americans do. I need not remind people that the magazine "The Economist" is outstanding, and is not state sponsored.
Quality local news happens in the US, and it is the result of markets. A report is available which shows the percentage of households watching news in metropolitan areas. Columbus, Ohio, my current home, is rather low on the list, despite the size of its metro area population, consequently, the news isn't all that great. On the other hand, the other "C" cities--Cleveland and Cincinnati, are both in the top 10, and the news programming is more diverse and higher quality. Good broadcast jouralists strive to be in the higher rated cities, and it is no coincidence that Cincinnati has been a point on the way for some to become national anchors (including Jerry Springer, which is another story.)
My point is, Americans seem not to demand higher quality news (and it is not like it isn't available...for instance, the Macneil News Hour on PBS is very good, but underwatched. NPR is excellent, and is on par with BBC radio.)
I am not sure about the checks and balances statement. The BBC is more curtailed in coverning British government than American news organizations...because of tradition and law.
Re:I like the BBC (Score:1)
> If you have cable TV, see if you can get BBC News 24.
Why bother getting up to switch the TV on? Use their their " BBC News On-Air [bbc.co.uk]" Internet service. (You can even get Radio 4's wonderful "Today" news show in Real Audio format [bbc.co.uk]!)
Regards, Ralph.
Re:Scary (Score:1)
An expert clarification would be appreciated.
Re:Yes....but... (Score:1)
I am not a biologist, but I imagine that the standard cancer-destruction mechanisms would kill that cell.
What appeals to me as a cryptogeek is the notion of finding or creating two separate genomes, which when recombined (or mapped and XOR'ed) revealed a message.
You could do quite a good short story on this, depending upon the means of recombination. Human beings probably would not make very good one-time-pads, but I'm sure that something could be thrashed out.
"like an infinite number of monkeys, there exist two creatures whose DNA, then XORed together (after using the appropriate decoding scheme to generate a bitstring)" will generate {the complete works of shakespeare, the meaning of life, the text of (insert your favourite holy book), god's last message to his creation}..."
lotta potential there.
- alec
Re:Yes....but... (Score:1)
Well, recombination can occur without any specific mechanism -- just at a low frequency. There's nothing stopping it. Horizontal transfer is emerging as one of the more important causes of molecular evolution, in fact
There is one advantage... (Score:1)
Re:No need for watermarking people (Score:1)
Re:Risky. Data just might code a deadly gene. (Score:1)
Re:Scary (Score:1)
Re:This is not encryption (Score:1)
The proposed scheme is really steganography, but if you encoded a number of messages into DNA, you would just need to attach MACs to the real messages and fake MACs to the fake messages, adding significant security.
I guess the marker that lets you pick out the signal from the mass of human DNA is rather like computing the MAC and finding that it matches the included MAC of a message.
Of course, you could just encrypt the signal messages instead - chaffing and winnowing is really intended for when you want to 'encrypt' without using any encryption, so to speak (only authentication is used to achieve confidentiality here.) However chaffing and winnowing helps by making the actual signal messages very hard to detect.
This is not encryption (Score:1)
Hey everyone, this is not encryption. Why? The message itself is never obfuscated or transformed in any way, instead it is merely hidden among a lot of other DNA, on a very small dot that wouldn't be noticeable under brief scrutiny. The only similarity between this and encryption is the existence of a secret (the base-pair sequence of the DNA strand "caps") between two parties.
There is a term for this type of message-hiding, but it's eluding me (someone around here knows it, I'm sure...). Other popular examples include:
Again, this is different from encryption because the important message is hidden in an otherwise benign and useless message. So the hard part of breaking the system is knowing whether a message is hiding another message. Once you know that, breaking the scheme is pretty trivial - there's no computational complexity behind the system.
Re:Risky. Data just might code a deadly gene. (Score:1)
probably look like the bar from the "Star Wars"
where "to be decrypted" people would hang out and
have some innocent, harmless fun, like killing each other. That would establish one to one
correspondence between killing the messenger and
cancelling the message.
Kelvin's 10 Year Old SETI Prediction (Score:1)
Not great fiction, but it is interesting to read it ten years later in light of this story and the SETI at Home news.
Re:Scary (Score:1)
(All _my_ slaves have the same genetic watermark.
--
Not efficient yet. (Score:1)
This is amazing that Nature published this paper.
The idea is on a surface. For years one letter code is used to define aminoacids and stop signals and people have fun putting their names in DNA for a long time.
You can do it too! (Score:1)
[clearlight.com]
Encode your own secret messages in DNA code. Now all you need is a synthesis machine to create your encoded ladder, and someone to give it to whose got some good biology knowhow and a gene sequencer machine. (I'm sure there are plenty of people in column A, but not column B)
Interesting but unlikely to be used much (Score:1)
a) modification/creation of desired DNA sequences
b) DNA analysis
would make this technique prohibitive. And, in terms of security, how many people would you trust in these labs anyway? (A one-man operation would probably have trouble handling many samples
And, as mentioned elsewhere, this technology is nowhere near the point where they can insert coded messages into your own genome! They could, however, put the information into self-replicating circles of DNA (plasmids) that can be propagated in microorganisms
(Before I began my developing career I was a molecular biologist, and stories like this always catch my eye
YS
Re:This is not encryption (Score:1)
Information Inside (Score:1)
The cool part about it is no one expects a message. If you send someone a CD, papers, e-mail, anything than it suspect. But if it's in DNA, inside someone, they pop in, leave a drop of blood or a skin scrape, voila message delivered.
Or how about using someone (something? . . . dogs, cats) as an information repository. They walk around with all this data encoded in them and who would know? *They* might not even know.
One thing that's pretty cool is you can layer information in DNA sequences by making a one base pair frameshift. There are viruses that use this technique to pack as much genetic information into there small head capsules as they can.
It looks like this:
ACG ACG ACG ACG
CGA CGA CGA CGA
GAC GAC GAC GAC
Of course, that's if your using 3 base pair codons, it might be easier to encode using straight base pairs. Which would allow for more storage? You might have to use natural codons though, in order for them to exist/be replicated in your snazzy new BioDrive (TM).
. . . Act now!! Store all your pr0n, ROMs and MP3s in the privacy of you own DNA, never leave home without them again!!
Re:DNA Storage Capacity (Score:1)
This is pretty far fetched, but how about storing information in the three-dimensional aspects of proteins. We don't have the means of calculating the shape of a protein from its amino acid sequence yet, but if we did
How about storing information in the number/position of cell receptors, then you could do some kind of immunoassay to search for certain limited messages.
Your idea is definitely more feasible. I had actually thought of writing a science fiction short story in which espionage was committed in much the same way as described in the article, but what held me up was where would you store the DNA? You have encoded your message, the people at the other end have their gels ready, you put the message-DNA in the carrier
Re:Yes....but... (Score:1)
Re:I like the BBC (Score:1)
Scary (Score:1)
Normaly I'm luckey if people find another way of data-storage. But this expectation realy scares me.
Re:Prety scary stuff.. (Score:1)
The human DNA is removed a person and mixed with the coded DNA just to present background nooise. The DNA isn't biologically active and in fact would get chewed up by polymerases within biological organisms.
Re:Yes....but... (Score:1)
That's not actually true. Only retroviruses actually insert stuff into the host's geonome. Something like influenza doesn't do this, it just takes over the cellular machinery.
Even with retroviruses it's a bit difficult to get the virus to inject itself into the geonome. For example, the lambda phage lyses the host E. coli cell most of the time and integrates itself rarely. By fiddling around with the lab conditions, you can encouragely integration but its still difficult. The preferred vectors are stuff like an F element(E. Coli) or artificial chromosomes(S. Cerevisiae) due to size considerations.
For humans, you would need to limit the size of the message due to the size limitations of the vector. Plus, with current technology you would have to wait a generation or two to get a person who had the genetic code in every cell.
Re:Scary (Score:1)
Interesting, maybe not as scary as some people think but a little disturbing...
Re:BBC licence fee (Score:1)
In Germany we are having the same system of a monthly tv fee, collected by the GEZ (Gebuehreneinzugszentrale). You are expected to pay that fee, and they sometimes send people to households who don't, trying to find out whether you own a tv by asking trick questions or something like that...
Tiny messages (Score:1)
Re:DNA Storage Capacity - practical aspects (Score:1)
But creating the message - how could that be done quickly? Doesn't it take some time to create a specific DNA sequence? I can't imagine we can do that quickly outside a lab.
If this holds, the best use I can imagine is in "fingerprinting" items. Imagine them used like "taggants" in explosives. First you create a a whole library of different sequences (a precomputed alphabet) and then assign them to specific items or specific lots of items you want to track. One hundred dollar bills, for instance. The detection lab kit would then become a standard part of the counterfeit detector's toolkit.
Anyway, my point really is that this new thing is really useful in the signature scenario than where you want to send an arbitrary message.
Nothing earth-shattering here (Score:2)
So, while this whole thing sounds cool, cryptographically this is nothing new and has no particular advantages (all the problems of dealing with secret symmetric keys, etc.). No wonder the NSA was not interested.
Kaa
No need for watermarking people (Score:2)
Consider this: to watermark a person you need to access his DNA and insert a watermark, then access it again to read it. Well, if during the first time you just read his DNA (which is unique) and the second time read it again and compare to a database -- voila! exactly the same result without all the unpleasantness of invasive DNA surgery. Why do you think reasonable people get worried when the police decides to take DNA samples from everybody arrested? (I don't know if the law/regulation to that effect passed in New York, but was definitely considered).
Kaa
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Recent SF story with this idea (Score:1)
It's a great time to be alive :) (Score:1)
Re:Exactly what we need (Score:1)
Re:Nothing earth-shattering here (Score:1)
bioinformatics and cryptography (Score:1)
DNA statistical models could be used in more clever ways than in the BBC article to encode messages that looked like other DNA. This is not difficult and could be done today.
Even better would be to exploit the biological machinery by creating a message in a synthetic gene that is expressed in the presence of some regulatory element, perhaps a synthetic small molecule.
Re:Nothing earth-shattering here (Score:1)
Re:Another article on same site, even better... (Score:1)
Re:Scary (Score:1)
So actually, there's quite a lot of unused space. The kicker is, there's not a limit on how big the genome can be. As long as the cellular machinery can handle it, there's no reason that you couldn't just insert sequences of however long you want into the DNA in the bits that get excised out or in stretches of junk DNA. For a change in DNA to affect the organism, you need a lot of other supporting DNA to ensure that the gene gets turned on- otherwise, it's just like a big string of garbage characters in a book- you just skip over it, it doesn't make reading the book any different.
Now, what's interesting is how compact viral DNA is. Viral DNA is so efficient (since so much information has to be packed into a tiny space) both "sides" of the DNA can be read to produce separate proteins. Normally, only one strand, the coding strand of DNA can be read- the non coding strand actually functions as kind of a check for errors. Viruses dont' care that much about accuracy, I guess... Here's what it looks like:
--Gene B
3'TGACTAGTACGTCATGAAGTCAGAGGGTC5'
5'ACTGATCATGCAGTACTTCAGTCTCCCAG3'
Gene A-->
Fascinating, huh?
correction of above post (Score:1)
--Gene B
3'TGACTAGTACGTCATGAAGTCAGAGGGTC5'
5'ACTGATCATGCAGTACTTCAGTCTCCCAG3'
Gene A-->
jesus I suck (Score:1)
I'm assuming my inadequate HTML knowledge is screwing up my arrows, so I can't draw the positions correctly.
That Gene B thing should be over to the right, with an arrow pointing to the left.
Re:DNA Storage Capacity (Score:3)
That works out to 3 billion (3x10^9) base pairs.
Each base pair is made up of one or two pairs of units; if we distinguish sense from antisense strands in the DNA, this means we can encode four values (or two bits) of information per base pair, working out to 6x10^9 bits, or a little more than 5.5 GB (note that since only one side of a strand is used in life, this means you can spell "human" in the genetic sense in less than 2.5 Gbits).
As somebody else pointed out, this scheme is not encryption, but a method of information smuggling. That is, instead of using it to send secure messages over an insecure channel, the technique is supposed to hide the fact that a message even exists.
Although virtually undetectable by casual inspection, it's hard to think of a practical application of this technology because of the effort and equipment needed to extract the message. Sending detailed plans for sabotage to your terrorist cell this way seems impractical, unless your terrorists are biologists with lots of time on their hands.
Maybe not encryption, but more than steganography (Score:1)
but without knowing the PCR primer "password",
it's unlikely that you'd be able to amplify
enough DNA to read its sequence.
So, no: The DNA is not exactly sitting there
"in the clear" and able to be read. Not only do you need to know where it is, you also need to know a little bit about what it is.
But, yeah, technically you're right: The message isn't encrypted. Encoded, certainly, but not encrypted. Maybe if you define "encrypt" as "hide" but I digress.
Re:bioinformatics and cryptography (Score:1)
How about a coding sequence for an autoendopeptidase that post-translationally cuts itself up into fragments whose noncovalent interactions reveal the plaintext?
Or maybe I should just stop sniffing glue.
Oh, calm down everybody (Score:2)
bears comment except for the hysteria it seems
to be inducing.
First of all, the technology described here --
synthesis of DNA oligomers and use of the polymerase chain reaction (PCR) to detect
them -- is old hat. Every molecular biology lab in the world has been using these techniques
routinely for over 15 years.
Second of all, coding DNA into something
biologically "meaningful" is orders of
magnitude more difficult than spelling out a message.
Finally, you all already can be uniquely identified by your DNA. Nobody needs to implant an ID in you, Big Brother can just take a drop of blood from you and store your DNA fingerprint in a database if he thinks you're worth looking after.
In closing, I would urge everyone not to look at biology (or even bioinformatics) through a hacker's eyes -- it doesn't work; the two fields don't map to each other well.
Oh, PS: If you want to see something genuinely cool being done with DNA by a hacker, check out Adleman's (the "A" in RSA) tackling the travelling salesman problem with DNA oligomers. (Sorry I don't have a cite handy.)
Re:This is not encryption (Score:1)
Stick this word in your favourite search engine
and see what comes up.
Someone just sent me a message-hiding-in-a-gif
program yesterday. Apparently it can
encode a whole website inside a GIF picture.
We will need this in Australia because of the new
Internet censorship law.
Cheers,
Alan Kennington
Mutations (Score:1)
excuse me while i slip this encrypted DNA in you.. (Score:1)
this DNA watermark story has all those hallmarks. to the point it relies on alphabet conversion to a piece of DNA spelling out " the secret message in the middle, plus short marker sequences at each end" then these synthetically engineered strands are just "slipped into a normal piece of human DNA".
gee i hope their data entry staff have high accuracy levels! no room for error...and what? do you meet these encryption chicks in bars, they lure you back to their place to privately encode you? or boys for us girls?
makes james bond and bond girls look postively small town. but probably far more likely.
Re:This is not encryption (Score:1)
As to whether this article describes encryption, it could be interpreted either way. I've looked up various definitions of encryption, and it looks like you could end up in a semantic mess of codes and symbols if you think about it too hard (after all, written language is just another code). It comes down to whether the intentions of the encoder count. If they do, it's not encryption, since encoding the message in DNA appears to be merely an expedient. If they don't, it is encryption, since this transformation of letters into DNA bases has the unintentional effect of making the message unreadable to a stranger (who somehow happens to bypass the steganography).
Re:Scary (Score:1)
So, no cancer here.
DNA...bleh (Score:1)
Re:Mutations (Score:1)
Another Application (Score:2)
Re:Yes....but... (Score:1)
viruses do nothing else - they paste their payload DNA into the infected cell.
You only have to put the modified/watermarked DNA into a harmless, but widespread virus (influenza, Bio-Weapon engineers have dozens of carriers at their hands) and there you go...
The only question is: why should any1 do such a thing ? your genetic code is already unique...
--
Jor
Re:Nothing earth-shattering here (Score:1)
As far as the watermarking goes, that is not new technology at all. I remember reading the same stuff many years ago when researchers first started cloning mice. The scary question to me is: how do we know people, plants, animals, etc., aren't already being watermarked?
Re:No need for watermarking people (Score:1)
Everybody considered a "hacker", "gay", "lesbian", "christian", or whatever could be marked with the appropriate permanent brand.
What if a variant of herpes (or similar genetic disease) was created which would alter the recipient's DNA to carry this watermark?
Re:DNA...bleh (Score:1)
Why bother when I can quietly slip as much data as I want into the noise of a compressed image file and hide it in the massive flow of pornography on the usenet alt.binaries. conferences?
Something else to worry about (Score:1)
Too much. I have to go lie down now.
Some clarification (Score:1)
1) DNA encryption is really no different from any other nonobvious code-based encryption. You must still transmit all the information to decode the message in a secure manner. The method referred to in the BBC article operates in the same manner as the DNA-o-gram generator (try it out), except that the code is generated randomly. If someone were to intercept the DNA-to-text code and the information necessary to isolate the coding DNA from the "noise" DNAs, they could crack the code.
2) This has nothing to do with human DNA. There is DNA in everything from the fungus between your toes to houseplants. The DNA used for encryption would probably be chemically synthesized.
3) A good sequencing run can read 1000 bases of DNA. Because three bases correspond to one english letter, you could get ~300 letters at a time.
4) You would be better off adding fake gene sequences to a gene database and giving someone the keys to decrypt it. That saves all the hassle of synthesizing and sequencing the DNA.
A description of Adelman's solution to the travelling salesman problem can be found at http://dna2z.com/dnacpu [dna2z.com]. (whuppy)
Prety scary stuff.. (Score:1)
And in the wrong hands it could become a very dangerous (+efficient) method of info transfering.
BTW - it doesnt say how much KB you can "stuff" into one person's DNA sequence...ne idea ne1?
Re:Exactly what we need (Score:1)
Come to think of it, perhaps Bill's figured this one out already...
Terrabytes of Information Can Be Stored In DNA (Score:1)
The only problem with this is that the government(s) will not allow this to happen, considering they like to keep knowledge imprisoned away from people - the goverment hates it when people get access to huge amounts of knowledge. That would explain why all our education systems suck - hardcore. Death to the United States, I say.