Encryption Security

Making The Case That Voynich Is A Hoax 382

DeadVulcan writes "The Voynich Manuscript, a mysterious book of uncertain age, is widely believed to be written either in an unknown language or a long-lost encryption scheme. Nature reports that computer scientist Gordon Rugg has demonstrated that it's possible to generate a text like the Voynich manuscript -- containing language-like regularities, despite being potentially meaningless -- using cryptographic techniques of the time. This lends some support to those who claim that the book is a hoax."
  • My 2 cents (Score:3, Funny)

    by SYFer ( 617415 ) <syfer AT syfer DOT net> on Wednesday December 31, 2003 @02:24AM (#7843419) Homepage
    01001001011000110110100000100000011001000110010101 10111001101011011001010010000001110011011001010110 10010110111000100000011101100110010101110010011011 01011101010111010001101100011010010110001101101000 00100000011001010110100101101110011001010110111000 10000001010100011011110111000001100110001000000110 00010110111000100000010100110110001101101000011001 0101101001110111110110010100101110

    • Re:My 2 cents (Score:5, Informative)

      by Anonymous Coward on Wednesday December 31, 2003 @02:41AM (#7843482)
      Translation from binary:
      Ich denke sein vermutlich einen

      Translation from German from binary:
      I probably think its one
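For anyone who wants to check the "translation from binary" step themselves, here is a minimal sketch. It assumes the post is plain 8-bit ASCII with spaces inserted only for line wrapping; the function name `decode_binary` is made up for this example.

```python
def decode_binary(bits: str) -> str:
    """Decode a string of 0s and 1s as 8-bit ASCII, ignoring whitespace."""
    cleaned = "".join(bits.split())
    # Drop any trailing bits that do not fill a whole byte.
    usable = len(cleaned) - len(cleaned) % 8
    return "".join(chr(int(cleaned[i:i + 8], 2)) for i in range(0, usable, 8))


# The first three bytes of the parent post.
print(decode_binary("01001001 01100011 01101000"))
```

Pasting the whole parent post into `decode_binary` should work the same way, since whitespace is stripped before the bits are grouped into bytes.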
  • by Saeed al-Sahaf ( 665390 ) on Wednesday December 31, 2003 @02:28AM (#7843436) Homepage
    Somebody is laughing a lot.. Remember way back the Salamander Papers?
    • by Timex ( 11710 )
      I'd be surprised to find many that even KNOW that the Salamander Papers are related to the Church of Jesus Christ of Latter Day Saints...

      (I know this 'cause I was a member, once.)
  • Ershlap? (Score:4, Funny)

    by paul248 ( 536459 ) on Wednesday December 31, 2003 @02:30AM (#7843441) Homepage
    Abdook artelly oppetrom uplocty?! Astenboorsley... af arcoolodople!

    Bli, Fal.
  • by User 956 ( 568564 ) on Wednesday December 31, 2003 @02:31AM (#7843442) Homepage
    Gordon Rugg has demonstrated that it's possible to generate a text like the Voynich manuscript -- containing language-like regularities, despite being potentially meaningless

    That's funny. I thought Darl McBride had already proven that with all those open letters he's written.

    Mod me down, hippies!
  • Library of Babel (Score:5, Interesting)

    by Mrs. Grundy ( 680212 ) on Wednesday December 31, 2003 @02:31AM (#7843446) Homepage
    This reminds me of a passage from Jorge Luis Borges' Library of Babel. In fact a lot reminds me of that story these days.

    Five hundred years ago, the chief of an upper hexagon (2) came upon a book as confusing as the others, but which had nearly two pages of homogeneous lines. He showed his find to a wandering decoder who told him the lines were written in Portuguese; others said they were Yiddish. Within a century, the language was established: a Samoyedic Lithuanian dialect of Guarani, with classical Arabian inflections. The content was also deciphered: some notions of combinative analysis, illustrated with examples of variations with unlimited repetition.
  • Missing the fact.... (Score:4, Interesting)

    by Zibi ( 528341 ) <.moc.avonyrak. .ta. .nylla.> on Wednesday December 31, 2003 @02:34AM (#7843459) Homepage
    I think this report is missing the fact that if someone really wanted to make a hoax book, they could simply translate any other book (even the bible) into a made up language. If it's an obscure book the likelihood that anyone would ever figure it out is slim.

    • Wanna know something, I thought of the same thing. Out of boredom one day in like 2000 I wrote ghost in the shell [antioffline.com], and I was going to rewrite it to make it more informative. Only this time I set out to do something sort of like a caesar cipher based scheme only it would've been a forward/reverse scheme.

      Using a preselected number (ala rot13) I took a letter and manually (no pc) set the number 16 to my base. So the letter a was now p. The next shift would have been reversed 15 spaces so if b was the next let

    • by Anonymous Coward on Wednesday December 31, 2003 @03:08AM (#7843568)
      actually very few people could write on any known topic (such as a topic for which we have a contemporaneous book in a known language) in a consistent but made-up language without being easily decipherable. We couldn't figure out ancient Egyptian because we had no idea what topic they were even talking about.... ALL it took to figure out ancient Egyptian was being told (in ancient Greek, which we knew) what topic a couple of sentences of Egyptian were talking about...we had no idea, having almost NO idea what various examples of the writing could POSSIBLY have stood for.
    • by Professor D ( 680160 ) on Wednesday December 31, 2003 @03:36AM (#7843629)
      But, a volume of self consistent language (even a made up one) of over a hundred pages of text with accompanying pictures should fall to statistical and linguistic analysis.

      Champollion cracked the Rosetta stone with much much less.

      The 'true' examples of lost written languages/cyphers (do a google search) are mysteries because there exist few examples of brief length usually bereft of context (of grammar, history, linguistic evolution etc.).

      The sheer volume of the Voynich manuscript, plus its origin in relatively modern Europe is what makes it so interesting to amateur cryptographers.

      The Nature Paper is too brief to know how good Rugg's analysis is (and the Cryptologia site has been slashdotted), but if it holds up it is an interesting result, even if it is a conclusion that many "very smart cryptographers"(TM) have suspected for a long time

      • by Black Parrot ( 19622 ) on Wednesday December 31, 2003 @05:55AM (#7843942)


        > But, a volume of self consistent language (even a made up one) of over a hundred pages of text with accompanying pictures should fall to statistical and linguistic analysis.

        I doubt it. How many possible mappings are there between strings of characters and meanings? And even with plausible interpretations of the pictures (e.g., a herbarium), the number of things that might be said in that context is for all purposes unbounded:

        xyz =?= "this soothes the throbbing toe"

        xyz =?= "this is very poisonous"
        xyz =?= "this grows only in Ys"
        xyz =?= "I learned this from my grandmother" ...
        Surely it will never be deciphered if it is in an unknown language.

        > Champollion cracked the Rosetta stone with much much less.

        Actually, he had the benefit of a parallel text.

        In the absence of a parallel text, this will only be deciphered the way Linear B was: after a rigorous analysis of the patterns in the text, and a much tighter context (essentially lists of <picture,name,number> tuples), it was noticed that some very obvious translations ("man" and "woman", or such) fit the inflectional pattern of a language historically spoken in the region where the texts were found, and that simple mapping could be extended to other obvious <picture,name> pairs without introducing inconsistencies.

        I suppose it's possible that something similar could be done with the manuscript, but IMO only if there are some clearly labeled images that give tight enough a context to guess the specific word being used. And then some luck, because somebody has to recognize some language-specific patterns (such as the Greek masculine/feminine inflectional suffixes). And of course, more luck in what language it happens to be: Linear B might never have been deciphered if Greek didn't use gender-based patterns in its noun declensions.

        If it happens to be written in some unknown language, IMO it will never be deciphered.

        • No you underestimate the inherent limits of a structured language. The reasons you list are the reasons it might not be deciphered if it was a cryptographic language. If it is a natural language it would still fail.

          Imagine attacking common words and phrases. If you read an English text, you would quickly notice words like "the", "a", and "and", and if it was a letter, stuff like "you" and "me". Once you have a large set of common words and phrases you look at how they are placed and structured, and start making quali
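The "attack the common words first" idea can be sketched in a few lines. This is only an illustration on ordinary English text; the function name `most_common_words` and the sample sentence are made up here, and a real attempt on an unknown script would first need a transcription into some agreed set of symbols.

```python
import re
from collections import Counter


def most_common_words(text: str, n: int = 5) -> list[tuple[str, int]]:
    """Return the n most frequent words in text, with their counts."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words).most_common(n)


sample = "the cat and the dog and the bird"
print(most_common_words(sample, 2))
```

The short, very frequent tokens that float to the top are the candidates for function words like "the" and "and".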
          • Basically our cryptography today is so advanced that it not only can break most common encryptions, but it can in fact break the differences between most languages if guided by human sense.

            I think that this is an interesting curiosity but possibly a sad one for our age. It's hard to find people with heavy skills in dead languages nowadays.

            On a more discouraging note, once you throw encryption into the picture and add it to an unknown(?) inflected language, you see that the problem will require the asse

    • by 1u3hr ( 530656 ) on Wednesday December 31, 2003 @03:54AM (#7843667)
      if someone really wanted to make a hoax book, they could simply translate any other book (even the bible) into a made up language.

      Making up a language, that isn't just a scrambled version of an existing one, is very, very hard. It takes someone like Tolkien (a professor of Old English who could translate Norse on the fly) to do that convincingly, and I doubt that anyone in the period could have done it in a way that would still defy detection.

      • Well remember that scholars of that period, 16th century, could most likely read and speak 3-4 languages fluently, including Latin, Greek, and 1-2 local languages. Really good linguists could translate much more.
        • scholars of that period, 16th century, could most likely read and speak 3-4 languages

          Yes; but the theoretical, abstract knowledge of language that you'd need to make up a novel, believable one didn't exist. Of course, a genius could appear and work it out for himself.

      • Yeh huell? Methinkses int et so easy ferfun fule...

        And then there are the folks who can wix up mords, mometimes in sid-sentence, on fle thy. (Yes, I'm just as fluent when speaking as well. With dome sifficulty, I man wix up three cords as well.)

        I don't know if it was a side effect of all the programming I've done, or just some latent dyslexia, but I'm able to mix up words and invent new ones with relative ease. Perhaps the challenge of having to remember umpteen different passwords, and change them ever
    • by Zibi ( 528341 )
      Wow I've never actually made a comment on slashdot and had so many replies. To be entirely honest I don't know much about the document in question. When I scanned through, it struck me that they are looking into complex ways of proving it to be a hoax when it could be something more simple. I do understand the complexities of creating a language, and I didn't really mean to make up a completely new language with new grammar etc., I was more referring to creating your own alphabet. Create your own symbols.
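A made-up alphabet of this kind is a simple substitution cipher. Here is a rough sketch, assuming lowercase Latin letters only; the glyphs are just borrowed from the Unicode runic block as stand-ins for invented symbols, and `encode`/`decode` are names picked for this example.

```python
import string

# Hypothetical invented alphabet: map each Latin letter to a stand-in glyph.
ENCODE = {letter: chr(0x16A0 + i) for i, letter in enumerate(string.ascii_lowercase)}
DECODE = {glyph: letter for letter, glyph in ENCODE.items()}


def encode(text: str) -> str:
    """Replace each letter with its invented glyph; leave other characters alone."""
    return "".join(ENCODE.get(ch, ch) for ch in text.lower())


def decode(text: str) -> str:
    """Invert encode() by mapping each glyph back to its letter."""
    return "".join(DECODE.get(ch, ch) for ch in text)


secret = encode("in the beginning")
print(secret)
print(decode(secret))
```

Note that a one-to-one mapping like this keeps the letter and word frequencies of the source text intact, which is why the other replies expect it to fall to statistical analysis.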
  • Beale Papers (Score:5, Interesting)

    by Dan East ( 318230 ) on Wednesday December 31, 2003 @02:39AM (#7843475) Journal
    Sounds a bit like the Beale Papers [unmuseum.org].

    Dan East
  • Ridiculous (Score:4, Interesting)

    by SargeZT ( 609463 ) * <pshanahan@mn.rr.com> on Wednesday December 31, 2003 @02:40AM (#7843479) Homepage
    I'm sorry, but calling the Voynich Manuscript a hoax is unfeasible. Sure, could it have in theory been a hoax? Yes, but there is no point to this. The "hoaxer" creates this in 3+ months, with very accurate drawings, and probably hangs on to it till he dies, so that it can be sold to a king 100 years later and eventually make it to America? Then again, maybe Nostradamus wrote it.
    • Re:Ridiculous (Score:5, Interesting)

      by Seth Morabito ( 2273 ) on Wednesday December 31, 2003 @03:00AM (#7843541)
      The point of a hoax, in my opinion, would most likely have been financial gain.

      There is no clear evidence pointing to an exact date that the manuscript was written, and the only firm circumstantial evidence we have to go on is Marcus Marci's letter to Athanasius Kircher, which mentions that the manuscript was sold to King Rudolph for 600 ducats. That is a heck of a lot of money. It seems perfectly reasonable to me that someone manufactured the manuscript to extract 600 ducats from the emperor.

      This assumes a lot. It assumes that the letter is genuine, and it assumes that the facts mentioned in the letter are true, and it assumes that Rudolph was the first buyer, so it is by no means a sure thing. But a lot of us who lean (gingerly) toward the hoax theory stand by Occam's Razor, which points to a hoax being at least a feasible, and probably even likely solution. Rugg's analysis is just more circumstantial evidence, not proof, but every little bit weights the scale more.
      • Re:Ridiculous (Score:5, Insightful)

        by shaitand ( 626655 ) on Wednesday December 31, 2003 @03:43AM (#7843646) Journal
        No actually "evidence" THIS broad lends no weight whatsoever. I say this wholeheartedly as someone who has never even heard of the particular manuscript in question.

        Here is what I know, partly assuming what you've said is accurate. Nobody knows when the manuscript was produced, and the only evidence that indicates its existence at a particular point may be suspect (although this is the case with many of the dates we've fixed for events in history and even the basis for several things we believe happened to the degree we call and teach them as facts). Yet this discovery claims that at the time the manuscript was produced it was possible to produce fake meaningless gibberish that appears to have meaning.

        Am I the only one who finds a problem with that in itself? How can you claim something was possible at the creation date when you don't know the creation date?

        Next, granting that the date looked into magically did happen to coincide with the creation date that nobody knows: how exactly does a process being theoretically possible at a date get considered as evidence that it is what was done in a particular instance?

        Example, my house catches fire. Firefighters are unable to determine the source. The insurance company denies my claim on the grounds that the technology existed to rub two sticks together to generate heat and produce fire.

        I wouldn't even call that circumstantial evidence. That isn't EVIDENCE at all. Hell if there were two sticks in the lawn right under the tree, then it would become the most ridiculous circumstantial evidence that should obviously be tossed aside. But it would be the sticks that are the evidence there, not the fact that it's possible to create fire by rubbing two sticks together and the technology existed at the time. However there isn't even that much here.
        • by DeadVulcan ( 182139 ) <dead@vulcan.pobox@com> on Wednesday December 31, 2003 @08:26AM (#7844390)

          Example, my house catches fire. Firefighters are unable to determine the source. The insurance company denies my claim on the grounds that the technology existed to rub two sticks together to generate heat and produce fire.

          Of course, this is ridiculous. But there have been many who claimed that producing a hoax as convincing as the Voynich papers was virtually impossible. Rugg has shown that, at the earliest known date of "discovery," it was possible, and perhaps well worth doing for the price it fetched.

          So, your analogy is incomplete. The insurance company's argument would have some relevance if you had previously been claiming that it was technologically impossible for you to light the fire. They just produced a counter-argument.

          Coming back to the Voynich manuscript, it just means that the possibility of a hoax cannot be ruled out because of the effort required to produce it. Turns out it's not as hard as people thought.
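To give a feel for why this kind of text is cheap to produce, here is a toy sketch loosely modeled on a table-and-grille approach: a table of word fragments and a card with holes that is slid across it. Everything specific here (the fragments, the table size, the hole offsets, the function names) is invented for illustration and is not taken from Rugg's paper.

```python
import random

# Hypothetical syllable inventories, invented for this sketch only.
PREFIXES = ["qo", "o", "ch", ""]
MIDDLES = ["ke", "te", "ol", "da"]
SUFFIXES = ["dy", "in", "y", ""]


def build_table(rows, rng):
    """Fill a table with one (prefix, middle, suffix) triple per row."""
    return [
        (rng.choice(PREFIXES), rng.choice(MIDDLES), rng.choice(SUFFIXES))
        for _ in range(rows)
    ]


def read_through_grille(table, grille, start):
    """Read one word: grille[col] is the row offset of the hole over column col."""
    rows = len(table)
    return "".join(
        table[(start + offset) % rows][col] for col, offset in enumerate(grille)
    )


def generate_words(table, grille, count):
    """Slide the grille down the table one row at a time, wrapping around."""
    return [read_through_grille(table, grille, start) for start in range(count)]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
table = build_table(12, rng)
print(" ".join(generate_words(table, (0, 1, 2), 20)))
```

The output has word-like regularities (recurring prefixes and suffixes, repeated words) without carrying any meaning, which is the point being argued above. Swapping in a different grille or table changes the "vocabulary" with very little extra work.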

      • The point of a hoax, in my opinion, would most likely have been financial gain.

        The manuscript was produced in a time when alchemy was the only science in town. Knowledge of herbal cures would've been a goldmine during that period, and studies to discover how to turn base metals into gold were the arms race of its day. Given that alchemists commonly encrypted their notes, this manuscript would've made a tempting purchase.

        One overlooked thought is the amount of effort that went into encryption and decrypti

  • by the end of britain ( 575444 ) on Wednesday December 31, 2003 @02:42AM (#7843489)
    The technique really is interesting. We have techniques that can identify patterns that are meaningful (all of cryptology, most of number theory, graph theory) but this application is neat because it is an effort to prove--rigorously--that a given set of data is just total noise.
    • by dido ( 9125 ) <dido@imperiuUUUm.ph minus threevowels> on Wednesday December 31, 2003 @05:50AM (#7843927)

      One definition of randomness, and one that seems quite reasonable is that a string is "random" if it cannot be compressed to smaller than it is, i.e. listing its characters itself is the most compact possible description. Formally, a string is random if there exists no algorithm generating the string whose description on some universal Turing machine is smaller than the string itself (this is the definition used in the field of Kolmogorov complexity [everything2.com]). A string of a billion digits making up Pi, for example, is not random by this definition, as one can easily write a short program [everything2.com], whose length would certainly be less than one billion characters, whose output is the digits of Pi. Think of it this way: the most general form of pattern matching device that we know of is a Turing machine, and if the best device you can construct to match that pattern is as complex or more complex than the pattern itself, then well, you have total randomness. Unfortunately, rigorously proving that a particular string is random by this very strong definition is extremely difficult, as you run into undecidability everywhere you turn.
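A rough way to play with this idea is to use an off-the-shelf compressor as a stand-in. The sketch below uses zlib, which only gives an upper bound on the shortest description (the true Kolmogorov complexity is uncomputable), and the sample data and the name `compression_ratio` are just choices made for this example.

```python
import os
import zlib


def compression_ratio(data: bytes) -> float:
    """Compressed size divided by original size (lower means more pattern found)."""
    return len(zlib.compress(data, 9)) / len(data)


patterned = b"the cat in the hat " * 500   # highly regular text
noise = os.urandom(len(patterned))         # bytes from the OS random source

print("patterned:", compression_ratio(patterned))
print("noise:    ", compression_ratio(noise))
```

A low ratio shows the data is not random in this sense. A ratio near 1 proves nothing, since a smarter program might still find a pattern that zlib misses.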

      This is the sort of stuff that real theoretical computer science is made of. For a very good overview of the theory of Kolmogorov Complexity and algorithmic information theory, Gregory Chaitin's home page [auckland.ac.nz] is a good starting point.

      To go back to the Voynich manuscript, if there is some sort of regularity that can be discerned from it, then perhaps a context-free or context-sensitive (or something in between) language may be found to characterize it. Once you have such a syntactic characterization, perhaps it might be possible to divine the semantics from context. The shape of the grammar that results may well prove whether the Manuscript is in fact a real language, a fabrication, an elaborate cipher, or just total gibberish.

      • That definition of randomness does make sense. Unfortunately it is undecidable, so you can never prove something is random according to the definition. You can prove something is not random, if you can find a program generating it. But if you cannot find such a program, you don't know if it is because it doesn't exist, or if you just didn't look at the right one.

        As for finding a language given the string, it isn't hard to find a regular language containing the string, the hard part is to find the right la
  • so obvious (Score:4, Funny)

    by segment ( 695309 ) <sil AT politrix DOT org> on Wednesday December 31, 2003 @02:42AM (#7843490) Homepage Journal
    Gordon Rugg has used the techniques of Elizabethan espionage to recreate the Voynich manuscript, which has stumped code-breakers and linguists for nearly a century

    Had Mr Rugg just used rot13 he would've cracked the code long ago. Want Crypto? [politrix.org]

  • by WIAKywbfatw ( 307557 ) on Wednesday December 31, 2003 @02:43AM (#7843492) Journal
    ...but it's another thing to prove it.

    Anyone can say anything is a hoax but it takes scientific evidence - actual empirical data - to prove such a claim.

    For example, people once believed that the Earth was flat (some people still do) but the circumnavigation of the globe by explorers such as Magellan, lunar eclipses, etc provide evidence to the contrary.

    Saying that just because something could be a hoax then it is a hoax is just plain stupid. Like Fermat's Last Theorem, it may be many years before Voynich is proved to be genuine or accurate, but the absence of proof of the former doesn't provide proof of the latter. Remember, even though FLT has been proved, we still don't have the "simple proof" that Fermat himself discovered.

    Saying that the manuscript is more likely to be a hoax than not just because computer scientists have theorised that it could have been faked in the 16th century is like a 25th century scholar saying that the Wright Brothers flight, the atomic bomb and the Apollo missions are more likely to be hoaxes than not just because they could have been faked with 20th century technology.
    • Remember, even though FLT has been proved, we still don't have the "simple proof" that Fermat himself discovered.

      That's because he almost certainly didn't discover one.

      Fermat was known for making some pretty bone-headed mistakes. Also, in his future writings he posed challenges to prove FLT for the case of n=3 or n=4, but never for general n>2. If he had found a truly elegant proof of the general case, and believed it was true, why not pose the general challenge?
    • I think what you're trying to get at is the inherent difficulty of "proving a negative". It's always much easier to prove an affirmative (which inherently contradicts the position to be disproved). Hence, sailing around the world proves that it's round, and therefore not flat.

      However, the article offers speculation, not claims of proof/disproof.

      I don't see anything unreasonable in the claim that the manuscript might be a hoax; reasonable observers will note that this is not actual proof.

      In the meant
    • RTFArticle. It is pretty clear that if the text can be produced by the algorithmic chart as described, it is meaningless gibberish.

      You remind me of Stanisław Lem's classic book "Memoirs Found in a Bathtub." It's about a society that revolves around codebreaking. Lem makes huge plot points about short texts that are ambiguously decodable into dozens of other possible texts. They are never sure if the message really IS a code, or whether one of the decoded versions contains further codewords. But everyone is
      • Bible Code? (Score:4, Interesting)

        by gillbates ( 106458 ) on Wednesday December 31, 2003 @09:18AM (#7844676) Homepage Journal
        I do believe that there are "codes" in the Bible, but the reason is different than what the fanatics describe. My belief is that the Bible codes exist for only one reason: to ensure accuracy. Consider the following:

        The cat in the hat caught a rat and that was the end of that.

        Notice the rhyming. Now translated into Spanish (courtesy babelfish):

        El gato en el sombrero cogio una rata y ese era el final de eso.

        Now translated back into English:

        The cat in the hat took a rat and that one was the end of that.

        Okay, so notice in the original that the rhyming words appeared in positions 1, 4, 7, 9, and 14 (zero based). In the retranslation, the rhyming words appear in positions 1, 4, 7, 9 and 15. This disparity alone is enough to determine that the retranslation is not accurate.
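The position check above is easy to reproduce. This is a minimal sketch that treats "rhymes" as "ends in -at", which happens to be enough for this particular sentence; the function name `rhyme_positions` is invented for the example.

```python
def rhyme_positions(sentence: str, ending: str = "at") -> list[int]:
    """Zero-based positions of the words in sentence that end with `ending`."""
    words = [w.strip(".,;:!?").lower() for w in sentence.split()]
    return [i for i, w in enumerate(words) if w.endswith(ending)]


original = "The cat in the hat caught a rat and that was the end of that."
round_trip = "The cat in the hat took a rat and that one was the end of that."

print(rhyme_positions(original))
print(rhyme_positions(round_trip))
print(rhyme_positions(original) == rhyme_positions(round_trip))
```

Comparing the two position lists flags the round trip as altered, even though the rhyming words themselves all survived.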

        Supposing that one writes in such a manner that there is a definitive pattern to their sentences and word choices, it is easy to determine the accuracy of a text after having gone through many translations. For a book such as the Bible, this was of paramount importance. I believe the original purpose of the "Bible codes" was to ensure that the meaning of scripture was not lost as it was passed from one generation to the next.

        Consider for example, the poem. If a poem is incorrectly copied, it no longer rhymes, or the meter is disrupted. This simple mechanism not only ensures easy memorization, but provides a security against unintended alteration. In much the same manner, the "Bible codes" have provided scholars a way of discerning the accuracy of a copy of scripture. In fact, some of scripture is indeed poetic, further reinforcing the confidence in the original scriptures.

        I find it somewhat interesting that lossless copying was available long before digital electronics were invented.
        • Re:Bible Code? (Score:4, Insightful)

          by Ami Ganguli ( 921 ) on Wednesday December 31, 2003 @09:52AM (#7844930) Homepage

          An interesting conjecture, but you'd have to provide some sort of evidence to back it up. The "famous" bible codes are clearly nonsense - you can tweak the algorithm to extract just about anything from any text (see here [anu.edu.au] for an example). Do you have some alternative code that stands up better to scrutiny?

          Also, at the time the books in the bible were written, accurate transcription wasn't considered nearly as important as it is today. The stories were part of an oral tradition anyway, and would have evolved in the telling before ever being committed to paper. Early scribes were aware of this and would not have thought twice about "correcting" parts of the story that didn't, to them, seem to be right.

          • Re:Bible Code? (Score:3, Interesting)

            by Samrobb ( 12731 )

            Also, at the time the books in the bible were written, accurate transcription wasn't considered nearly as important as it is today.

            Sorry - you're wrong, particularly in terms of the writings that make up the Old Testament. The requirements for copying these texts were pretty stringent [bereanpublishers.co.nz]. Requirements 4, 6, and 7 are particularly interesting:

            The Talmud lists the following rules for copying the Old Testament:

            1. The parchment had to be made from the skin of a clean animal, prepared by a Jew only, and was t
    • The claim being made is not that this proves it to be a hoax, but that it disproves the only real evidence for it to be authentic, that it is too complex to be a forgery. In other words, it simply opens the debate a bit more.
    • Actually, the burden of proof would be on those who claim there is some meaning in it. Reading the article, and references to the manuscript, the "It's a hoax" proposition now has a plausible explanation as to how a hoax could be perpetrated, though that is not conclusive evidence.

      Anyone can say anything is a hoax but it takes scientific evidence - actual empirical data - to prove such a claim.

      Anyone can claim anything, but the more outrageous the claim the more evidence they need to support it. Someone could
    • Anyone can say anything is a hoax but it takes scientific evidence - actual empirical data - to prove such a claim.

      No. It is the proponents of the idea that the book is genuine's job to prove that it is indeed that. One doesn't need to prove that something is a hoax if it is, Occam's Razor does that job. Which explanation contains the fewest unsubstantiated assumptions: that something was written in a language nobody knows, containing valuable information nobody has any idea about, or that it was produced using a simple encryption technique to fool somebody to pay loads of shiny ducats?

      For example, people once believed that the Earth was flat (some people still do) but the circumnavigation of the globe by explorers such as Magellan, lunar eclipses, etc provide evidence to the contrary.

      I find it amazing that some people still hold this myth as true! What kind of history education have you had!?!

      Look, no scientist has ever claimed the earth was flat. For one thing, in every other culture than the western, it has never been claimed otherwise ("they even knew the earth was spherical"), but some have got the weird notion that Columbus had to argue that the earth wasn't flat.

      He didn't. The moron had the wrong numbers, and would have gotten killed if America didn't happen to be there.

      Already the pupils of Thales claimed their master knew the earth was round. Eratosthenes measured the circumference of the earth with an error of 3%! The true circumference of the earth was known to the Greeks in antiquity! Plato and his pupil Aristotle himself knew many arguments for the spherical shape of the earth, and why is this important? Because though some Christian scholars around 300 AD didn't like the idea of a spherical earth, St. Augustine adopted much of Plato's philosophy and made it an important part of Christianity in the same century, and they adopted the ideas of a spherical earth as well. Through Augustine, every leading authority accepted the idea of a spherical earth.

      Eventually, Eratosthenes' numbers were also accepted, but Columbus didn't like them, because it meant that going the other way to India was infeasible. So, he used some other numbers, and he used Marco Polo's exaggerated estimates of the distance he had travelled, and so he made it quite feasible. But it wasn't, he was wrong.

      Columbus thought the distance to Asia was 4000 km, his contemporary scientists 16000 km, the real distance is 23000 km, while Columbus eventually travelled 6500 km.

      So, why is this important? Because people who hold this belief often have many other misunderstandings about science. Indeed, you can't prove that the book is a hoax, but for that reason, the burden of the proof rests with the proponents of the idea that it is genuine. Who, of course, might cling to the idea that it is, long after the world has moved on to greener pastures. That's how it usually works anyway.

      • The post it answers, marked as "informative" or "insightful", isn't informative or insightful, and completely misunderstood science and scientific theory (i.e., where the burden of proof lies). Whereas the parent hit the nail on the head and, to boot, adds information on a popular belief which turns out to be wrong (the earth's shape being well known).
      • by notfancy ( 113542 ) <matias@k-bel l . com> on Wednesday December 31, 2003 @11:14AM (#7845622) Homepage

        Anyone can say anything is a hoax but it takes scientific evidence - actual empirical data - to prove such a claim.

        No. It is the proponents of the idea that the book is genuine's job to prove that it is indeed that. One doesn't need to prove that something is a hoax if it is, Occam's Razor does that job. Which explanation contains the fewest unsubstantiated assumptions: that something was written in a language nobody knows, containing valuable information nobody has any idea about, or that it was produced using a simple encryption technique to fool somebody to pay loads of shiny ducats?

        No, to you. Occam's Razor is a heuristic for selecting hypotheses to test. It doesn't relieve you of the burden of proof just because your burden is heavier. You definitely do need to prove that "X is false", if that is the hypothesis you selected based on whatever heuristics you choose.

        Voynich is patently written in an unknown code (i.e., language): that's not an assumption, it's a given for both hypotheses. The first hypothesis (you used the non-synonym "unsubstantiated assumption") is that Voynich has high information content in the algorithmic sense. The second hypothesis is that Voynich has low information content, again in the algorithmic sense. Considerations of value, motive, etcetera are irrelevant to this analysis although they might be of heuristic value for selecting hypotheses, but not for application of Occam's Razor (which is another heuristic).

        To sum it up, you still have the burden of proof, and you can't use heuristics for selecting heuristics.

  • Let's see - it turns out that the Voynich manuscript is likely a bunch of drivel with pictures of naked women. Looks like we haven't come that far since it was written, as this Filipino [adage.com] edition of FHM would suggest!
  • Google found me this (Score:5, Informative)

    by ElDuque ( 267493 ) <adw5@lehigh . e du> on Wednesday December 31, 2003 @02:46AM (#7843502)

    In case you're wondering what it looks like

    http://www.voynich.nu/ [voynich.nu]
  • by puzzled ( 12525 ) on Wednesday December 31, 2003 @02:53AM (#7843520) Journal


    There is a portion of Cryptonomicon by Neal Stephenson where a real book of coded intercepts is replaced by random number strings encrypted with a fairly simple scheme.

    Does anyone know if this book is a seed for Stephenson's story? He draws an awful lot of information from the history of computing for his stories.
    • Well, a more likely inspiration for the "Cryptonomicon" manuscript mentioned in Cryptonomicon and Quicksilver is the Steganographia [esotericarchives.com] of Trithemius. In the late 1990's the book was briefly in the news because a well known cryptographer, Jim Reeds, found and deciphered a hidden message from it.
      • The "Cryptonomicon" has an obvious linguistic similarity to the "Necronomicon" of H.P. Lovecraft. Colin Wilson later wrote sci-fi / horror stories that included Lovecraft and which stated that the Voynich Manuscript was actually one copy of the Necronomicon.

        I have no idea if Stephenson knew this, but given the similarity of names, I would suspect so.

        More details can be found here [necfiles.org].
  • by carambola5 ( 456983 ) on Wednesday December 31, 2003 @02:54AM (#7843525) Homepage
    Does anyone else get the feeling that these people are just saying "It's too hard. We give up" ?
    • Does anyone else get the feeling that these people are just saying "It's too hard. We give up" ?

      Yes, I do, and this isn't the first time I've felt this way. I've thought the same thing when I hear the term "junk DNA."

      This "book" was authored in antiquity, through great effort and expense. It represents lost knowledge. If it is ever understood I doubt it will actually matter; likely it's a book on botany as it was understood in its time, replete with mysticism, fables and bad ideas, while managing to
  • A Hoax? To What End? (Score:5, Informative)

    by WombatControl ( 74685 ) on Wednesday December 31, 2003 @03:03AM (#7843555)

    I've studied the Voynich manuscript before, and the possibility of a hoax seems just as unlikely as many of the theories that have been floating about. Yes, the language of the Voynich manuscript could be an elaborate hoax, but Rugg's analysis only proves what is already widely known.

    The problem with the elaborate-hoax idea is that even Rugg's theory doesn't explain all the features of the Voynich manuscript. Furthermore, it seems unlikely that a sixteenth-century forger would go to the trouble of creating something that would have all the qualities of a real language and would include techniques that would deliberately resemble an actual document when viewed with analytical techniques that wouldn't be developed until later. Occam's Razor makes it seem more likely that there is some kind of language operating in the manuscript than a random system of patterns. Then again, there's no real way of knowing.

    There are some images of the text of the Voynich Manuscript available here [voynichinfo.com]. Analysis of the text and the illustrations supports the theory that the manuscript has defined sections on astrology, herbal medicine, and other subjects. There have been some serious and some ridiculous theories about the manuscript, ranging from the intriguing notion that the Voynich text is mathematically similar to East Asian languages [unicamp.br] like Chinese or Vietnamese to the claim that the Voynich manuscript is written in an ancient form of Ukrainian. (I've read the supposed translation of it from the Ukrainian, and it hardly makes sense given that the manuscript's illustrations don't match the text of the supposed translation.)

    In the meantime, this site offers more information on modern translation efforts [bham.ac.uk] including a font for the Voynich script. (Which would make a lovely way of annoying co-workers by switching their default system font to Voynich text...)

    • by aepervius ( 535155 ) on Wednesday December 31, 2003 @07:34AM (#7844163)
      If you and others claim that it has a meaning, then you should come forward with it. Unless you have compelling evidence, what you are presenting is no more and no less than wishful thinking or belief.

      Indeed, right now it hasn't been proved at all that this manuscript has any meaning (encrypted or not), and a researcher has shown that you can reproduce most of the features of the manuscript using an encryption technique devised a few years earlier. Furthermore, the person who sold it to the first known owner was a forger. Yes, not all features are reproduced. But this is a step forward.

      The burden of proof is with you, and "Then again, there's no real way of knowing." isn't an answer, at least not one that a scientist, or anyone interested in knowing more, would accept as good enough. And yes, "Voynich manuscript. Furthermore, it seems unlikely that a sixteenth-century forger would go to the trouble of creating something that would have all the qualities of a real language and would include techniques that would deliberately resemble": well, I have news for you. 3.5 kilograms of gold (a prince's wealth for the time) makes it more likely than you'd care to admit.

      You may have included a lot of links, making people see your post as informative, but frankly it isn't, especially given your dubious use of Occam's Razor (the explanation requiring the FEWEST new entities is the most probable). Sorry, but claiming that the manuscript holds meaning requires one more unknown new entity (where does that language come from?) than claiming it was made with the clever trick mentioned above (available at that time), which requires no unknown new entity.

      My final point is: Occam's razor only tells you which explanation is the most likely, NOT WHICH ONE IS CORRECT.
  • Author's Page (Score:5, Informative)

    by mlc ( 16290 ) on Wednesday December 31, 2003 @03:19AM (#7843593) Homepage
    Prof. Rugg has a website [keele.ac.uk] about his methods and results, which may be of interest.
  • by aztektum ( 170569 ) on Wednesday December 31, 2003 @03:27AM (#7843610)
    To prove that the manuscript is a hoax, one would need to produce entire sections using this technique, says Pelling. Tweaking the grilles and tables should make this possible, reckons Rugg.

    It's called a Xerox machine, man.
  • repeats (Score:5, Insightful)

    by 1u3hr ( 530656 ) on Wednesday December 31, 2003 @03:48AM (#7843657)
    The Nature story says:
    The text contains some features that are not seen in any language. The most common words are often repeated two or three times, for example - the equivalent of English using 'and and and' - giving weight to the hoax theory.
    Indonesian pluralises words by duplicating them (anak = child, anak anak = children). And many languages, including English ("he was really, really stupid") intensify by repetition, so this point is not at all conclusive.
    • Just some nitpicking, your analogy doesn't quite match. It's the _most_common_ words that are repeated. Neither "anak" nor "really" fit that.

      • Re:repeats (Score:4, Interesting)

        by PurpleFloyd ( 149812 ) <zeno20@ a t t b i.com> on Wednesday December 31, 2003 @04:30AM (#7843748) Homepage
        Of course, that particular point isn't much, cryptographically. Ever since frequency analysis came into use, historical cryptographers used "nulls" in their codes - random meaningless characters which would hopefully cause trouble for frequency analysts. It may be that the manuscript's code contains keywords that the decoder should ignore (all repetitions of a word, for instance), or that instruct the decoder to perform a certain action (say, 3 repetitions means to skip the next three words).

        On the other hand, this certainly could be a hoax. After all, the author was familiar with cryptographic methods and was paid an enormous amount of money for the manuscript. The real truth could certainly be either hoax or reality - there simply aren't enough facts available to decide right now, despite the huge amount of work put into the manuscript by many talented amateur cryptographers.
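        For illustration, the two decoding rules floated above can be sketched in a few lines of Python. Both rules are guesses taken from this comment, not anything established about the manuscript. The function names and sample tokens are made up, and generalizing "3 repetitions means skip three words" to "a run of n means skip n words" is an assumption.

```python
from itertools import groupby

def drop_repeats(words):
    """Rule 1 (hypothetical): repeated words are nulls, so keep one per run."""
    return [word for word, _run in groupby(words)]

def decode_with_skips(words):
    """Rule 2 (hypothetical): a run of n identical words (n > 1) is an
    instruction. Drop the run itself, then skip the next n words."""
    out = []
    i = 0
    while i < len(words):
        # Measure the length of the run starting at position i.
        run = 1
        while i + run < len(words) and words[i + run] == words[i]:
            run += 1
        if run > 1:
            i += run + run  # discard the run, then skip `run` more words
        else:
            out.append(words[i])
            i += 1
    return out

tokens = "aa xx xx bb cc dd".split()  # made-up tokens, not Voynich words
print(drop_repeats(tokens))
print(decode_with_skips(tokens))
```

        Either rule would hide the repeats from a reader who knew the convention, which is why repeated words alone can't settle the question.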

      • Re:repeats (Score:3, Insightful)

        by 1u3hr ( 530656 )
        Neither "anak" nor "really" fit that.

        Those were random examples. In Indonesian, EVERY noun is doubled to pluralise. So this is a very common feature indeed. In English, no, we don't duplicate so much.

        As far as the main article goes, though, I'd vote for it being a hoax.

        • Yes, but I have to ask, since my knowledge of Indonesian is non-existent, is the most used word in Indonesian also a noun? I'd expect the most used word to be something like "and", or maybe a verb like "is", or maybe "the" or whatever has a similar grammatical role to these in Indonesian.

          • Yes, but I have to ask, since my knowledge of Indonesian is non-existent, is the most used word in Indonesian also a noun?

            Could be if you happened to use "men" in every sentence, as it's certainly possible to do in English.

            Anyway; I wasn't arguing that this language was Indonesian, and we weren't given any figures in the brief article. I simply wanted to point out that frequent duplication is part of at least some natural languages, and so is not much of an indicator of whether it's real.

          • Latin doesn't have articles (a, the); the verb "to be" (esse) is often just left out, likewise "and".

            Of course Latin was written and spoken for centuries, and in many different territories, so there is some variation to this; nevertheless, in most Classical texts you'll get to read when you start to study Latin, "and", "is", "a" and "the" will hardly make an appearance.
    • Gue sepak biji lu!
    • The most common words are often repeated two or three times, for example - the equivalent of English using 'and and and' - giving weight to the hoax theory.

      Repetition, especially thrice, is actually very common, at least in the Ancient Egyptian language, and many modern occult texts are based on Egyptian mysticism. It is puzzling that the writers don't know that.

      IANAEIAE, but three repetitions typically refer to plural form (such as ntrw, gods, usually depicted as three "axes" when ntr, god, is usually depict
  • 'Well, Terry, as I always say, "we skipped the light fandango, turning cartwheels across the floor, I was getting kind of seasick, but the crowd called out for more!"'

    'Why Jimmy, that's very profound! What does it mean?'

    'Well, Terry, I'm fucked if I know!'

    (From Alan Parker's The Commitments)
  • by imag0 ( 605684 )
    It *is* encrypted quite well. However, after viewing the pages, printing them out and aligning them just so, I have discovered the message hidden over the ages:

    YHBT YHL HAND
  • Interesting problem. (Score:5, Interesting)

    by Black Parrot ( 19622 ) on Wednesday December 31, 2003 @05:22AM (#7843863)


    Those who read the article can take note of an interesting challenge: though Rugg has shown that it is possible to generate a high quality hoax using a Cardan grille, proving it to be a hoax may require producing a character grid that will actually generate large portions of the text. My question is, could that be done with a genetic algorithm, and are any Slashdotters up to the task?

    Also, a few comments about formal analysis. Notice that if you took some arbitrary text, typeset it in a fixed-width font to force the characters into columns, and then skimmed it with a grille in order to generate a new text, you would automatically preserve such basic statistics as character frequency, including spaces and also punctuation if you used them in your grid. (Depending on how you applied the grille, you could actually be generating a simple permutation of the original text.) However, you would disrupt all the within-word correlations.

    For example, in compound words derived from Latin there is a familiar pattern where ad C* ==> aCC* (where C is some arbitrary consonant), but that pattern would be completely obscured if the characters were read off a diagonal grille as shown in the photograph. You would still get the increased frequency for C, but not the common aCC pattern.

    More subtly, there are some well known universals of syllable structure in natural languages, but those would be scrambled just as the aCC would be. You would have the right proportions of consonants and vowels, but not a realistic distribution within words.

    Likewise, prefixes and suffixes would be scrambled. If it is a hoax generated by a Cardan grille, it should not have prefix/suffix patterns that occur commonly in many languages. (Ditto for suffixal inflections.) In fact, the letters appearing at the beginnings and ends of words should be a random sampling from the frequency distribution of letters in the whole text; this may be the easiest metric to check.

    Also, by using spaces as characters in your grid you'd get the right proportion of spaces, and therefore the right average word length, but you would obscure any patterns in word length. Someone has already linked to studies of the word lengths in the manuscripts, but those assumed that the distribution of Latin word lengths would be preserved. However, only the average would be preserved. I suspect the distribution would be converted to a Gaussian. Anyone got time for the experiment? (Notice that you may generate extra spaces with the grille, depending on how you use it. For example, what do you do when your grille starts running off the bottom of the page in your source text? Or, if your grille has 10 windows, do you transcribe to the first space and then move the grille, or do you transcribe everything in the grille and insert a "virtual" space for position 11? It looks to me like you might be able to generate the document's actual "word" lengths from Latin, given only some very basic assumptions.)
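    A rough Python sketch of the experiment proposed above, for anyone who wants to try it. Everything about the grille here is an assumption: a hypothetical diagonal grille with `windows` holes, moved one column at a time across each band of rows, wrapping at the right edge and skipping holes that fall off the bottom. The short Latin phrase is only a stand-in for a real source text. Applied this way, the grille produces a permutation of the source, so character counts (spaces included) are preserved while the word-length histogram is free to change.

```python
from collections import Counter

def make_grid(text, width):
    """Lay text out in fixed-width rows, padding the last row with spaces."""
    rows = [text[i:i + width] for i in range(0, len(text), width)]
    rows[-1] = rows[-1].ljust(width)
    return rows

def skim_diagonal(rows, windows):
    """Read the grid through a hypothetical diagonal grille.

    The grille is placed at every column of each band of `windows` rows.
    Hole k exposes the cell k rows down and k columns to the right,
    wrapping around the right edge. Holes below the last row are skipped.
    """
    width = len(rows[0])
    out = []
    for top in range(0, len(rows), windows):
        for left in range(width):
            for k in range(windows):
                if top + k < len(rows):
                    out.append(rows[top + k][(left + k) % width])
    return "".join(out)

def word_lengths(text):
    """Histogram of word lengths, splitting on runs of whitespace."""
    return Counter(len(word) for word in text.split())

# Stand-in source text; swap in a real Latin corpus for a meaningful run.
source = "gallia est omnis divisa in partes tres " * 20
rows = make_grid(source, width=31)
skimmed = skim_diagonal(rows, windows=4)

print("source :", sorted(word_lengths(source).items()))
print("skimmed:", sorted(word_lengths(skimmed).items()))
```

    Plotting the two histograms for a long source text would show whether the skimmed lengths look Gaussian or something else. If spaces end up scattered more or less independently, a geometric-looking tail is another possibility worth checking.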

    • by Black Parrot ( 19622 ) on Wednesday December 31, 2003 @05:31AM (#7843881)


      > In fact, the letters appearing at the beginnings and ends of words should be a random sampling from the frequency distribution of letters in the whole text; this may be the easiest metric to check.

      Actually, the distribution of initial letters might be preserved, or at least mostly preserved. If the source text is written so that lines always begin with a new word, and the grille is always aligned with the start of a line, then what you read out of the grille will preserve the frequencies of word-initial letters. But if you read more than one "word" out of the grille before moving it, you will get a mixture of the true word-initial distribution plus the distribution of all the letters in the document. And if you don't always align the grille to the start of a line, all bets are off.

      Off hand, I don't see any way that the distribution of word-final letters would be preserved. The first thing I would do to detect a hoax is compare that distribution to the distribution of all the letters in the document. If they are the same, then I would suspect the use of a grille or some other randomizer.
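      The comparison described above could be sketched like this in Python. The choice of total variation distance as the measure is mine, not part of the original suggestion, and a proper significance test (chi-squared, say) would be needed before drawing conclusions. The function names are hypothetical, and the input is assumed to be a transcription with words separated by whitespace.

```python
from collections import Counter

def letter_distribution(letters):
    """Turn an iterable of letters into a dict of relative frequencies."""
    counts = Counter(letters)
    total = sum(counts.values())
    return {ch: n / total for ch, n in counts.items()}

def total_variation(p, q):
    """Total variation distance between two distributions (0 = identical)."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)

def final_vs_overall(text):
    """Distance between the word-final letter distribution and the
    distribution of all letters. Assumes text has at least one word."""
    words = text.split()
    overall = letter_distribution(ch for word in words for ch in word)
    finals = letter_distribution(word[-1] for word in words)
    return total_variation(finals, overall)

# Natural-language text usually favours certain endings, so the distance
# should sit well above zero; a value near zero would point to a randomizer.
print(final_vs_overall("gallia est omnis divisa in partes tres"))
```

      The same function run on word-initial letters (`word[0]` instead of `word[-1]`) would test the first half of the argument.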

  • Repeats? (Score:3, Interesting)

    by plumby ( 179557 ) on Wednesday December 31, 2003 @05:32AM (#7843882)
    contains some features that are not seen in any language. The most common words are often repeated two or three times, for example - the equivalent of English using 'and and and'

    What about Chinese? From the little that I've learned, they often repeat a word for emphasis - e.g., Xie Xie meaning thank you.

  • I doubt I'm alone in that this article reminded me of the Codex Seraphinianus, [io.com] an untranslated, if I recall correctly, book of sketches of imaginary flora and fauna. It's a modern work by a European artist, mostly in colored pencil.

    There are only around 600 copies of the book, but I got a chance to see it recently via the marvelous university-interlibrary loan system. Worth a look!
  • As long as we're on the subject, could someone please do the world a favor and discredit the Protocols of Zion once and for all? It seems there are a few people in the world who still take this document seriously.
  • by buckeyeguy ( 525140 ) on Wednesday December 31, 2003 @08:22AM (#7844363) Homepage Journal
    Therefore it must be important! Eh, no. (See the Urantia Book [urantia.org] for one example of why some old nonsense is better left aside.)

    Years ago I had a coworker who would blather on about the Urantia book and its 'answers'... but then he was an old stoner too.

  • by Mikey-San ( 582838 ) on Wednesday December 31, 2003 @08:26AM (#7844387) Homepage Journal
    Here's a great little bit of information regarding Voynich:

    http://rec-puzzles.org/new/sol.pl/cryptology/Voynich

    Mmm, strangeness.
  • ... a book from an alternate universe that somehow made it here.
  • the seed is COMSTOCK.
  • Maybe it's the long lost Namshub of Enki...
  • by wintermute1974 ( 596184 ) <wintermute@berne-ai.org> on Wednesday December 31, 2003 @12:35PM (#7846341) Homepage
    Its text contains features found in no known language: for instance, its commonest words may be repeated two or three times in succession.
    Source: http://www.keele.ac.uk/depts/cs/staff/g.rugg/voynich/index.html

    It is very, very, very unlikely that common words would be repeated again and again and again unless someone really, really, really wanted to.

  • by Elonka ( 710689 ) on Wednesday December 31, 2003 @01:31PM (#7846901) Homepage Journal
    On my own list of Famous Unsolved Codes [elonka.com], the Voynich Manuscript is right up there at #2, just under the Beale Ciphers (which also have some pretty compelling arguments that they're a hoax).

    Some other good links for Voynich information:

    • An excellent viewer [yak.net] which lets you quickly see thumbnails of all of the pages at once.
    • A good overview [crystalinks.com] page
    • The Voynich Mailing List [voynich.net] - a site maintained by Jim Gillogly (famous for cracking the first few parts of Kryptos [elonka.com]).

    Elonka :)

  • by GMFTatsujin ( 239569 ) on Wednesday December 31, 2003 @07:05PM (#7849771) Homepage
    The Solution of the Voynich Manuscript by Leo Levitov was published by the Aegean Press in 1987. Links to Amazon.com are left as an exercise to the Slashdot readership.

    Levitov provides methodology for extracting the linguistic model that the book encodes. Many examples and translations are provided, and there is plenty of work for the reader to do if he wants to prove the system to himself.

    Levitov proposes that his solution reveals a manual of heretical text regarding the ease and assistance of the mortally ill into death -- euthanasia, basically. To my knowledge, his work has not been discredited, only ignored.

    For the definitive hoax-type artificial reality book, check out the amazing Codex Seraphinianus [io.com].

