Forgot your password?
typodupeerror
IT

The People Rescuing Forgotten Knowledge Trapped On Old Floppy Disks (bbc.com) 57

smooth wombat writes: At one point in technology history, floppy disks reigned supreme. Files, pictures, games, everything was put on a floppy disk. But technology doesn't stand still and as time went on disks were replaced by CDs, DVDs, thumb drives, and now cloud storage. Despite these changes, floppy disks are still found in long forgotten corners of businesses or stuffed in boxs in the attic. What is on these disks is anyone's guess, but Cambridge University Library is racing against time to preserve the data. However, lack of hardware and software to read the disks, if they're readable at all, poses unique challenges.

Some of the world's most treasured documents can be found deep in the archives of Cambridge University Library. There are letters from Sir Isaac Newton, notebooks belonging to Charles Darwin, rare Islamic texts and the Nash Papyrus -- fragments of a sheet from 200BC containing the Ten Commandments written in Hebrew.

These rare, and often unique, manuscripts are safely stored in climate-controlled environments while staff tenderly care for them to prevent the delicate pages from crumbling and ink from flaking away.

But when the library received 113 boxes of papers and mementoes from the office of physicist Stephen Hawking, it found itself with an unusual challenge. Tucked alongside the letters, photographs and thousands of pages relating to Hawking's work on theoretical physics, were items now not commonly seen in modern offices -- floppy disks.

They were the result of Hawking's early adoption of the personal computer, which he was able to use despite having a form of motor neurone disease known as amyotrophic lateral sclerosis, thanks to modifications and software. Locked inside these disks could be all kinds of forgotten information or previously unknown insights into the scientists' life. The archivists' minds boggled.

These disks are now part of a project at Cambridge University Library to rescue hidden knowledge trapped on floppy disks. The Future Nostalgia project reflects a larger trend in the information flooding into archives and libraries around the world.

This discussion has been archived. No new comments can be posted.

The People Rescuing Forgotten Knowledge Trapped On Old Floppy Disks

Comments Filter:
  • I imagine the are going to just have them imaged off as .vfd files for the "standard" 3.5" & 5.25" floppies and call it a day (for the data on the disks part, I did that with my own years..huh...decades ago? Crap, time flys). Maybe some high rez pictures of any that are labeled by people (especially if by Mr. Hawking). I have no idea on those Amstrad Floppy disk variants (compatibility wise)....oh, just read the linked article "3-inch Treasures: Transferring Material from Amstrad Floppy Disks", yup, loo

  • I do this kind of stuff all the time. Get a Greaseweazle and spend a few days imaging the disks. It's not difficult

    • Seriously. The author makes it sound like they're translating hieroglyphics without the rosetta stone. There's only a handful of drives that didn't use standard 5.25 or 8" floppies. Some Data General drives being a notable example.

      • by JBMcB ( 73720 )
        There's those weird 3" floppies that dedicated word processors used, as well as other gadgets like early digital cameras, in the late 1980's through to the mid 1990s. It's probably the most widespread oddball floppy format.
        • by Megane ( 129182 )
          As I understand it, those 3" drives use the same connector as mainstream drives. As long as you have an actual drive, they can still be archived with a Greasweasel.
      • Yes, there was a time that the 5.25 floppy, DSDD, was king, storing a whole 360K. However, not everybody used that storage format. I had a TI 99/4A that used an external 5.25 drive and formatted its floppies to 390K. Good luck using some sort of PC to read those if you don't know about the oddball formatting!
      • by XXongo ( 3986865 ) on Friday October 10, 2025 @07:57PM (#65717652) Homepage

        Seriously. The author makes it sound like they're translating hieroglyphics without the rosetta stone. There's only a handful of drives that didn't use standard 5.25 or 8" floppies. Some Data General drives being a notable example.

        The bits on floppies are magnetic; the magnetism decays, and the information becomes unreadable.. How long it takes to lose the information depends SENSITIVELY on the manufacturer (not all disks are alike) the year (they got better, but manufacturers also learned to cut corners to make them cheaper), on the writer (how strongly was it magnetized) and the reader (how sensitive is the reader) and on the storage temperature.

        • by cusco ( 717999 )

          The oxide can flake as well, and some of the substrates got brittle if they weren't stored correctly. Then there's the fun of trying to figure out what the file actually is. Word processor? Database? Executable? At least Windows insisted on a file extension that can give a clue, but a lot of the early systems didn't (and Linux still doesn't).

      • by cusco ( 717999 )

        Wang word processors used a 2 1/2" disk, leading a coworker to comment that he "was put in charge of handling a 2 1/1" Wang" when he was tasked with retrieving some data off one.

    • I found reading the full article worthwhile and believe the exclusive reader will find it similarly.

    • Same, I have a Greaseweazel and an Amiga with the catweasle interface, I can read more or less any floppy disk to my Amiga and most others to the PC with that setup. I recently recovered some photos from an Amiga/Catweasle 2.3mb formatted disks. It only took me an hour or so to set it all up. I often wondered why these folks never think to reach out to the classic computer enthusiasts who basically do this stuff for fun.
  • Perhaps with a chisel and hammer. Not on something that relies on rust (floppy/HD) or a reaction to intense light that also can suffer corrosion over time (CD/DVD).

    I guess what I'm saying is, if it's important - record it on vinyl?
    • Baked clay tablets are the best proven technology for long term archival. Use some error correction and a good storage policy and you should be good for the foreseeable future. You also need to create LOTS of copies of correspondences between the clay writing and other widely used written languages so that you can reconstruct the semantics after 1000 years when the language has been forgotten.

      Benefit over chiseling is it's much easier and cheaper to record on soft clay.

      Could probably even adapt the technolo

      • The smaller you write, the less obvious it is to future trash-pickers that there's any writing there. You could put a large font title somewhere, but what if that bit is broken off?

  • Don't copy that floppy!

  • by VaccinesCauseAdults ( 7114361 ) on Friday October 10, 2025 @06:20PM (#65717496)
    . . . except for the same short message in the boot sector

    ———

    I have a truly marvelous demonstration of Grand Unified Theory which the capacity of this floppy disk is too small to contain. Sincerely yours,

    Stephen Hawking

    Lucasian Professor of Mathematics

    University of Cambridge

  • by spaceyhackerlady ( 462530 ) on Friday October 10, 2025 @06:25PM (#65717500)

    It's not that long ago that I found myself with a box of 8" floppy discs from a legacy product and no way to read them. Yes, the software on them was long obsolete. But I would have liked to be able to preserve a bit of company heritage.

    The product in question (Glenayre GL-3000) had been updated in the interim to use 3.5" floppies, though with a bespoke format. I figured out how to use Linux and creative parameters to dd to write disc images. We packaged this as a bootable CD for customers to write their own disc images. After a sharp drop in floppy quality around 2005 I discussed other storage options with my boss (e.g. USB) but the business case just wasn't there.

    ...laura

    • by AmiMoJo ( 196126 )

      Nice work. There is a solution for this now though. A Greaseweazel is a cheap device that you can connect most floppy drives to, and which can read pretty much any format. Rather than acting as a normal floppy drive controller, it just captures the raw flux transitions and then allows the computer to decode them. A typical disk ends up being about 50MB of data, but it captures everything perfectly, including copy protected disks.

      I've been using one for a while. The only thing that could be improved is that

  • Without their efforts, we could lose the permanence code!

  • by mccalli ( 323026 ) on Friday October 10, 2025 @06:38PM (#65717550) Homepage
    I'm reading a lot of "it's simple, just get a...". If you read the article, it says they're "associated with an early Mac computer". That almost certainly means these a GCR formatted disks, and need a drive that can do variable speed rotation.

    It's not impossible obviously, but it's likely the best way to do this is with a vintage Mac itself. Which then implies hooking up a mass storage device of some kind to that Mac so that it can be transferred to something more modern. So not super rare and impossible, but definitely fiddly.
    • I was thinking that someone should a "universal floppy reader", where just the magnetic material is put on a surface, and stepper motors could be used to move a micro magnetic sensor over every micrometer, and do a 2D reconstruction. Software could be written to decode any format. I think the reason my mind goes there is because of degradation in the magnetic field, gunk getting in the jacket over the years, and the stubbornness of most floppy drives to just say "cant' be read" because it finds one bad CR
      • Re:GCR (Score:4, Interesting)

        by mosschops ( 413617 ) on Friday October 10, 2025 @07:48PM (#65717640)

        You don't need any custom sensor hardware to read the disk as the original drives have signal lines that provide all that's needed. They usually have some automatic gain control, but it usually gives the right result.

        You still need to connect the drive to a cheap device like the Greaseweazle (USB), which reads the raw flux reversals from the surface (plus timing information) as they're picked up by the drive head. All data decoding is done in software on the host system, which can handle ANY encoding type, and be smarter about error recovery.

        The disks themselves sometimes need a bit of manual cleaning to be in a good position to read. Not everything can be recovered but older disks are generally more reliable than cheaper newer media.

        • I think that is very interesting information, and rings very true to me. I'm just thinking that over time... I mean those floppy disks used to lay down a HUGE magnetic field per bit, and the field degrades, I think within almost 5 years the original readers lose the ability read 100% them. Then my guess is that the start an exponential decay. It is a gut feeling that some kind of a 'universal reader' could be made that is much more accurate, that can pick up bits from floppys maybe for the next 50 ye
      • I kinda think most of the mechanism can be done with legos and some 3d prints for various spindle adapters. While different floppy technolgies used different track widths, you can almost always use a smaller width sensor to read a larger media width. (writing with a smaller head was a huge compatibility problem back in the day. Like DD in an HD drive.)

        The magic of a teensy, rpi pico, etc can likely handle the stream of data from the head. Software can make sense of various encodings. Variable density encodi

    • it's likely the best way to do this is with a vintage Mac itself. Which then implies hooking up a mass storage device of some kind to that Mac

      Nah, there are ethernet cards for vintage macs, and you can use netatalk to provide them with file services. This makes a lot more sense today than messing with antique SCSI drives. You also don't have to get very vintage to read those floppies, there are lots of machines out there which will do it.

    • If it's a Mac there are a lot of them still working. I have a Mac SE with 800K drives. It also has a hard drive that still works. They made Ethernet cards for it to which I do not have, drat it. But it will LocalTalk to the G3 which does have Ethernet.

      You could also use the serial ports the old fashioned way.

      • I used to and might even still have a localtalk to ethertalk bridge. It was made by one of the handful of ubiquitous 68k era suppliers that made everything for macs back then, but I forget which.

        BITD I used to run netatalk, samba, and nfs on the same Linux box for a geek house. Everybody could access their homes and a shared drive.

        They also made localtalk cards for PCs.

        I am due to make another pass through my cables and crap bins and free another one up, I might still have an ethernet card which I think is

        • Adrian's Digital Workshop has videos on repairing old computers including Macs. If you are inclined to try to fix the SE you could look there. If you can figure out where to send it you could even donate it him for a video repair.

    • by tlhIngan ( 30335 )

      I'm reading a lot of "it's simple, just get a...". If you read the article, it says they're "associated with an early Mac computer". That almost certainly means these a GCR formatted disks, and need a drive that can do variable speed rotation.

      It's not impossible obviously, but it's likely the best way to do this is with a vintage Mac itself. Which then implies hooking up a mass storage device of some kind to that Mac so that it can be transferred to something more modern. So not super rare and impossible, b

    • by Megane ( 129182 )
      Modern archival floppy disk readers (aka Greaseweasel) don't care about drive speed, they just record the time between flux transitions. Drive speed is then handled by the decoding software.
  • Uh, "DIR A:"?

    It's not like it's a challenge to look at the time stamps on the files for DOS diskettes.
    • Re: (Score:3, Informative)

      A lot of the older DOS systems (XT and the like) didn't have a clock built in. If you wanted one you had to buy a separate card and run a little program as part of your autoexec.bat to set the current date and time.

      Otherwise, unless you specifically entered the date and time when you booted up the system, your current date and time was midnight of January 1, 1980.

      So a great many DOS files are dated January 1, 1980 regardless of when they were actually written.

  • Made him go from floppy to hard faster than the speed of light.

  • Trained archivist are experts in data storage and retrieval techniques and technology. If they need any help I've got an IBM 8" floppy drive that still works. A whole 3000 80x132 punch card storage. Who would ever need more than 640K memory ?

  • Check out CuriousMarc on YouTube. Those guys can definitely read anything out there.
  • Now imagine if all this was not on floppies or DVDs or pen drives but on some cloud service that was the flavor of the day during the time. And is no longer operational. Or wants to do 2FA with Hawking's phone or email.

    • Luckily my couple of 1TB external SSDs will ensure humanity is not deprived of my 'hidden' knowledge and umm.. carefully selected videos

    • Yup. There's essentially zero archival possibility for cloud or anything using an API, js, dynamic downloads, or web logins. It's basically digital damnatio memoriae that shits on anyone who cares about preservation of the past.
  • And find multiple articles of these because the problem with floppies include magnetic bit rot and end-user modification. Proper archival of old floppies is really difficult to get pristine artifacts.

    Also, if anyone has good floppy images for Northgate keyboards or Thrustmaster F-16 HOTAS, I'm still missing them. ):
  • by Bu11etmagnet ( 1071376 ) on Saturday October 11, 2025 @04:45AM (#65718042)

    How can people be trapped on old floppy disks?

  • Floppies are very volatile. Stuff that old they will likely lose about 1/4 of the data, based on my personal experience with hundreds of 20-30 year old floppies I recovered a couple of years ago. Even the brand name high quality ones.

    • by Megane ( 129182 )

      I imaged all my 1980s TRS-80 floppy disks around 2008 or so. The only read errors were original errors from back in the day. Sometimes you could even see the bad spot on the disk.

      I wouldn't feel so confident about 1990s 3.5" HD floppies, but that's because they were crap, especially the cheap 25-pack disks. They were always bad and don't get worse with time. It's more likely that you had problems from misaligned or dirty drive heads.

Those who can, do; those who can't, simulate.

Working...