Ask Slashdot: Best Way To Store Data In Hard Copy? 329
First time accepted submitter bmearns writes "I have some simple plain-text files (e.g., account information) that I want to print on paper and store in my firebox as a backup to my backup. What's the best way to encode the data for print so that it can later be restored to digital form? I've considered just printing it as text and using OCR to recover it. The upsides are that it's easy and I can even access the information without a computer if necessary. Downsides are data density, no encryption, no error correction, and how well does OCR work, anyway? Another option is printing 2D barcodes. Upsides are density, error correction, I could encrypt the data before printing. Downsides are that I'll need to split it up into multiple barcodes due to maximum capacity of popular barcode formats, and I can't access the data without a computer. Did I miss any options? What do slashdotters suggest?"
Use the correct font for OCR (Score:5, Informative)
Google for OCR-A and OCR-B as TTF. There are freely available versions. I use them for mailing labels, along with PostNet bar codes to make it as easy as possible for the Post Office.
This. (Score:4, Informative)
QR codes include error correction, and can encode binary data on the order of a hundred times the density of a regular bar code.
Re:Text, but why? (Score:5, Informative)
No need to worry about ink: even the cheapest and nastiest laser printers use toner, and a mixture of thermoplastic and carbon black thermally fused to your paper isn't going anywhere(in fact, if you use lousy enough paper, some lucky future archeology intern may have the... unmixed pleasure... of picking the little plastic character glyphs out of the pile of dust, trying to keep them in their original order!).
His data-restore needs probably don't extend to truly epic lengths in any case, so it shouldn't be a big deal.
Print it out with check digits (Score:5, Informative)
http://www.pgpi.org/pgpi/project/scanning/ [pgpi.org]
Re:Whatever you do.... Use papyrus (Score:4, Informative)
Re:Easy (Score:4, Informative)
Modern barcodes like PDF417, QR, and Data Matrix have robust error correction built into their spec and can take a lot of damage. If you're really wanting to print stuff on paper as a backup, these are definitely your best bet.
Personally, I'd just encrypt and shove into a few different off-site backups.
Re:QR codes? (Score:5, Informative)
Here's a link for Paperbak: http://ollydbg.de/Paperbak/ [ollydbg.de]
Re:Text, but why? (Score:4, Informative)
This is silly.
Without a computer he will have no need of this data. It's account data! What good is that without a computer? (Left unsaid is what kind of accounts we are talking about. If computer accounts you simply don't need it. If financial accounts you might temporarily need to work with paper till you restore the computers).
Why not a second or third or fourth backup at a different location all in common computer readable form?
Planning to scan in paper is far more complex than just a conventional backup on common media with a copy off site.
Re:Text, but why? (Score:5, Informative)
Sadly, many of my old dot matrix and teletype printouts have faded as much as any other liquid ink I've used. It depends entirely on the ink in the ribbon. The liquid ink present in ordinary ribbons was often of wildly varying quality, and most people who bought those ribbons in bulk sought out the cheapest possible ribbons. I wouldn't bet on their longevity.
Laser printed pages consist of carbon in plastic, and there's no reason they shouldn't last a century or more, as long as certain conditions are met: if the toner is properly fused to the paper, if the paper doesn't degrade beneath it, and if the facing page doesn't adhere to the toner.
1. Your printer should have the right temperature set in the fuser, and that's probably not even adjustable to you. If the toner comes out dusty or smeary, it's too cool. If it comes out brown and crispy, it's too hot :-) You should recognize it immediately if the print quality is poor.
2. Store the paper properly. Heat is your enemy: don't let it get too hot, and don't store anything you want preserved in sunlight. Don't let it get damp - mold will destroy paper. Don't use crappy paper that will disintegrate - acid free is always the recommendation for long term storage. Horizontal stacks of paper will apply a lot of pressure to the sheets near the bottom of the stack, vertical hanging files reduce this pressure.
3. Watch out for printed sheets facing other printed sheets, (like double sided printing) where the toner from bottom side of the upper sheet can stick to the toner on the top side of the lower sheet. A horizontal stack of paper, especially in a hot environment, will apply a lot of pressure that cause the toners to fuse together where they touch. I've also had problems with toner adhering to vinyl sheets commonly found in 3 ring binders or binder covers.
Comment removed (Score:4, Informative)
Re:Use the correct font for OCR (Score:4, Informative)
Forget OCR-A, just use OCR-B because, unlike all of the fancy digitally efficient compressed hardcopy versions, you can actually read OCR-B without going blind.
Re: Easy (Score:5, Informative)
Spoken like somebody who was born in the plug and play PC age...
The MFM controller actually controlled the disk positioning, and so you need to know the physicals of the device to access it, hopefully avoiding trashing the disk.