File formats:
Risks to Longevity

Peter Bubestinger-Steindl

2022-04-26

Risks to format longevity

  • Data errors
  • Obsolescence
  • Vendor lock-in
  • Interoperability/complexity issues

Countermeasures?

Data errors: Error resilience?

  • Bitstream checksums:
    Ability to know if the content is intact.

  • Error concealment: Optional choice of decoder to “mask” decoding issues. (decoder specific)

  • Make backup copies! 😇

Obsolescence / Vendor Lock-in

Open vs Closed

Enigma encryption rotor windows
Enigma encryption rotor windows

Theory vs Practice

“Implementation overrules paper specs. Always.”

Data Structure (Hex Editor)

Hex view of WAV header (annotated)

(Documented?) Data Structure

The Eternal Replayer

+ =

Format Complexity

Format Complexity: Less is More

Good rule = “Minimalistic Data Format”:

  • As simple as possible
  • As complicated as necessary

Simpler = more stable, easier to use, keep alive, reconstruct or fix.