Collection Management Systems - Considerations.

Peter Bubestinger-Steindl
(peter @ ArkThis.com)

Managing Digital Assets

Preservation Collection Digital Asset Management Systems?

Let’s map all of this digitally!

Puh. Where to begin?

It’s a database and some UI possibly referring to files. Designed to find and organize stuff.

It’s a commitment.

Serious long term. So choose and configure well… 🧙️🌟️

A loooong list of (70+) systems… 😮️

https://bits.ashleyblewer.com/blog/2017/08/09/collection-management-system-collection/

Condensed update (50+)

kb.kulturerbe-digital.at/books/empfehlungen-software/page/vergleich-sammlungsverwaltungssysteme

A short list

of some popular OpenSource CMS:

Not all DAMs are created equal.

  • Which use-cases was it designed for?
  • Proper media-type handling? (image, docs, AV?)
  • Multiple data schemas in parallel?
  • Local app vs browser-based?
  • License model?
  • Setup/maintenance complexity?

CMS Criteria - Examples

Basic Features?

  • Multilingual
  • Permissions
  • Physical Location
  • Reporting
  • Rights
  • Web-Access

Advanced Handling?

  • Batch Edit
  • Data Model & Metadata Schema
  • Preservation Storage
  • Workflow/Task Management
  • Loan/request

Future Proof & Interoperable?

  • Open Source
  • Import/Export
  • API
  • IIIF

Who you gonna call?

  • Paid Support
  • Community

Language matters.

For tech: English, please? 😇️

  • All field-names
  • All tech-terms
  • Documentation
  • but: UI and term-labels multi-language

Date and Time Formats

A classic.
Never gets “old”. 😜️

Date Formats

Dates Times
January 3rd, 1981 13:37
3. Jänner 1981 13:37:00
1/3/1981 0h13m37s
3.1.1981 123.4
1977-11-23-02

Timezones?
Other calendar systems…?

ISO 8601. 🤩️

  • Date:
    1981-01-03
  • Date & Time (+TZ):
    1981-01-03T00:12:34+02:00
  • Week (+weekday):
    2022-W46-5
  • Duration:
    PnYnMnDTnHnMnS

See: (Wikipedia) ISO 8601

Duration / Period

  • Support having “from/to” in date fields?
  • With different, even mixed “precision”?
    • “1920-1923”
    • “1920-01 - 1923”

Got “Fuzzy” Dates…?

  • 1910-01?
  • circa 1800?
  • Bronce age
  • WWII
  • etc.

Oh, btw: Got a common syntax?

Partially Approximately Uncertain?

  • ?”: Uncertain
  • ~”: Approximate (circa)
  • %”: Both uncertain & approximate
  • X”: Unspecified.

1981-01?, 20XX, 197X-11?-XX (https://www.loc.gov/standards/datetime/iso-tc154-wg5_n0039_iso_wd_8601-2_2016-02-16.pdf)

The Identifer

A “must have”. Also known as:

Everyone’s got one! 😉️

Examples

  • V-00815
  • W/S #00034
  • 38AF2EC1A13494B9DF6FD6E75960307
  • FBW002984
  • 8-717774-330128
  • adBDwKf_aSE
  • Q83697636

PIDs - Persistent Identifiers

Example: NHMW-ZOO-FS50750

“[PIDs] provide [means for offering] a long-lasting click-able link to a digital object.”

(Source: https://tanc-ahrc.github.io/HeritagePIDs/

PIDs - Infos

ID Requirements

  • Unique? Persistent?
  • Stable syntax?
  • Recognizable?
  • Scope?
  • Filename-safe?
  • Printable? (QR- or barcode label)

DAM-handle it?

  • Auto-generate?
  • Increment?
  • Validate?
  • More than 1 syntax?

Common terms?

“We’ve always called it that way, so it’s clear.”

  • Person, Company, Actor or: Agent?
  • Recording, Film, Video or: MovingImage?
  • Action, Shooting or: Event?
  • Author or: Creator?
  • etc.

Can you re-label terms in the UI?

Controlled Vocabularies

“Controlled vocabulary schemes mandate the use of predefined, authorised terms that have been preselected by the designers of the schemes, in contrast to natural language vocabularies, which have no such restriction.”

Source: “Controlled Vocabulary” (Wikipedia)”

Why CoVocs / Taxonomies?

Same-same 😇, but different… 😭

  • 35mm = 35 mm = 35 millimètre
  • dup pos = duplicate positive
  • de = deu = german = German = alemán
  • yuv422p10le = YUV, 4:2:2, 10 bpc
  • Director = Directed by = Regisseur = Regie

Handling CoVocs

  • Clear choice/definition of terms?
  • Standard terms? (e.g. ISO mapping for Language, Country, Region, Tech-Terms, etc.)
  • Keyboard shortcuts to select specific terms?
  • Nested hierarchies?!
  • Maintaining / augmenting terms?
  • Identifier vs Label?

Identifier & Label

Can the MAM store & handle this properly?

Identifier(for machines) Label(for humans>) Translations?
sq, sqi, alb, … Albanian Albanisch(de), Albanais(fr)
de, deu, ger, … German Deutsch(de), Allemand(fr)
und Undetermined …?

Examples: ISO 639 (Languages), ISO 3166 (Countries),…

Shared CoVocs

  • Same vocabulary terms/lists can be used across different systems/domains.
  • Use existing CoVocs where possible.
  • Share/publish yours.
  • Be in contact/exchange with others.
  • Be ready for embracing trade-offs.

Dynamic CoVocs

How does your DAM deal with:

  • Adding new terms (on the fly)?
  • Editing terms that have already been assigned…? 😱️
  • Import / export lists?

Shared Identifiers

If you’re not the only one referring to …

Wouldn’t it make sense to have a common, shared identifier?

Handling Multiple IDs?

Add attribute option declaring “which” identifier:

<id schema="dnb"> ... </id>
<id schema="efg"> ... </id>
<id schema="wikidata"> Q84199 </id>

Wikidata

WikiData Logo

“[…] is a free and open knowledge base that can be read and edited by both humans and machines.”

Example: Earth (Q2) vs Earth (Q83697636)

Data Structure Matters

“classical” catalogue = flat, table/cell thinking.

Example elgrito.witness.org

Relationships

  • Forward: Agent1 is parent of Agent2
  • Reverse: Agent2 is child of Agent1

Annotating relationships

  • For which time-period was the relationship valid? (Agent “married to” Agent)

  • Anything else you’d like to add?

Relationship queries?

(How well) can a system handle/search relationship-details?

  • Which Work(s) feature which Agent(s) under the alias of …?
  • List all … related to a work, in order of their …?

More than just 1 standard/schema?

Great! Now what about the “film-related” materials? 🤨️ (posters, documents, etc)

Import / Export

Import / Export: Considerations

  • (Nested) Lists & CoVocs?
  • Link cataloguing data (DB) & files? (storage)
  • Programming language skills required?
  • Value/term/schema mapping options?
  • Metadata layout (schema) valid?

Reference between Catalogue and Files

  • How are catalogue entries related to files (and vice-versa)?
  • How (in)dependent of the catalogue can files be used/modified?

Filename & Folder structure

  • Many DAMs do that automatically.
  • In a non-human readable layout (UUIDs).
  • Which level of control do you need over this?
  • Got external workflows accessing the files/data?

Files intact, but DAM gone?

If your files have PIDs, you’re lucky!

Access & Control

APIs: Whatfor?

Access & exchange your data…

  • … with other institutions.
  • … by external tools.
  • … for backup/migration.
  • … to use your data differently.
  • etc.

APIs: Whatfor?

Remote Control!

  • Common backend + different UIs.
  • Automate workflows/actions externally.
  • etc.
  • And: Simply (more) future-proof. 😎️ 🤓️

Does it scale?

Ask experienced tech staff.

  • Speed? (search, retrieve, data, files, etc)
  • Cache & tuning options?
  • Synchronize multiple instances?
  • Ongoing import/export? (in-house, external)
  • License conditions and impact?
  • Multiple data schemas in parallel?
  • Multi-language support?
  • Migration to a “better scaling” system when needed?

Got Plugins?

  • Usually an all-in-one monolith is rigid and hard to maintain.
  • Preferred: Slim core + additional features as plugins.
  • Necessary: Plugin interfaces open & documented.

Got Support?

  • Whom to call in case of …?
  • Option to switch/choose “your mechanic”?
  • Under which conditions?
  • Support contracts?
  • Which license model?
  • Train your staff?
  • Proper documentation?

Good practice

  • Get access to a demo installation (local or online).
  • Try to import/export lists/CoVocs.
  • Consider import/export/ui-feel with your data (layout). (Before you make any final choice)
  • Get cataloguers & technicians together.
  • Get help from experts with your tool-choice/use-cases.
  • As simple as possible, as complicated as necessary.

Have a disaster-plan ready.

  • Backups?
  • Failover system?
  • Got support contracts?
  • Aware of migration requirements?

The End

Questions?

Comments?

Peter Bubestinger-Steindl

info@kulturerbe-digital.at peter @ ArkThis.com

Oh, btw: “Says who?”

  • Declare information sources?
  • At which level? (record? field?)
  • Who edited which field when and why?
  • Could a source be “linked”? (as relationship)
  • What if you have ≥ 1 sources?

Oh, there’s a typo…

  • Should we correct it?
  • Is it even a typo, or was it proper “back then”?
  • Can we keep multiple “versions”?
  • (Where) can we document that? (commit msg)