Can these Open Source products be integrated and meet all the needs of POWRR? Michele Kimpton, DuraSpace CEO DuraSpace

Size: px
Start display at page:

Download "Can these Open Source products be integrated and meet all the needs of POWRR? Michele Kimpton, DuraSpace CEO DuraSpace"

Transcription

1 Can these Open Source products be integrated and meet all the needs of POWRR? Michele Kimpton, DuraSpace CEO DuraSpace

2 What is DuraCloud? Open Source Archiving and preserva<on pla=orm runs in the cloud S3 and Glacier SDSC Rackspace

3 Features Can replicate and manage content to mul<ple cloud storage loca<ons through single dashboard Automated health checking and repor<ng( independent of the storage provider) All content accessible 24/7 through web interface Easy to use ingest tools No proprietary packaging, no lock in Built on Open source sonware for full transparency

4 What DuraCloud does not do Not a repository ( like Fedora, DSpace, Islandora) Does not create Archival Informa<on Packages( this needs to be done before ingest) No public facing portal to showcase content Does not produce access deriva<ves No emula<on, or migra<on These features typically provided by other applica<ons integrated with DuraCloud

5 What is Archivematica?! Archivema<ca is a free and open- source digital preserva<on system designed to maintain standards- based, long- term access to digital objects! Archivema<ca allows users to process digital objects from ingest to access in conformance with the ISO- OAIS func<onal model! Archivema<ca is designed to output high- quality, standards- compliant Archival Informa<on Packages (AIPs) - Bagit, METS, PREMIS

6 Archivema<ca micro- services Approve transfer Assign file UUIDs and checksums Verify transfer checksums Generate METS.xml Quaran<ne Scan for viruses Clean up names Iden<fy file format Validate format Characterize and extract metadata Index transfer Create SIP Verify SIP compliance Add metadata Normalize Process submission documenta<on Prepare DIP Upload DIP Prepare AIP Generate AIP METS file Index METS file Bag AIP Compress AIP Create AIP pointer file Store AIP

7 archivematica

8 Both of these projects are Open Source Community driven Sustainability Transparency Open formats No vendor lock in Community Governance

9 Pilot partners Illinois Wesleyan University State Archives North Carolina University of Texas Hun<ngton Library Phillips Academy U of Washington Libraries Pepperdine University Berea College Kansas State University Libraries

10 Use cases Each ins<tu<on has selected content Iden<fied accompanying materials ( metadata, submission documenta<on, access files, service files) Iden<fied materials loca<on Order to files within set Workflow into Archivema<ca determined

11 Observations Truly diverse sets of content Many common use cases: i.e., Ins<tu<onal photos and communica<ons Lots of descrip<ve metadata but from variety of sources( systems, spreadsheets) On mul<ple media throughout ins<tu<on Few have submission documenta<on Lidle IT support

12 Types of content Audio, video, books, periodicals, disserta<ons, government publica<ons, researcher site, websites, facebook pages, flickrr collec<on, content dm objects, photographs, lectures

13 Steps " Define use cases, select content " Upload content into DuraCloud Process through Archivema<ca applica<on Create AIP packages Deposit content into DuraCloud Retrieve AIP package for download and review Test applica<on in accordance with POWRR grid

14 Output Publica<on of use cases tested and workflow established for variety of content types Does the Archivema<ca/DuraCloud sonware package meet all the elements of the POWWR grid? Is the sonware flexible enough to meet all use cases? What are the limita<ons if the sonware is hosted? Which use cases are out of scope?

15 Connecting our projects within the ecosystem

16 Web: Thank- you! POWRR: hdp://digitalpowrr.niu.edu/tool- grid/ Documenta<on hdp://wiki.duraspace/display/duracloud hdps:// For more info sign up: archivema<ca