Long-term Preservation in the Cloud: Data Management and Architecture Overview
This presentation outlines long-term preservation in the cloud offered as a service (LTP-SaaS). It covers the data management life cycle and its roles, preservation-aware data management with metadata and provenance gathering, an OAIS-compliant archive, collaborative task-based search and access, and web and mobile user interfaces. It closes with requirements for running the services on EGI cloud infrastructure.
Presentation Transcript
Long-term preservation in the cloud (LTP-SaaS). Dr. Claus-Peter Klas, Prof. Matthias Hemmje
Data Management Life Cycle and Roles (diagram). Phases and roles: Creation (Producer, Archivist, Consumer); Assembly (Producer, Archivist); Archival (Archivist); Adoption (Archivist, Consumer); Reuse (Consumer, Producer, Archivist). Stage labels: Pre-Ingest, Ingest, Archival, Access, Post-Access.
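The phase-to-role assignment on the slide above can be written down as a small lookup table. This is only a sketch: the names `Role`, `LIFE_CYCLE`, `phases_for`, and `next_phase` are hypothetical, and the phase order (Creation, Assembly, Archival, Adoption, Reuse, then back to Creation) is an assumption read from the diagram, not something the slide states.

```python
from enum import Enum


class Role(Enum):
    PRODUCER = "Producer"
    ARCHIVIST = "Archivist"
    CONSUMER = "Consumer"


# Roles per phase as listed on the slide.
# The ordering of the phases is an assumption.
LIFE_CYCLE = {
    "Creation": {Role.PRODUCER, Role.ARCHIVIST, Role.CONSUMER},
    "Assembly": {Role.PRODUCER, Role.ARCHIVIST},
    "Archival": {Role.ARCHIVIST},
    "Adoption": {Role.ARCHIVIST, Role.CONSUMER},
    "Reuse": {Role.CONSUMER, Role.PRODUCER, Role.ARCHIVIST},
}


def phases_for(role):
    """Return the phases a role takes part in, in life-cycle order."""
    return [phase for phase, roles in LIFE_CYCLE.items() if role in roles]


def next_phase(phase):
    """Return the phase that follows, wrapping from the last to the first."""
    phases = list(LIFE_CYCLE)
    return phases[(phases.index(phase) + 1) % len(phases)]
```

For example, `phases_for(Role.ARCHIVIST)` lists all five phases, which matches the slide: the archivist is the only role present throughout the cycle.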
General Architecture (diagram). Components: Community Based Data Management; OAIS Compliant Archive; Collaborative Task-based Search & Access; User Interfaces (Web, Mobile). Flow labels: Re-Use, Ingest, Index, Search, Access.
General Architecture (diagram, detailed). Components: Preservation Aware Data Management and Provenance; OAIS Compliant Archive; Collaborative Task-based Search & Access; User Interfaces (Web, Mobile). Package and data labels: SIP, AIP, DIP, URI/URN, Metadata, Full-text, Multi-Media, Provenance, Thumbnail.
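The SIP, AIP, and DIP labels in the diagram are the three OAIS information packages (submission, archival, dissemination). The sketch below illustrates one way the hand-over between them could look. Everything in it is an assumption for illustration: the class fields, the `urn:example:ltp` namespace, and the SHA-256 fixity check are not taken from the slides.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class SIP:
    """Submission Information Package handed over by the producer."""
    content: bytes
    metadata: dict


@dataclass(frozen=True)
class AIP:
    """Archival Information Package kept inside the archive."""
    urn: str
    content: bytes
    metadata: dict
    checksum: str


@dataclass(frozen=True)
class DIP:
    """Dissemination Information Package delivered to the consumer."""
    urn: str
    content: bytes
    metadata: dict


def ingest(sip, namespace="urn:example:ltp"):
    """Turn a SIP into an AIP with a persistent identifier (hypothetical scheme)."""
    checksum = hashlib.sha256(sip.content).hexdigest()
    urn = f"{namespace}:{checksum[:16]}"
    return AIP(urn, sip.content, dict(sip.metadata), checksum)


def disseminate(aip):
    """Build a DIP from an AIP after verifying the stored checksum."""
    if hashlib.sha256(aip.content).hexdigest() != aip.checksum:
        raise ValueError("fixity check failed for " + aip.urn)
    return DIP(aip.urn, aip.content, dict(aip.metadata))
```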
Preservation Aware Data Management: Data management here means Dropbox-equivalent online storage for direct and dynamic data handling at information creation time. It should be preservation aware, meaning that it gathers basic, Dublin Core-like metadata about the current task, project, persons, etc., and provenance information about access, re-use, and transformations.
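As a rough illustration of what "preservation aware" could mean in practice, the sketch below attaches a small Dublin Core-like record and a provenance log to a stored object. The field selection and the names `PreservationRecord`, `ProvenanceEvent`, and `log` are assumptions made for this example; the slides do not define a schema.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Provenance actions named on the slide.
ACTIONS = {"access", "re-use", "transformation"}


@dataclass
class ProvenanceEvent:
    action: str
    agent: str
    timestamp: str  # ISO 8601, UTC


@dataclass
class PreservationRecord:
    # Dublin Core-like descriptive fields (hypothetical selection).
    identifier: str
    title: str
    creator: str
    # Context gathered at creation time, as described on the slide.
    project: str
    task: str
    provenance: list = field(default_factory=list)

    def log(self, action, agent, when=None):
        """Append a provenance event; reject actions the slide does not name."""
        if action not in ACTIONS:
            raise ValueError(f"unknown provenance action: {action}")
        when = when or datetime.now(timezone.utc)
        self.provenance.append(ProvenanceEvent(action, agent, when.isoformat()))
```

A record would be created when a file is first stored, and `log("access", ...)` would be called by the storage layer on each later operation, so that the provenance trail already exists when the object reaches ingest.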
OAIS Compliant Archive: Ingest: Packaging Tool (SciDIP); PG Prototype LTP System Cologne
Collaborative Task-based Search & Access: ElasticSearch server for searching metadata, provenance, full text, pictures, and video. REST interface for the UI. REST access interface to index from the archive.
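A search request against such an ElasticSearch index could be built as follows. This is a minimal sketch that only constructs the JSON body using the standard Query DSL (`bool`, `multi_match`, `term`); it sends nothing over the network. The field names `title`, `description`, `fulltext`, `provenance`, and `media_type` are hypothetical, since the slides do not describe the index mapping.

```python
import json


def build_search_body(text, media_type=None, size=10):
    """Build an Elasticsearch Query DSL body for a combined search.

    Field names are assumptions, not taken from the presentation.
    """
    filters = []
    if media_type is not None:
        # Exact-match filter, e.g. "picture" or "video".
        filters.append({"term": {"media_type": media_type}})
    return {
        "size": size,
        "query": {
            "bool": {
                "must": [
                    {
                        "multi_match": {
                            "query": text,
                            "fields": ["title", "description", "fulltext", "provenance"],
                        }
                    }
                ],
                "filter": filters,
            }
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_search_body("climate data", media_type="video"), indent=2))
```

The UI's REST layer would send this body in a search request to the index; keeping the query construction in one function makes it easy to add further filters (project, task, person) later.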
User Interface: Vaadin-based web interface; mobile interface (maybe also Vaadin).
Compliance with EGI Cloud Services: All LTP services will run on dedicated virtual machines. The EGI monitoring service will supervise the virtual machines (VO). Furthermore, monitoring services should monitor the services inside the virtual machines, at least for a life sign (a ping). Potential responses: notification of the administrator; automatic recovery, replacement, or extension of the virtual machine. EGI backup service to store the virtual machines, setups, etc. (check by EGI: Application DB). EGI user handling and authorization with a single point of authorization (certificate, not Facebook account). Roles: Curator (certificate); Consumer (open access; robot certificate: service needs to be saved). Workflows for dynamic extension of storage and CPU based on service level agreements and monetary resources. Easy and flexible billing.
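The life-sign check and the escalation from notification to automatic recovery could be sketched as below. Note the substitution: instead of an ICMP ping, which usually needs elevated privileges, this example tests whether a TCP port accepts connections. The function names and the failure thresholds are assumptions; this is not an EGI monitoring API.

```python
import socket


def has_life_sign(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def choose_response(consecutive_failures, notify_after=1, recover_after=3):
    """Map a failure count to one of the responses named on the slide.

    The thresholds are hypothetical defaults.
    """
    if consecutive_failures >= recover_after:
        return "recover_vm"
    if consecutive_failures >= notify_after:
        return "notify_administrator"
    return "none"
```

A monitoring loop would call `has_life_sign` at a fixed interval, count consecutive failures per service, and act on the value returned by `choose_response`.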
Starting Requirements: 1-3 VMware virtual machines: standard Ubuntu LTS with 1 CPU, 300 GB disk space, and 4 GB RAM. Start: 1 TB shared data storage.