Long-term Preservation in the Cloud: Data Management and Architecture Overview

undefined
 
L
o
n
g
-
t
e
r
m
 
p
r
e
s
e
r
v
a
t
i
o
n
 
i
n
 
t
h
e
 
c
l
o
u
d
L
T
P
-
S
a
a
S
 
Dr. Claus-Peter Klas
Prof. Matthias Hemmje
 
D
a
t
a
 
M
a
n
a
g
e
m
e
n
t
 
L
i
f
e
 
C
y
c
l
e
 
a
n
d
 
R
o
l
e
s
 
C
r
e
a
t
i
o
n
(Producer,
Archivist,
Consumer)
 
A
s
s
e
m
b
l
y
(Producer,
Archivist)
 
A
d
o
p
t
i
o
n
(Archivist,
Consumer)
 
R
e
u
s
e
(Consumer
,
Producer,
 Archivist)
 
A
r
c
h
i
v
a
l
(Archivist)
 
G
e
n
e
r
a
l
 
A
r
c
h
i
t
e
c
t
u
r
e
OAIS
Compliant
Archive
Collaborative
Task-based
Search &
Access
Index
Search
User
Interfaces
Web
Mobile
Community
Based
Data
Management
Access
Ingest
Re-Use
 
G
e
n
e
r
a
l
 
A
r
c
h
i
t
e
c
t
u
r
e
OAIS
Compliant
Archive
AIP
Collaborative
Task-based
Search &
Access
User
Interfaces
Web
Mobile
Data
Management
Preservation
Aware and
Provenance
DIP
SIP
URI/URN
 
Metadata
Full-text
Multi-Media
Provenance
 
URI/URN
Thumbnail
 
P
r
e
s
e
r
v
a
t
i
o
n
 
A
w
a
r
e
 
D
a
t
a
 
M
a
n
a
g
e
m
e
n
t
 
Data management: Dropbox equivalent online storage for direct and dynamic data
handling during information creation time
Should be preservation aware, meaning gathering basic, dublin core like, metadata about the
current task, project, persons etc.
Gathering provenance information about access, re-use, transformations
 
O
A
I
S
 
C
o
m
p
l
a
i
n
t
 
A
r
c
h
i
v
e
 
Ingest: Packaging Tool (SciDIP)
PG Prototyp
LTP System Cologne
 
C
o
l
l
a
b
o
r
a
t
i
v
e
 
T
a
s
k
-
b
a
s
e
d
S
e
a
r
c
h
 
&
 
A
c
c
e
s
s
 
ElasticSearch Server for Searching
Metadata
Provenance
Full-Text
Pictures
Video
REST Interface for UI
REST Access Interface to index from Archive
 
U
s
e
r
 
I
n
t
e
r
f
a
c
e
 
Vaadin based Web Interface
Mobile Interface (maybe also Vaadin)
 
C
o
m
p
l
i
a
n
c
e
 
w
i
t
h
 
E
G
I
 
C
l
o
u
d
 
S
e
r
v
i
c
e
s
 
All LTP services will run on dedicated virtual machines
EGI monitoring service will supervision the virtual machines (VO)
Furthermore monitoring services should monitor services in virtual machines, e.g. at least for
a life sign aka ping
Potential response:
Notification of administrator
Automatic recovery, replacement or extension of virtual machine
EGI backup service to store the virtual machines, setups, etc. (Check by EGI:
Application DB)
EGI User handling and authorization and “single point of authorization” (Certificate,
not Facebook account…
Roles:
Curator Certificate
Consumer: Open Access … (Robot Certificate: Service needs to be saved)
Workflows on dynamic extensions of storage and CPU based on service level
agreements and monetary resources
Easy and flexible “billing”
 
S
t
a
r
t
i
n
g
 
R
e
q
u
i
r
e
m
e
n
t
s
 
1-3 VMWares: Standard Ubuntu LTS with 1 CPU & 300GB Diskspace & 4GB RAM
Start: 1 TB Shared Data Storage
Slide Note
Embed
Share

Exploring the intricate landscape of long-term preservation in the cloud, this material delves into data management life cycles, roles, preservation awareness, collaborative search and access mechanisms, OAIS compliance, and user interface considerations. It highlights the importance of proactive preservation practices, metadata gathering, and the use of advanced tools and interfaces for efficient data handling and retrieval in cloud-based environments.

  • Cloud Preservation
  • Data Management
  • OAIS Compliance
  • Collaborative Search
  • User Interface

Uploaded on Sep 11, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Long-term preservation in the cloud LTP-SaaS Dr. Claus-Peter Klas Prof. Matthias Hemmje

  2. Data Management Life Cycle and Roles Reuse (Consumer , Producer, Archivist) Creation (Producer, Archivist, Consumer) Reuse Creation Adoption Assembly Assembl y (Producer, Archivist) Adoption (Archivist, Consumer) Pre-Ingest Post-Access Archival Ingest Access Archival (Archivist)

  3. General Architecture Community Based Data Management Re-Use Ingest Collaborative Task-based Search & Access User OAIS Compliant Archive Interfaces Web Mobile Index Search Access

  4. General Architecture Data Management Preservation Aware and Provenance URI/URN SIP Metadata Full-text Multi-Media Provenance OAIS Compliant Archive AIP User Collaborative Task-based Search & Access URI/URN Thumbnail Interfaces Web Mobile DIP

  5. Preservation Aware Data Management Data management: Dropbox equivalent online storage for direct and dynamic data handling during information creation time Should be preservation aware, meaning gathering basic, dublin core like, metadata about the current task, project, persons etc. Gathering provenance information about access, re-use, transformations

  6. OAIS Complaint Archive Ingest: Packaging Tool (SciDIP) PG Prototyp LTP System Cologne

  7. Collaborative Task-based Search & Access ElasticSearch Server for Searching Metadata Provenance Full-Text Pictures Video REST Interface for UI REST Access Interface to index from Archive

  8. User Interface Vaadin based Web Interface Mobile Interface (maybe also Vaadin)

  9. Compliance with EGI Cloud Services All LTP services will run on dedicated virtual machines EGI monitoring service will supervision the virtual machines (VO) Furthermore monitoring services should monitor services in virtual machines, e.g. at least for a life sign aka ping Potential response: Notification of administrator Automatic recovery, replacement or extension of virtual machine EGI backup service to store the virtual machines, setups, etc. (Check by EGI: Application DB) EGI User handling and authorization and single point of authorization (Certificate, not Facebook account Roles: Curator Certificate Consumer: Open Access (Robot Certificate: Service needs to be saved) Workflows on dynamic extensions of storage and CPU based on service level agreements and monetary resources Easy and flexible billing

  10. Starting Requirements 1-3 VMWares: Standard Ubuntu LTS with 1 CPU & 300GB Diskspace & 4GB RAM Start: 1 TB Shared Data Storage

Related


More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#