B2HANDLE: Managing Digital Object Identifiers for Data Persistence

undefined
Minting and managing digital object identifiers
B2HANDLE
undefined
2
FAIR Digital Objects, PIDs and B2HANDLE motivation
B2HANDLE architecture and access
B2HANDLE services: Hosting, Profiles & Search
Integration with other EOSC services and workflow
example
Documentation and further information
10/7/2024
Content
undefined
3
Automated management of data objects and their metadata
-
early-to-mid data life cycle
Record connections between (meta)data objects, software,
workflows
-
Construction of B2HANDLE PID and metadata graph
-
Facilitate machine-driven workflows and realize the Intelligent
Data Fabric
10/7/2024
Motivation: Towards FAIR Digital
Object Management
CRUD
operations
undefined
4
Globally unique, persistent name identifying a digital
object
Persistency is achieved by registering the identifier in an
identifier registry
-
The registry works according to policies that ensure its
sustainability and trustworthiness
-
Persistency of the identifier does not necessarily imply
persistency of the identified resource
PID services provide registration and resolution
10/7/2024
Motivation: What are Persistent
Identifiers?
undefined
5
Persistent Identifiers for managing digital objects across
infrastructures
-
Interaction with objects first requires a reliable way to
address them
-
Independent of specific communities or services
Not to be confused with citation and credit scenarios
(e.g., via DOIs, DataCite)
-
B2HANDLE does not offer or enforce strong policies
Ultimately: Facilitate common operations on objects
-
Create, Copy, Move, Delete, Annotate
-
Versioning, replication, provenance tracking, object linking
10/7/2024
What does B2HANDLE target?
undefined
6
10/7/2024
Service architecture and interfaces
Scalable, hierarchical server federation fully integrated
with global Handle registry
-
Replication between B2HANDLE nodes
-
Central Catalog across B2HANDLE nodes
Added-value services at EUDAT B2HANDLE nodes
Global Handle
registry
B2HANDLE nodes
SURFsara, GRNET, SNIC, 
GWDG, MPCDF, ...
search
REST
Central
Catalog
undefined
7
Easy resolution via global Handle registry proxies
-
https://hdl.handle.net
HTTPS REST interface for CRUD operations on each node
Python client libraries (b2handle, pyhandle) for direct
integration with user scripts, applications and
middleware
-
easy access also to search interface
-
https://pypi.org/project/b2handle/
-
https://pypi.org/project/pyhandle/
10/7/2024
Service access
undefined
8
10/7/2024
Service options and attributes
undefined
9
Reliable hosting and resolution of PID namespaces
-
Well-proven infrastructure in place
-
Technical foundation is the Handle System
-
Namespaces provided through ePIC
Operational processes for namespace integration and
maintenance
-
Ensure balance across hosting nodes
-
Accommodate fail-over and retraction scenarios
10/7/2024
B2HANDLE: PID namespace hosting
undefined
10
Profiles determine the kernel information to keep with
every identifier
Essential information about the object
-
checksum, time stamp, parent, ...
B2HANDLE implements Kernel Information (RDA draft
recommendation):
-
Support machine-actionable services
-
Non-authoritative source for metadata
-
Stored directly at the resolver
-
W3C PROV compatible attributes
10/7/2024
B2HANDLE: Profile management
- ...
- ...
- ...
undefined
11
Fast reverse-lookups and searching/filtering across
B2HANDLE metadata space
Provides a metadata graph view
Two identical instances hosted by SurfSARA and GRNET
-
load-balancing and fail-over capacity
Procedures in place to integrate new B2HANDLE nodes
without service downtime
10/7/2024
B2HANDLE: Searching via the
Central PID Catalog
undefined
12
pyhandle Python library
-
Create, Read, Update, Delete of PIDs and Profile
information
-
Modes: Handle System REST interaction, direct database
operations, asynchronous batch mode
-
Interaction with Central PID Catalog
Profile migration tools and procedures
10/7/2024
B2HANDLE: Technical support for
PID and profile management
undefined
13
B2HANDLE is integrated with multiple other 
EOSC
services
, both generic and thematic
-
B2SAFE, B2SHARE, OneData, ECAS, ...
B2HANDLE is also used by 
user communities
 to manage
Handles for specific scenarios
-
Examples cases from CLARIN, ENES, Herbadrop, EPOS, ...
10/7/2024
B2HANDLE integration and use
undefined
14
10/7/2024
Full workflow example
Entry services
ENES
Community
workflow
Data Sources
Data & workflow
Sharing
ECAS work
environment
HPC
QC
early sharing,
community
review
 
versions,
errata IDs,
collection building
 
data input, versions,
 workflow link, instance/component,
collection building
PID pages
B2FIND
Information from all these steps is
gathered and linked through B2HANDLE
undefined
15
Public resolution is for free by nature of the Handle
System
Support libraries are free and open source software
Assignment of PIDs, prefixes, hosting and additional
services require costs to be covered
-
Ensure service reliability, scalability & trustworthiness
Detailed rates and conditions can be requested from
EUDAT or the B2HANDLE hosting partners
10/7/2024
Access policies and Funding models
undefined
16
How to request a PID namespace?
How to get namespace hosting?
How to establish a PID profile?
How to use the Central PID Catalog?
Contact B2HANDLE support at
-
https://eudat.eu/support-request?service=B2HANDLE
Consult user documentation at
-
https://www.eudat.eu/services/userdoc/b2handle
10/7/2024
How to get in touch with B2HANDLE
undefined
Thank you for
your attention!
 
 
Slide Note
Embed
Share

B2HANDLE is a service for minting and managing persistent identifiers (PIDs) to facilitate common operations on digital objects across infrastructures. The system enables automated management of data objects and metadata, facilitates machine-driven workflows, and supports the construction of PID and metadata graphs. With a focus on achieving FAIR data principles, B2HANDLE aims to ensure persistency and accessibility of identified resources, independent of specific communities or services. The service architecture includes scalable server federation, replication between nodes, and integrated global Handle registry for enhanced discoverability and management.

  • PIDs
  • Data Management
  • FAIR Data
  • Infrastructure
  • Metadata

Uploaded on Oct 07, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. B2HANDLE Minting and managing digital object identifiers eosc-hub.eu @EOSC_eu EOSC-hub receives funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No. 777536.

  2. Content FAIR Digital Objects, PIDs and B2HANDLE motivation B2HANDLE architecture and access B2HANDLE services: Hosting, Profiles & Search Integration with other EOSC services and workflow example Documentation and further information 10/7/2024 2

  3. Motivation: Towards FAIR Digital Object Management CRUD operations Automated management of data objects and their metadata - early-to-mid data life cycle Record connections between (meta)data objects, software, workflows - Construction of B2HANDLE PID and metadata graph - Facilitate machine-driven workflows and realize the Intelligent Data Fabric 10/7/2024 3

  4. Motivation: What are Persistent Identifiers? Globally unique, persistent name identifying a digital object Persistency is achieved by registering the identifier in an identifier registry - The registry works according to policies that ensure its sustainability and trustworthiness - Persistency of the identifier does not necessarily imply persistency of the identified resource PID services provide registration and resolution 10/7/2024 4

  5. What does B2HANDLE target? Persistent Identifiers for managing digital objects across infrastructures - Interaction with objects first requires a reliable way to address them - Independent of specific communities or services Not to be confused with citation and credit scenarios (e.g., via DOIs, DataCite) - B2HANDLE does not offer or enforce strong policies Ultimately: Facilitate common operations on objects - Create, Copy, Move, Delete, Annotate - Versioning, replication, provenance tracking, object linking 10/7/2024 5

  6. Service architecture and interfaces Scalable, hierarchical server federation fully integrated with global Handle registry - Replication between B2HANDLE nodes - Central Catalog across B2HANDLE nodes Added-value services at EUDAT B2HANDLE nodes Global Handle registry REST B2HANDLE nodes search Central Catalog SURFsara, GRNET, SNIC, GWDG, MPCDF, ... 10/7/2024 6

  7. Service access Easy resolution via global Handle registry proxies - https://hdl.handle.net HTTPS REST interface for CRUD operations on each node Python client libraries (b2handle, pyhandle) for direct integration with user scripts, applications and middleware - easy access also to search interface - https://pypi.org/project/b2handle/ - https://pypi.org/project/pyhandle/ 10/7/2024 7

  8. Service options and attributes Reliable hosting, resolution and replication Operational processes to ensure balance Ordering of Handle namespaces (prefixes) Hosting Profile development for clients Fully aligned with cross-community standards and recommendations Key concern for FAIR data Profiles Optional service for participating nodes Reverse-lookup and metadata search via Central PID Catalog Search 10/7/2024 8

  9. B2HANDLE: PID namespace hosting Reliable hosting and resolution of PID namespaces - Well-proven infrastructure in place - Technical foundation is the Handle System - Namespaces provided through ePIC Operational processes for namespace integration and maintenance - Ensure balance across hosting nodes - Accommodate fail-over and retraction scenarios 10/7/2024 9

  10. B2HANDLE: Profile management Profiles determine the kernel information to keep with every identifier Essential information about the object - checksum, time stamp, parent, ... - ... - ... - ... B2HANDLE implements Kernel Information (RDA draft recommendation): - Support machine-actionable services - Non-authoritative source for metadata - Stored directly at the resolver - W3C PROV compatible attributes 10/7/2024 10

  11. B2HANDLE: Searching via the Central PID Catalog Fast reverse-lookups and searching/filtering across B2HANDLE metadata space Provides a metadata graph view Two identical instances hosted by SurfSARA and GRNET - load-balancing and fail-over capacity Procedures in place to integrate new B2HANDLE nodes without service downtime 10/7/2024 11

  12. B2HANDLE: Technical support for PID and profile management pyhandle Python library - Create, Read, Update, Delete of PIDs and Profile information - Modes: Handle System REST interaction, direct database operations, asynchronous batch mode - Interaction with Central PID Catalog Profile migration tools and procedures 10/7/2024 12

  13. B2HANDLE integration and use B2HANDLE is integrated with multiple other EOSC services, both generic and thematic - B2SAFE, B2SHARE, OneData, ECAS, ... B2HANDLE is also used by user communities to manage Handles for specific scenarios - Examples cases from CLARIN, ENES, Herbadrop, EPOS, ... 10/7/2024 13

  14. Full workflow example ECAS work environment Data & workflow Sharing ENES Data Sources Community workflow HPC Entry services QC PID pages early sharing, community review B2FIND Information from all these steps is gathered and linked through B2HANDLE versions, errata IDs, collection building data input, versions, workflow link, instance/component, collection building 10/7/2024 14

  15. Access policies and Funding models Public resolution is for free by nature of the Handle System Support libraries are free and open source software Assignment of PIDs, prefixes, hosting and additional services require costs to be covered - Ensure service reliability, scalability & trustworthiness Detailed rates and conditions can be requested from EUDAT or the B2HANDLE hosting partners 10/7/2024 15

  16. How to get in touch with B2HANDLE How to request a PID namespace? How to get namespace hosting? How to establish a PID profile? How to use the Central PID Catalog? Contact B2HANDLE support at - https://eudat.eu/support-request?service=B2HANDLE Consult user documentation at - https://www.eudat.eu/services/userdoc/b2handle 10/7/2024 16

  17. Thank you for your attention! @EOSC_eu eosc-hub.eu

Related


More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#