B2HANDLE: Managing Digital Object Identifiers for Data Persistence

Slide Note
Embed
Share

B2HANDLE is a service for minting and managing persistent identifiers (PIDs) to facilitate common operations on digital objects across infrastructures. The system enables automated management of data objects and metadata, facilitates machine-driven workflows, and supports the construction of PID and metadata graphs. With a focus on achieving FAIR data principles, B2HANDLE aims to ensure persistency and accessibility of identified resources, independent of specific communities or services. The service architecture includes scalable server federation, replication between nodes, and integrated global Handle registry for enhanced discoverability and management.


Uploaded on Oct 07, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. B2HANDLE Minting and managing digital object identifiers eosc-hub.eu @EOSC_eu EOSC-hub receives funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No. 777536.

  2. Content FAIR Digital Objects, PIDs and B2HANDLE motivation B2HANDLE architecture and access B2HANDLE services: Hosting, Profiles & Search Integration with other EOSC services and workflow example Documentation and further information 10/7/2024 2

  3. Motivation: Towards FAIR Digital Object Management CRUD operations Automated management of data objects and their metadata - early-to-mid data life cycle Record connections between (meta)data objects, software, workflows - Construction of B2HANDLE PID and metadata graph - Facilitate machine-driven workflows and realize the Intelligent Data Fabric 10/7/2024 3

  4. Motivation: What are Persistent Identifiers? Globally unique, persistent name identifying a digital object Persistency is achieved by registering the identifier in an identifier registry - The registry works according to policies that ensure its sustainability and trustworthiness - Persistency of the identifier does not necessarily imply persistency of the identified resource PID services provide registration and resolution 10/7/2024 4

  5. What does B2HANDLE target? Persistent Identifiers for managing digital objects across infrastructures - Interaction with objects first requires a reliable way to address them - Independent of specific communities or services Not to be confused with citation and credit scenarios (e.g., via DOIs, DataCite) - B2HANDLE does not offer or enforce strong policies Ultimately: Facilitate common operations on objects - Create, Copy, Move, Delete, Annotate - Versioning, replication, provenance tracking, object linking 10/7/2024 5

  6. Service architecture and interfaces Scalable, hierarchical server federation fully integrated with global Handle registry - Replication between B2HANDLE nodes - Central Catalog across B2HANDLE nodes Added-value services at EUDAT B2HANDLE nodes Global Handle registry REST B2HANDLE nodes search Central Catalog SURFsara, GRNET, SNIC, GWDG, MPCDF, ... 10/7/2024 6

  7. Service access Easy resolution via global Handle registry proxies - https://hdl.handle.net HTTPS REST interface for CRUD operations on each node Python client libraries (b2handle, pyhandle) for direct integration with user scripts, applications and middleware - easy access also to search interface - https://pypi.org/project/b2handle/ - https://pypi.org/project/pyhandle/ 10/7/2024 7

  8. Service options and attributes Reliable hosting, resolution and replication Operational processes to ensure balance Ordering of Handle namespaces (prefixes) Hosting Profile development for clients Fully aligned with cross-community standards and recommendations Key concern for FAIR data Profiles Optional service for participating nodes Reverse-lookup and metadata search via Central PID Catalog Search 10/7/2024 8

  9. B2HANDLE: PID namespace hosting Reliable hosting and resolution of PID namespaces - Well-proven infrastructure in place - Technical foundation is the Handle System - Namespaces provided through ePIC Operational processes for namespace integration and maintenance - Ensure balance across hosting nodes - Accommodate fail-over and retraction scenarios 10/7/2024 9

  10. B2HANDLE: Profile management Profiles determine the kernel information to keep with every identifier Essential information about the object - checksum, time stamp, parent, ... - ... - ... - ... B2HANDLE implements Kernel Information (RDA draft recommendation): - Support machine-actionable services - Non-authoritative source for metadata - Stored directly at the resolver - W3C PROV compatible attributes 10/7/2024 10

  11. B2HANDLE: Searching via the Central PID Catalog Fast reverse-lookups and searching/filtering across B2HANDLE metadata space Provides a metadata graph view Two identical instances hosted by SurfSARA and GRNET - load-balancing and fail-over capacity Procedures in place to integrate new B2HANDLE nodes without service downtime 10/7/2024 11

  12. B2HANDLE: Technical support for PID and profile management pyhandle Python library - Create, Read, Update, Delete of PIDs and Profile information - Modes: Handle System REST interaction, direct database operations, asynchronous batch mode - Interaction with Central PID Catalog Profile migration tools and procedures 10/7/2024 12

  13. B2HANDLE integration and use B2HANDLE is integrated with multiple other EOSC services, both generic and thematic - B2SAFE, B2SHARE, OneData, ECAS, ... B2HANDLE is also used by user communities to manage Handles for specific scenarios - Examples cases from CLARIN, ENES, Herbadrop, EPOS, ... 10/7/2024 13

  14. Full workflow example ECAS work environment Data & workflow Sharing ENES Data Sources Community workflow HPC Entry services QC PID pages early sharing, community review B2FIND Information from all these steps is gathered and linked through B2HANDLE versions, errata IDs, collection building data input, versions, workflow link, instance/component, collection building 10/7/2024 14

  15. Access policies and Funding models Public resolution is for free by nature of the Handle System Support libraries are free and open source software Assignment of PIDs, prefixes, hosting and additional services require costs to be covered - Ensure service reliability, scalability & trustworthiness Detailed rates and conditions can be requested from EUDAT or the B2HANDLE hosting partners 10/7/2024 15

  16. How to get in touch with B2HANDLE How to request a PID namespace? How to get namespace hosting? How to establish a PID profile? How to use the Central PID Catalog? Contact B2HANDLE support at - https://eudat.eu/support-request?service=B2HANDLE Consult user documentation at - https://www.eudat.eu/services/userdoc/b2handle 10/7/2024 16

  17. Thank you for your attention! @EOSC_eu eosc-hub.eu

Related