
Building National Data Infrastructure in Czechia
"Learn about the status and plans for EOSC in Czechia, focusing on data management for scientists with the goal of ensuring storage and accessibility, alongside the development of a National Data Infrastructure comprising key components like the National Metadata Directory and Catalogue of Repositories. Close collaboration with national stakeholders and significant funding commitment highlight the importance of data accessibility and FAIR principles in the country's research landscape."
Download Presentation

Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.
E N D
Presentation Transcript
EOSC in Czechia Status and Plans Ludek Matyska, CESNET and Masaryk University
National Background A single large research e-infrastructure e-INFRA CZ CESNET in Prague NREN, coordinator; also EOSC A mandated organization CERIT-SC at Masaryk University in Brno flexible e-science center Supercomputing center IT4Innovations at the Technical University Ostrava e-INFRA CZ a natural coordinator of EOSC activities in Czechia Driving and coordinating the architecture and implementation But not a sole player at the national level Close collaboration with Ministry of Education, Sports and Youths since 2020 Governmental decision to allocate some 120 M till 2028 for EOSC implementation NTE Ireland, 3/11/2023 2
What we try to build as EOSC in Czechia Deliberately narrow focus (compared to the general EU position) Primary target: Data produced by scientists Goal: Make sure that at least 80% of all data producing scientists knows where to store them permanently NTE Ireland, 3/11/2023 3
What we try to build in Czechia Deliberately narrow focus (compared to the EU position) Primary target: Data produced by scientists Goal: Make sure that at least 80% of all data producing scientists knows where to store them permanently Secondary target: Scientists looking for data Goal: All scientists know where to look for the data they are interested in NTE Ireland, 3/11/2023 4
What we try to build in Czechia Deliberately narrow focus (compared to the EU position) Primary target: Data produced by scientists Goal: Make sure that at least 80% of all data producing scientists knows where to store them permanently Secondary target: Scientists looking for data Goal: All scientists know where to look for the data they are interested in Corollary: All the data we speak about is FAIR NTE Ireland, 3/11/2023 5
What we plan to build National Data Infrastructure, a system composed of National Metadata Directory (NMA) And National Catalogue of Repositories Repositories Services and tools Plus overseeing and monitoring background and training NTE Ireland, 3/11/2023 6
What we plan to build National Data Infrastructure, a system composed of National Metadata Directory (NMA) And National Catalogue of Repositories Repositories Data storage Interface Optimized I/O streams Metadata models Services and tools NTE Ireland, 3/11/2023 7
What we plan to build National Data Infrastructure, a system composed of National Metadata Directory (NMA) And National Catalogue of Repositories Repositories Services and tools Core like AAI, Data transfer, NMA filling General like Data stewardship wizard, actionable Data management plans, Repositories specific services NTE Ireland, 3/11/2023 8
Preparatory Phase Architecture of EOSC implementation in Czechia Document accepted in 2021 by Ministry of Education, Youth and Sports EOSC CZ Working Groups since Autumn 2021 EOSC CZ Coordination Board at the Ministry since December 2022 Structural Funds support Open Science/EOSC CZ series of calls Collaborative approach towards the calls Large, all community encompassing projects, no unnecessary competition NTE Ireland, 3/11/2023 10
EOSC CZ Working Groups 4 Cross-cutting Architecture Metadata Core Services Education and Training 8 Thematic Bio/Health/Food Social Sciences Humanities and the Arts Physics Environmental Sciences Material Sciences and technology Data management for AI and ML Sensitive data NTE Ireland, 3/11/2023 11
EOSC CZ Initiative Calls Timeline NTE Ireland, 3/11/2023 12
Systemic Project EOSC CZ Overseeing, Monitoring and Support EOSC CZ Secretariat EOSC CZ Virtual training center Key components of the NDI National Metadata Directory (NMA) Core AAI Fast data transfer Project approved and started on 1stJanuary 2023 18 M budget, till end of 2028 3 partners (e-INFRA CZ) NTE Ireland, 3/11/2023 13
National Repository Platform Project (NRP) The place where data is stored, accessed and provided Several layers from hardware through storage, repository software to repositories and their interfaces Hardware Around 5 geographically distributed nodes (major cities) Object storage Ceph (S3) Replication, High availability Repository platforms NTE Ireland, 3/11/2023 14
Repository platforms Three initially supported, but otherwise no restrictions ALR, DSpace, Invenio Open environment The actual repositories are build there However, you may also have a repository directly over the Ceph layer The primary partners for the NRP project are repository builders and curators Project under finalization 11 partners, 50+ M budget, till end 2028, NTE Ireland, 3/11/2023 15
Thematic Clusters Project Currently under discussion The call is not open yet Again one project expected 40 M budget, higher (around 25) partners, till 2028 To be coordinated by the largest Czech university Charles University Driven by the EOSC CZ thematic working groups To support/build the actual repositories layer Probably smaller number of larger, well curated repositories International impact NTE Ireland, 3/11/2023 16
Summary Who builds what? EOSC-CZ and CARDS projects (2023-2028) NMA including its Metadata model(s) Part of the core services (AAI, data transfer) PIDs NRP project (2024-2028) The whole NRP and its core services Three supported platforms (and pilot repositories) Documentation and training materials Clusters project (2025-2028) The repositories and their specific services Also metadata, interoperability, standards, NTE Ireland, 3/11/2023 17
Summary We focus on the care of FAIR research data Reasonably easy to explain what we aim to the scientific communities Visible impact on the researchers and their work Key elements fully compliant with the EOSC at EU level plans (SRIA) EOSC Node concept currently under discussion Not (yet) included in the strategic and projects documents We want to re-use and expand what is already available, not to build new parallel structures Key role of national large research e-infrastructure e-INFRA CZ But also very close collaboration with other large RIs NTE Ireland, 3/11/2023 19
Thank Thank you you for your attention for your attention Questions Questions? ?