Introduction to CernVM File System (CVMFS)

CernVM File System - CVMFS
Scalable, reliable and low-maintenance software distribution service

Content
Motivation and driving consideration about the service
Service architecture and interfaces: overview
- How the user can access the service (e.g. REST, GUI, CLIs, etc.)
- Service options and attributes
Acceptable Usage Policy (AUP)
Access policy and business model
Use cases
Documentation/tutorial/information
9/30/2024

Motivation
CVMFS was developed to help High Energy Physics (HEP) collaborations deploy software on the worldwide distributed computing infrastructure used to run data processing applications
The technology has also been adopted by other computing communities outside HEP: medical, space and Earth, and biological sciences

What is CVMFS?
Built using standard technologies (FUSE, SQLite, HTTP, Squid and caches)
Files and directories are hosted on standard web servers and get distributed through a hierarchy of caches to individual nodes
Software needs one single installation, then it is available at any site with CernVM-FS client installed and configured
Mounted in the universal /cvmfs namespace at client level
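
As a rough illustration of the universal namespace, the Python sketch below builds the path under which a repository would appear on a client and checks whether that path can be listed. Only the /cvmfs prefix comes from the slides; the helper names and the repository name are assumptions made for this example.

```python
import posixpath
import os

CVMFS_ROOT = "/cvmfs"  # universal namespace named in the slides


def repo_path(repo_name: str, root: str = CVMFS_ROOT) -> str:
    """Return the client-side path of a repository (hypothetical helper)."""
    if not repo_name or "/" in repo_name:
        raise ValueError(f"invalid repository name: {repo_name!r}")
    return posixpath.join(root, repo_name)


def is_available(repo_name: str, root: str = CVMFS_ROOT) -> bool:
    """Return True if the repository directory can be listed on this machine."""
    try:
        os.listdir(repo_path(repo_name, root))
        return True
    except OSError:
        # Covers a missing client, a missing mount, or an unknown repository.
        return False


if __name__ == "__main__":
    name = "example.egi.eu"  # hypothetical repository name
    print(repo_path(name), "available:", is_available(name))
```

On a machine without a configured client the check simply reports that the repository is not available, which makes it usable as a pre-flight test in job scripts.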

What is CVMFS?
Because of standard technologies, it can be used everywhere
- Not only Grid, but also Cloud, local clusters
- Add CernVM-FS client to a VM image => /cvmfs space automatically available
Due to increasing interest, technology has been optimized also for access to conditions data and other auxiliary data
- File chunking for large files
- Garbage collection on revision level
- File system history
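
The idea behind file chunking can be sketched in a few lines of Python. The example assumes fixed-size chunks identified by a SHA-1 digest; the slides do not say how CernVM-FS chooses chunk boundaries or hashes, so both choices, and the function names, are illustrative assumptions only.

```python
import hashlib
from typing import List, Tuple

# One chunk is described as (offset, length, hex digest of the chunk bytes).
Chunk = Tuple[int, int, str]


def chunk_and_hash(data: bytes, chunk_size: int) -> List[Chunk]:
    """Split data into fixed-size chunks and describe each one."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    chunks = []
    for offset in range(0, len(data), chunk_size):
        piece = data[offset:offset + chunk_size]
        chunks.append((offset, len(piece), hashlib.sha1(piece).hexdigest()))
    return chunks


def chunks_for_range(chunks: List[Chunk], start: int, length: int) -> List[Chunk]:
    """Return only the chunks that overlap the byte range [start, start + length)."""
    end = start + length
    return [c for c in chunks if c[0] < end and c[0] + c[1] > start]


if __name__ == "__main__":
    table = chunk_and_hash(b"a" * 10, chunk_size=4)
    needed = chunks_for_range(table, start=5, length=2)
    print(len(table), "chunks in total,", len(needed), "needed for the read")
```

The point of the sketch is that a reader of a small byte range only needs the chunks that overlap it, so a large file does not have to be transferred or cached as a whole.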

CVMFS Elements
Stratum-0
- The single source of the CernVM File System (the repository)
- Protected read/write instance maintained by a dedicated release manager machine or installation box
- The CVMFS tool kit is used to publish the current state of the repository on the installation box
- Data stored in CVMFS is converted into a CVMFS repository during the process of publishing
- Basically an Apache HTTP server

CVMFS Elements
Stratum-1
- Also called Replica Server
- Standard web server that uses the CVMFS tool kit to create and maintain a mirror of the CVMFS repository served by a Stratum-0 server
Site-local proxy servers
- Sit between the clusters of nodes with CernVM-FS clients and the Stratum-1 servers
- Used to reduce load on the Stratum-1 servers by local caching
Stratum-0 server, Stratum-1 servers and the site-local proxy servers can be seen as a content distribution network
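
How local caching reduces load on the Stratum-1 servers can be modelled with a small Python sketch. It treats the Stratum-1 as a pre-populated mirror and the proxy and clients as pull-through caches; the class, the object path, and the request counters are assumptions made for illustration, not part of the CVMFS tool kit.

```python
from typing import Dict, Optional


class CacheLayer:
    """One layer of the hierarchy: serves from its own store or asks upstream."""

    def __init__(self, name: str, upstream: Optional["CacheLayer"] = None,
                 content: Optional[Dict[str, bytes]] = None) -> None:
        self.name = name
        self.upstream = upstream
        self.store = dict(content or {})  # copy, so layers do not share state
        self.requests = 0                 # number of requests this layer received

    def get(self, key: str) -> bytes:
        self.requests += 1
        if key in self.store:
            return self.store[key]
        if self.upstream is None:
            raise KeyError(key)
        value = self.upstream.get(key)
        self.store[key] = value  # keep a local copy for later requests
        return value


if __name__ == "__main__":
    published = {"/sw/tool": b"binary"}             # hypothetical Stratum-0 content
    stratum1 = CacheLayer("stratum-1", content=published)  # mirror of Stratum-0
    proxy = CacheLayer("site-proxy", upstream=stratum1)
    node_a = CacheLayer("node-a", upstream=proxy)
    node_b = CacheLayer("node-b", upstream=proxy)

    node_a.get("/sw/tool")
    node_b.get("/sw/tool")
    print("requests seen by Stratum-1:", stratum1.requests)
    print("requests seen by site proxy:", proxy.requests)
```

In this model two worker nodes reading the same file cause a single request to the Stratum-1, because the second read is answered by the site proxy.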

EGI CVMFS Service and Infrastructure
Maintained and operated by EGI Resource Centres
Stratum-0 service
- Maintains and publishes the current state of the repositories (part of the egi.eu CVMFS namespace)
Stratum-1 service
- Worldwide network of servers replicating the egi.eu repositories
Uploader service
- Provides upload area for egi.eu repositories
- GSI-OpenSSH interface

EGI CVMFS Service and Infrastructure
Topology, July 2019 (figure not reproduced)

Service options and attributes
Service attributes:
- Size of the repository
- Expected growth of the repository (if possible)
- DNs (Distinguished Names) of VO users responsible for maintaining the repository in the future
- Emails of the VO users responsible for maintaining the repository in the future

Additional info
EGI PROC22 is in place to describe the process of creating a repository within the EGI CVMFS infrastructure
The involved computing community is expected to be supported by more than one EGI production Resource Centre
Ideally the maintainers of the repository should have access to an X.509 certificate

Acceptable Usage Policy
EGI AUP
To use the service, users must comply with the Virtual Organisation AUP

Use Case - CVMFS within WLCG
All LHC experiments are making use of CVMFS for:
- Production software distribution
- Experiment conditions database access
- Continuous integration and testing software
All involved repositories are part of the cern.ch CVMFS namespace

Documentation
CVMFS documentation
More documentation

CernVM File System (CVMFS) is a scalable, reliable, and low-maintenance software distribution service. It was developed to help High Energy Physics (HEP) collaborations deploy software on worldwide distributed computing infrastructure, and it has since been adopted by other communities, including the medical, space and Earth, and biological sciences. Because it is built on standard technologies (FUSE, SQLite, HTTP, Squid), it can be used on Grid, Cloud, and local cluster resources. Repositories are published on a Stratum-0 server, mirrored by Stratum-1 replica servers, and delivered to clients through site-local caching proxies. The presentation also covers the EGI CVMFS service, its service attributes, the acceptable usage policy, use within WLCG, and pointers to documentation.

  • Software Distribution
  • High Energy Physics
  • Scalable Technology
  • Reliable Infrastructure
  • Data Processing

Uploaded on Sep 30, 2024



Presentation Transcript


  1. CernVM File System - CVMFS: Scalable, reliable and low-maintenance software distribution service. eosc-hub.eu, @EOSC_eu. EOSC-hub receives funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No. 777536.

  14. Use Case - CVMFS within WLCG: CernVM-FS content distribution network for the cern.ch domain (figure source: cvmfs.readthedocs.io/en/latest/cpt-replica.html#fig-stratum1). Stratum-1 replica servers are located in Europe, the U.S. and Asia. One protected read/write instance (Stratum-0) located at CERN feeds the public, distributed mirror servers. A distributed hierarchy of proxy servers fetches content from the closest public mirror server.
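
Choosing the closest public mirror can be illustrated with a short Python sketch that orders mirrors by great-circle distance from the client. The slides do not state how closeness is determined, so geographic distance is an assumption here, and the mirror names and coordinates are made up for the example.

```python
import math
from typing import Dict, List, Tuple

Coord = Tuple[float, float]  # (latitude, longitude) in degrees

EARTH_RADIUS_KM = 6371.0


def great_circle_km(a: Coord, b: Coord) -> float:
    """Haversine distance between two points on the Earth's surface."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def order_mirrors(client: Coord, mirrors: Dict[str, Coord]) -> List[str]:
    """Return mirror names sorted from nearest to farthest."""
    return sorted(mirrors, key=lambda name: great_circle_km(client, mirrors[name]))


if __name__ == "__main__":
    # Hypothetical mirror locations, one per region named on the slide.
    mirrors = {
        "mirror-asia": (25.0, 121.5),
        "mirror-us": (41.8, -88.3),
        "mirror-europe": (51.5, -1.3),
    }
    print(order_mirrors((46.2, 6.1), mirrors))
```

Returning the full ordering rather than a single name leaves room for trying the next mirror if the nearest one is unreachable.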
