Computer Data Storage with Ceph

 
CEPH
 
Introduction: means of computer data storage
 
Introduction: means of computer data storage
 
Block storage
File storage
Object storage
 
API used for storing computer data
 
Introduction: means of computer data storage
 
Block storage:
0. block
1. block
2. block
3. block
4. block
5. block
 
Block read/write
 
Blocksize (eg.
512b, 4KB)
 
Examples:
Most of physical data storage devices
HDD, SSD, pendrive
SCSI, SATA, usb mass storage protocols are all block based
LBA scheme nowadays instead of a CHS
Logical block devices made out of physical data storage devices
Hardware/software RAID (pl. Linux md devices)
Linux device mapper (/dev/dm-* devices)
LVM
Encrypted disks (dm-crypt)
Also zvol (block device made out of a ZFS dataset)
 
Examples:
Variants of physical data storage device protocols
that work over IP/Ethernet
eg. iSCSI
Block API protocols of distributed data storage
solutions
CEPH RBD (RADOS block device)
 
Introduction: means of computer data storage
 
File storage:
 
Fájl open, close, write, read, create, delete
 
Examples:
Filesystems
Block, file, object storage are often built on each other
Basic filesystems are based on block
But in case of ZFS an object layer is placed between the
file and block layer:
block -> object -> file
ZFS zvol:
Block -> object -> block
Ext4, xfs, ntfs, fat, zfs, btrfs
IP based file access
SMB, NFS, WebDAV
File API of distributed data storage solutions
cephFS, glusterfs
A lot of layers until we reach the block layer
 
Introduction: means of computer data storage
 
Object storage:
 
Unstructured data
In general object name is only a UUID
Flat namespace
Metadata is key
To be able to search in the data
Metadata is indexed, object storage is
optimized for searching based on
metadata
Eg. Searching pictures based on various
properties
Background=blue,
isThereAHuman=true,
creationYear=2021 stb.
 
Example:
AWS S3 API, OpenStack swift
There are API-s, lots of implementations
S3-at is also provided by Amazon as a service
Most cloud storage solutions also use this in the backgroud
Google Drive (format of share links)
Object API of distributed data storage solutions
CEPH librados, RGW (S3/swift)
ZFS uses it internally, but it’s not accessible directly
 
CEPH
 
CEPH
 
CEPH
 
Distributed data storage solution
Free/open source
Runs on “normal” PC/server machines
Open source alternative of vendor
made, turnkey solutions
4 API types
Object: Librados
Object: RGW
Block: RBD
File: CEPH FS
 
CEPH
 
Physical view
 
Daemons:
OSD.0
OSD.1
OSD.2
OSD.3
MON
MGR
OSD.4
OSD.5
MGR
MON
MGR
 
host1
 
host2
 
host3
CLIENT
CLIENT
 
CEPH
 
Logical view
Every daemon talks with each other
Those on the same host on
loopback
Host boundaries have no role besides
optimalisation
 
Daemons:
OSD.0
OSD.1
OSD.2
OSD.3
MON
MGR
OSD.4
OSD.5
MGR
MON
MGR
 
CEPH cluster
 
Daemon types
MON: monitor
Central config
MGR: manager
OSD: object storage daemon
1 disk+cache – 1 OSD
CLIENT: CEPH client
Administration, data storage
CLIENT
CLIENT
 
CEPH
 
Every daemon talks with each other
No central node
The monitor only provides cluster
information
 
Data storage – RADOS object:
OSD.0
OSD.1
OSD.2
OSD.3
MON
MGR
OSD.4
OSD.5
MGR
MON
MGR
 
CEPH cluster
CLIENT
CLIENT
 
CEPH
 
Replication
Erasure coding
 
Redundancy:
OSD.0
OSD.1
OSD.2
OSD.3
MON
MGR
OSD.4
OSD.5
MGR
MON
MGR
 
CEPH cluster
CLIENT
CLIENT
 
Original object
 
EC blocks
 
CEPH
OSD.0
OSD.1
OSD.2
OSD.3
MON
MGR
OSD.4
OSD.5
MGR
MON
MGR
 
CEPH cluster
CLIENT
CLIENT
 
RBD:
Image file striped into
multiple object client-
side
RGW:
S3/swift object striped
into multiple objects by
RGW daemons
 
CEPH
OSD.0
OSD.1
OSD.2
OSD.3
MON
MGR
OSD.4
OSD.5
MGR
MON
MGR
 
CEPH cluster
iSCSI GW
iSCSI GW
 
iSCSI GW:
CEPH client
iSCSI target
iSCSI initator
 
iSCSI
 
iSCSI
Slide Note
Embed
Share

Delve into the world of computer data storage through an exploration of block storage, file storage, and object storage. Learn about the APIs, protocols, and examples associated with Ceph, a distributed data storage solution. Understand the nuances of physical data storage devices, filesystems, and the layers that culminate in the block layer within data storage architectures.

  • Computer Data Storage
  • Ceph
  • Block Storage
  • File Storage
  • Object Storage

Uploaded on Oct 01, 2024 | 1 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. CEPH Introduction: means of computer data storage

  2. Introduction: means of computer data storage API used for storing computer data Block storage File storage Object storage

  3. Introduction: means of computer data storage Block storage: Block read/write 0. block 1. block 2. block 3. block 4. block 5. block Blocksize (eg. 512b, 4KB) Examples: Most of physical data storage devices HDD, SSD, pendrive SCSI, SATA, usb mass storage protocols are all block based LBA scheme nowadays instead of a CHS Logical block devices made out of physical data storage devices Hardware/software RAID (pl. Linux md devices) Linux device mapper (/dev/dm-* devices) LVM Encrypted disks (dm-crypt) Also zvol (block device made out of a ZFS dataset) Examples: Variants of physical data storage device protocols that work over IP/Ethernet eg. iSCSI Block API protocols of distributed data storage solutions CEPH RBD (RADOS block device)

  4. Introduction: means of computer data storage File storage: F jl open, close, write, read, create, delete Examples: Filesystems Block, file, object storage are often built on each other Basic filesystems are based on block But in case of ZFS an object layer is placed between the file and block layer: block -> object -> file ZFS zvol: Block -> object -> block Ext4, xfs, ntfs, fat, zfs, btrfs IP based file access SMB, NFS, WebDAV File API of distributed data storage solutions cephFS, glusterfs A lot of layers until we reach the block layer

  5. Introduction: means of computer data storage Object storage: Unstructured data In general object name is only a UUID Flat namespace Metadata is key To be able to search in the data Metadata is indexed, object storage is optimized for searching based on metadata Eg. Searching pictures based on various properties Background=blue, isThereAHuman=true, creationYear=2021 stb. Example: AWS S3 API, OpenStack swift There are API-s, lots of implementations S3-at is also provided by Amazon as a service Most cloud storage solutions also use this in the backgroud Google Drive (format of share links) Object API of distributed data storage solutions CEPH librados, RGW (S3/swift) ZFS uses it internally, but it s not accessible directly

  6. CEPH CEPH

  7. CEPH Distributed data storage solution Free/open source Runs on normal PC/server machines Open source alternative of vendor made, turnkey solutions 4 API types Object: Librados Object: RGW Block: RBD File: CEPH FS

  8. CEPH Daemons: Physical view host1 host2 OSD.1 OSD.0 OSD.5 OSD.4 OSD.2 OSD.3 MGR MON MGR host3 CLIENT MGR MON CLIENT

  9. CEPH Daemons: Logical view Every daemon talks with each other Those on the same host on loopback Host boundaries have no role besides optimalisation CEPH cluster OSD.1 OSD.0 OSD.5 OSD.4 OSD.2 OSD.3 MGR Daemon types MON: monitor Central config MGR: manager OSD: object storage daemon 1 disk+cache 1 OSD CLIENT: CEPH client Administration, data storage MON MGR CLIENT CLIENT MGR MON

  10. CEPH Data storage RADOS object: Every daemon talks with each other No central node The monitor only provides cluster information CEPH cluster OSD.1 OSD.0 OSD.5 OSD.4 OSD.2 OSD.3 MGR MON MGR CLIENT CLIENT MGR MON

  11. CEPH Redundancy: Replication Erasure coding CEPH cluster OSD.1 OSD.0 OSD.5 OSD.4 OSD.2 OSD.3 MGR MON MGR CLIENT EC blocks Original object CLIENT MGR MON

  12. CEPH CEPH cluster RBD: Image file striped into multiple object client- side RGW: S3/swift object striped into multiple objects by RGW daemons OSD.1 OSD.0 OSD.5 OSD.4 OSD.2 OSD.3 MGR MON MGR CLIENT CLIENT MGR MON

  13. CEPH CEPH cluster iSCSI GW: CEPH client iSCSI target OSD.1 OSD.0 OSD.5 OSD.4 OSD.2 OSD.3 MGR MON MGR iSCSI GW iSCSI iSCSI GW iSCSI iSCSI initator MGR MON

More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#