Enhancing Database Accelerators with OpenCAPI Technology

1. Adopting OpenCAPI for High Bandwidth Database Accelerators

Authors: Jian Fang¹, Yvo T.B. Mulder¹, Kangli Huang¹, Yang Qiao¹, Xianwei Zeng¹, Jan Hidders², Jinho Lee³, H. Peter Hofstee¹,³

H2RC @ SC'17, Denver, USA
Speaker: Jian Fang (j.fang-1@tudelft.nl)
November 17th, 2017
 
2. Netezza Data Appliance Architecture

Source: The Netezza data appliance architecture: A platform for high performance data warehousing and analytics
3. S-Blade in Netezza

(Figure: data flow through an S-Blade: compressed data, decompressed data, filtered data)

Source: The Netezza data appliance architecture: A platform for high performance data warehousing and analytics
4. Netezza Data Appliance Architecture (Bottleneck)

Source: The Netezza data appliance architecture: A platform for high performance data warehousing and analytics
5. DB with FPGAs: What is New?

• Databases move from Disk to Memory
• Databases move from Disk to Flash
• Do FPGAs still help? Yes: faster data movement

Source: https://www.datanami.com/2015/10/21/neo4j-touts-10x-performance-boost-of-graphs-on-ibm-power-fpgas/
6. OpenCAPI Helps

• OpenCAPI brings FPGAs memory-scale bandwidth
  • OpenCAPI 3.0 (x8) -> 25 GB/s; 100 GB/s in total with 4 channels
  • OpenCAPI 4.0 (x32) -> 100 GB/s in total with 1 channel
• Shared memory
  • Address translation
  • Saves extra copies
• Targets more than just computation-intensive applications

High Bandwidth • Low Latency • Shared Memory

Source: http://opencapi.org
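The link rates quoted above follow from the lane count: each OpenCAPI lane signals at 25 Gbit/s. A back-of-the-envelope sketch (raw signaling rate only; protocol overhead is ignored here):

```python
# Raw OpenCAPI link bandwidth from lane count.
# Assumes 25 Gbit/s per lane, as in the 25.78125 GT/s signaling class.
GBIT_PER_LANE = 25

def link_gbytes_per_s(lanes):
    """Raw link bandwidth in GB/s for a given lane count."""
    return lanes * GBIT_PER_LANE / 8   # bits -> bytes

print(link_gbytes_per_s(8))    # OpenCAPI 3.0 x8  -> 25.0 GB/s
print(link_gbytes_per_s(32))   # OpenCAPI 4.0 x32 -> 100.0 GB/s
```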
7. Accelerating DBs with OpenCAPI

• Decompress-Filter
• Hash-Join
• Merge-Sorter
• ...
8. Decompress-Filter

• Parquet format
  • Partitionable
  • Supports GZIP, LZO, Snappy, ...
• Snappy (de)compression algorithm
  • Based on LZ77, byte-oriented
  • Low compression ratio, but fast (de)compression speed
• Computation-bound
  • Highly data-dependent
  • Multiple engines to keep up with the bandwidth
  • Trade-off between stronger-but-fewer engines and simpler-but-more engines (64 KB history for each engine)
• Memory access pattern
  • Sequential read for each stream (engine)

Do we need compression & decompression?
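The data dependency mentioned above is visible in the core of any LZ77-style decoder such as Snappy's: a back-reference may copy bytes the decoder has only just produced. A minimal sketch, using a simplified, hypothetical token stream rather than the real Snappy wire format:

```python
# Simplified LZ77-style decoding (the core of Snappy decompression).
# Token format is illustrative, NOT the real Snappy encoding:
#   ("literal", b"...")          - emit raw bytes
#   ("copy", offset, length)     - copy `length` bytes starting `offset`
#                                  bytes back in the output
def decompress(tokens):
    out = bytearray()
    for tok in tokens:
        if tok[0] == "literal":
            out += tok[1]
        else:
            _, offset, length = tok
            start = len(out) - offset
            for i in range(length):          # byte-by-byte, because a copy
                out.append(out[start + i])   # may overlap its own output
    return bytes(out)

# Overlapping copy: each output byte depends on a byte just written.
print(decompress([("literal", b"ab"), ("copy", 2, 6)]))  # b'abababab'
```

This serial output dependency is exactly why a single decompression engine is hard to speed up, and why the slide proposes multiple engines (each with its own 64 KB history) to keep up with the link bandwidth.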
9. Hash-Join

• Memory-bound
  • Low locality of the data and multiple passes of data transfers
  • The internal memory (BRAM) is too small to store the hash table
• Memory access pattern
  • Sequentially read the relations
  • Randomly write/read the hash table
  • Granularity matters during random accesses:
    Require: 40 B tuple | Access: 64 B cacheline | Wasted: 24 B (~40% waste)

Fang J, et al. Analyzing In-Memory Hash Joins: Granularity Matters, ADMS 2017.
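The waste figure above is simple arithmetic: random accesses fetch whole cachelines, so any tuple smaller than the access granularity drags unused bytes along. A quick sketch with the slide's numbers:

```python
# Fraction of fetched bytes wasted when a tuple is read through
# fixed-size cachelines (numbers from the slide: 40 B tuple, 64 B line).
import math

def wasted_fraction(tuple_bytes, cacheline_bytes):
    lines = math.ceil(tuple_bytes / cacheline_bytes)  # cachelines touched
    fetched = lines * cacheline_bytes
    return (fetched - tuple_bytes) / fetched

print(wasted_fraction(40, 64))   # 0.375 -> 24 of every 64 bytes wasted
```

37.5% of the fetched bandwidth carries no useful data, which is why granularity matters so much for the random hash-table accesses.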
10. Merge-Sorter

• Need a strong sorter for the final pass
• Memory access pattern
  • Sequential read within each stream, but the stream to read next is chosen data-dependently (effectively at random)
• Solutions
  • Even-odd sorter to continuously produce multiple tuples per cycle
  • Multi-stream buffering to feed this beast
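The access pattern above can be sketched in software with a heap-based k-way merge. This is only a stand-in for the hardware design (an even-odd merge network emitting multiple tuples per cycle, fed by multi-stream buffers); the names and structure are illustrative:

```python
# k-way merge of sorted streams. Reads are sequential within each
# stream, but which stream wins next is data-dependent, so the memory
# accesses hop between streams, which is what the multi-stream buffers
# on the slide are there to hide.
import heapq

def merge_streams(streams):
    """Merge several individually sorted lists into one sorted output."""
    heap = [(s[0], i, 0) for i, s in enumerate(streams) if s]
    heapq.heapify(heap)
    out = []
    while heap:
        value, sid, pos = heapq.heappop(heap)    # winning stream varies
        out.append(value)
        if pos + 1 < len(streams[sid]):           # advance sequentially
            heapq.heappush(heap, (streams[sid][pos + 1], sid, pos + 1))
    return out

print(merge_streams([[1, 4, 7], [2, 5, 8], [3, 6, 9]]))
# [1, 2, 3, 4, 5, 6, 7, 8, 9]
```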
 
11. Summary

• Databases need to move data at an ever faster rate
• With OpenCAPI, FPGAs can help DBs more
• Challenges of high-bandwidth accelerator design
• Three examples
 
12. Authors

Jian Fang (TU Delft), Yvo T.B. Mulder (TU Delft), Kangli Huang (TU Delft), Yang Qiao (TU Delft), Xianwei Zeng (TU Delft), Jan Hidders (Vrije Universiteit Brussel), Jinho Lee (IBM Research), H. Peter Hofstee (TU Delft & IBM Research)
13. Thank You

More Detail:
• Progress with Power Systems and CAPI
  https://ibm.ent.box.com/v/OpenPOWERWorkshopMicro50/file/239719608792
• Leveraging the bandwidth of OpenCAPI with reconfigurable logic
  https://indico-jsc.fz-juelich.de/event/55/other-view?view=standard

Contact Me: j.fang-1@tudelft.nl
