Overview
Syllabus
Introduction to OCI Cluster Networks
What is RDMA?
History of RDMA at OCI
Why is RDMA Challenging?
Importance of RoCE
Pitfalls of RoCE
Overcoming Pitfalls of RoCE
Limited use of PFC
Tailored QoS for multiple workloads
How to use ECN in RDMA networks
Tuning ECN to HPC workloads
Tuning ECN to GPU and DB workloads
Are OCI Cluster Networks in the same network?
Why do we need a separate RDMA network?
Performance optimizations for workloads
Flow aware traffic distribution
Traffic locality optimization
Traffic topology information vending service
Why OCI RDMA network is better, differentiated
Balancing scale and latency
Taught by
Oracle