Explore KubeFlux, a Kubernetes plugin integrating HPC scheduling capabilities. Learn about its performance benefits and potential for enhancing cloud-based high-performance computing workloads.
Explore advanced Kubernetes networking for AI and HPC workloads, focusing on multi-interface support, Multus CNI, and the new Multi-NIC CNI project for improved performance and scalability in large-scale GPU clusters.
Explore new Kubernetes Job API features for distributed Batch/AI/HPC workloads, including Indexed Jobs and Pod Failure Policy, with real-world examples from DeepMind and Lawrence Livermore National Laboratory.
Explore high-performance middleware design for exascale systems, focusing on the MVAPICH2 project's features and performance in HPC, AI, and data science applications across various platforms.
Explore HPC-based big data platform management with Chameleon, an extension of Apache Ambari. Learn about Lustre filesystem integration, advanced YARN monitoring, and dynamic dashboards for streamlined operations.
Explore the HPC/HTC landscape in cloud-native environments, covering solutions, deployments, workload migration, resource management, and tools for high-performance computing.
Learn best practices for SRE teams supporting GPU-enabled Kubernetes clusters for HPC and AI workloads, including key metrics, monitoring tools, and operational strategies.
Explore InterLink, an open-source Virtual-Kubelet extension enabling seamless access to external hardware-accelerated machines through Kubernetes APIs, revolutionizing scientific computing workflows.
Explore TACC, a unified cloud-native infrastructure for AI and HPC that bridges the advantages of K8S and Slurm. Learn about its seamless UI, multi-tenant resource management, and robust distributed infrastructure.
Explore TACC, an innovative AI infrastructure solution that bridges the advantages of K8S and Slurm. Learn about its seamless UI, multi-tenant resource management, and robust distributed infrastructure for large-scale GPU clusters.
Explore coordinated checkpointing for distributed HPC applications across multiple Pods and nodes, including new CRIU mechanisms, Kubelet Checkpoint API, and potential Kubernetes scheduler extensions.
Explore mixed precision algorithms in HPC, focusing on convergence properties and performance considerations for modern hardware implementations of floating-point arithmetic.
Explore Julia's potential in HPC, where it combines a scientific ecosystem with high performance. Gain insights on its strengths, limitations, and notable international projects from an experienced user.
Explore recent LLVM efforts for HPC: portable CUDA, debugging at scale, GPU execution of legacy codes, automatic differentiation, ML in compilers, and the impact of static information.
Explore the capabilities and differences of various HPC GPUs, comparing NVIDIA models, examining competitors, and understanding new developments like APUs.