Leveraging Wasm for Portable AI Inference Across GPUs, CPUs, OS and Cloud-Native Environments

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Grab it

Explore the advantages of using WebAssembly (Wasm) for AI inference tasks in cloud-native ecosystems through this 25-minute conference talk. Discover how Wasm enables developers to create AI applications on their personal computers that can be uniformly executed across various hardware platforms, including GPUs, CPUs, operating systems, and edge cloud environments. Learn about Wasm's seamless integration with cloud-native frameworks, enhancing the deployment and scalability of AI applications. Gain insights into how Wasm provides a flexible and efficient solution for diverse cloud-native architectures, including Kubernetes, allowing developers to fully harness the potential of large language models (LLMs), particularly open-source ones. Tailored for cloud-native practitioners and AI developers, this talk offers valuable knowledge on maximizing AI application potential by leveraging Wasm's cross-platform capabilities, ensuring consistency, cost-effectiveness, and efficiency in AI inference across various computing environments.

Syllabus

Leveraging Wasm for Portable AI Inference Across GPUs, CPUs, OS & Cloud-Nativ... Miley Fu & Lucas Lu

Taught by

CNCF [Cloud Native Computing Foundation]

Reviews

Start your review of Leveraging Wasm for Portable AI Inference Across GPUs, CPUs, OS and Cloud-Native Environments

Taught by

Leveraging Wasm for Portable AI Inference Across GPUs, CPUs, OS and Cloud-Native Environments

Cloud-Native AI: Wasm in Portable, Secure AI/ML Workloads

Wasm across Any Cloud, Any Kubernetes, or Any Edge with CNCF wasmCloud

Efficient and Portable AI/LLM Inference on the Edge Cloud - Workshop

Write Once Run Anywhere for GPUs - Portable AI Workloads with Rust and WebAssembly

Building Serverless AI Workflows with Wasm and Rust

9 Best Kubernetes Courses for 2024

Never Stop Learning.