Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn how to deploy high-performance and energy-efficient foundation AI models on Kubernetes in this informative conference talk. Discover the five crucial steps: containerizing foundation models, deploying them on Kubernetes, measuring energy consumption during model serving, reducing energy consumption through GPU frequency tuning, and analyzing the tradeoffs between model inference performance and energy costs. Gain valuable insights into running sustainable AI models on container platforms, addressing the growing concern of balancing impressive AI capabilities with energy efficiency in cloud-native environments.
Syllabus
5 Steps to Deploy Cloud Native Sustainable Foundation AI Models - Chen Wang, IBM & Huamin Chen
Taught by
Linux Foundation