Explore the challenges and solutions in adopting model compilation for production models in IT companies through this conference talk. Learn about ByteIR, a tool developed to improve model compilation productivity, built on OpenXLA and LLVM/MLIR compiler infrastructure. Discover how ByteIR's three components - frontends, compiler, and runtime - address issues such as model coverage, framework integration, performance optimization, new ASIC adoption, and backward library transition. Understand how these components can work together or independently to meet various business needs, ultimately leading to seamless model compilation integration in AI acceleration.
Overview
Syllabus
Toward Seamless Model Compilation Integration - Hongyu Zhu, ByteDance
Taught by
CNCF [Cloud Native Computing Foundation]