LLMOps: Comparison of OpenVino, ONNX, TensorRT, and PyTorch Inference
The Machine Learning Engineer via YouTube
Overview
Explore the world of model conversion and inference optimization in this 40-minute video tutorial. Learn how to convert a machine learning model to ONNX, OpenVino, and Tensor-RT formats, and compare their inference performance on both CPU and GPU against native PyTorch inference. Gain practical insights into LLMOps techniques for data science and machine learning applications. Access the accompanying notebook on GitHub to follow along and experiment with the demonstrated concepts.
Syllabus
LLMOps: Comparison Openvino, ONNX, TensorRT and Pytorch Inference #datascience #machinelearning
Taught by
The Machine Learning Engineer