LLMOPs: Multimodal Prompting and Inference with Phi-3 Vision 128K Instruct on CPU - ONNX 4-Bit Quantization in C#

Overview

Explore multimodal prompting and inference on CPU using Phi 3 Vision 128K Instruct model quantized to 4 bits in ONNX format with C# in this 23-minute video tutorial. Learn how to implement LLMOPs (Large Language Model Operations) for data science and machine learning applications. Access the accompanying code on GitHub to follow along and practice the demonstrated techniques. Gain insights into optimizing inference for resource-constrained environments and leveraging advanced language models for vision-based tasks.

Syllabus

LLMOPs: Inference en CPU Phi3 Vision 128k Intruct ONNX 4bits in C# #datascience #machinelearning

Taught by

The Machine Learning Engineer

Reviews

Start your review of LLMOPs: Multimodal Prompting and Inference with Phi-3 Vision 128K Instruct on CPU - ONNX 4-Bit Quantization in C#

Taught by

LLMOPs: Inferencia en CPU con Phi3 Vision 128k Instruct - ONNX 4bits en C#

LLMOPs - Inference in CPU with Phi3 4k Instruct ONNX 4-bit Model Using C#

LLMOPs - Inferencia en CPU con Phi3 4k Instruct ONNX 4bits en C#

LLMOps: Quantization Models and Inference with ONNX Generative Runtime

LLMOps: Converting Video Classifier (ViViT) to ONNX and Inference on CPU

LLMOps: Inference of Fine-Tuned ViT Classifier on CPU with C#

10 Best Data Science Courses

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

14 Best C# and .NET Courses for 2024

Never Stop Learning.