Learn about the fascinating intersection of vision and language in Large Language Models (LLMs) through this comprehensive lecture that explores how these advanced AI systems process and understand both visual and textual information simultaneously. Delve into the technical architecture, capabilities, and real-world applications of multimodal LLMs, examining how they bridge the gap between computer vision and natural language processing to enable more sophisticated AI interactions.
Overview
Syllabus
Vision-and-Language LLMs
Taught by
UofU Data Science