Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a detailed video analysis of Alibaba Group's latest open-source language models, focusing on the Qwen 2.5 72B Instruct model and its specialized variants for mathematics and vision processing. Learn about the mathematical model Qwen2.5-Math-72B-Instruct, which outperforms previous versions with significant improvements in both English and Chinese capabilities, and discover the new vision model featuring Multimodal Rotary Position Embedding (M-RoPE) technology for handling various image resolutions. Understand the differences between instruction models designed for chatting and base models intended for few-shot inference and fine-tuning, while examining performance comparisons with other large language models like Llama 3.1 405B. Gain insights into the technical aspects of these models, including quantization options available on HuggingFace and the implementation of RoPE for extended context length processing up to 100K tokens.
Syllabus
New Qwen2.5-72B MATH & Vision (BEST Open-Source?)
Taught by
Discover AI