Overview
Explore a comprehensive video analysis comparing newly released open source reasoning models from various companies against OpenAI's proprietary alternatives, demonstrating the current capabilities gap between open source and closed source language models. Learn about key developments including DeepSeek R1 Chat's performance benchmarks and interface demonstration, Qwen-QwQ's technical specifications and real-world applications, and the Marco-o1 model's features through detailed examinations and practical demonstrations. Delve into fundamental concepts of reasoning in language models through discussions of influential papers like "Let's Verify Step by Step" and "Chain-of-Thought Prompting Elicits Reasoning in LLMs," while gaining insights into the differences between standard and reasoning-focused LLMs. Access additional resources including GitHub repositories for LLM tutorials, relevant research papers, and opportunities to learn more about building LLM agents through the provided links and community platforms.
Syllabus
Intro OpenAI o1 December Release
New Reasoning Models
Learning to Reason with LLMs Blog
Standard LLM
Reasoning LLM
Let's Verify Step by Step Paper
Chain-of-Thought Prompting Elicits Reasoning in LLMs Paper
DeepSeek-R1-Light-Preview
DeepSeek Benchmarks
DeepSeek Chat Interface Demo
Qwen-QwQ
Qwen-QwQ Benchmarks
Qwen-QwQ Chat Interface Demo
Marco-o1
Marco-o1 Paper
Taught by
Sam Witteveen