Advanced AI Agents, Claude Prompt Caching, Grok-2, and Efficient RAG - LLM News Update
Elvis Saravia via YouTube
Overview
Syllabus
Claude Prompt Caching - https://www.anthropic.com/news/prompt-caching
Grok-2 - https://youtu.be/NzbLqwTXt-U?si=Rt9154SRy2jzWZa9
LMSYS Chatbot Arena - https://x.com/lmsysorg
Genie - https://youtu.be/LBa6gRvarzk?si=6rvS8CJiWMSVlM-x
JSON output not always good! - https://aider.chat/2024/08/14/code-in-json.html
The AI Scientist - https://youtu.be/WPh7oXiJFWc?si=D0_aouM93j34HyKF
Agent Q - https://www.multion.ai/blog/introducing-agent-q-research-breakthrough-for-the-next-generation-of-ai-agents-with-planning-and-self-healing-capabilities
Efficient RAG - https://x.com/omarsar0/status/1822744591810114044
rStar - https://arxiv.org/abs/2408.06195
Distilling & Pruning Llama 3.1 8B - https://developer.nvidia.com/blog/how-to-prune-and-distill-llama-3-1-8b-to-an-nvidia-llama-3-1-minitron-4b-model/
Taught by
Elvis Saravia