Overview
Explore key concepts in modern AI development through a comprehensive lecture covering RLHF (Reinforcement Learning with Human Feedback), the training methodology behind ChatGPT, and the critical aspects of alignment in Large Language Models. Delve into the technical foundations of efficient transformers, including the Performer architecture, while understanding how these components work together in creating advanced language models. Learn about the development and implementation of Instruct GPT, examining how human feedback mechanisms are integrated into AI training processes to improve model performance and reliability. Master the fundamental principles of AI alignment and discover how these concepts are applied in contemporary language model development.
Syllabus
Ali Ghodsi, Deep Learning, RLHF, GhatGPT, Alignment in LLMs, Fall 2023, Lecture 14
Taught by
Data Science Courses