Symbolic Knowledge Distillation- From General Language Models to Commonsense Models Explained

Overview

Explore an innovative approach to creating commonsense knowledge graphs using large language models in this 45-minute video lecture. Learn how to leverage GPT-3's capabilities through clever prompting techniques and targeted critic models to generate high-quality symbolic knowledge. Discover the process of distilling commonsense knowledge from general language models, and understand how this method outperforms traditional human-generated corpora in quantity, quality, and diversity. Gain insights into the Symbolic Knowledge Distillation framework, its application to the ATOMIC dataset, and its potential to revolutionize the training of smaller, specialized commonsense models. Follow along as the lecture covers topics such as generating events and inferences, evaluating datasets, and using critic models to filter data, ultimately leading to key findings that challenge conventional approaches in NLP and commonsense reasoning.

Syllabus

- Intro & Overview
- Sponsor: Weights & Biases
- Commonsense Knowledge Graphs
- ATOMIC dataset
- Generating the corpus from a model
- Prompting GPT-3
- Generating Events
- Generating Inferences
- Evaluating the created dataset
- Introducing the critic
- Using the critic to filter the data
- Training a student on the generated data
- Key Findings
- Comments & Conclusion

Taught by

Yannic Kilcher

Reviews

Start your review of Symbolic Knowledge Distillation- From General Language Models to Commonsense Models Explained

Taught by

OpenAI Fine-tuning vs Distillation - Techniques and Implementation

GPT-3 - Language Models Are Few-Shot Learners

Training Large Language Models - GPT-NeoX-20B, BigScience BLOOM, OPT-175B Explained

Train GPT-3 on Any Corpus of Data with ChatGPT and Knowledge Graphs - SCOTUS Opinions Part 1

Towards Neuro-Symbolic AI with Knowledge Graphs and Large Language Models

Understanding ChatGPT: From Machine Learning to Language Models

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

Never Stop Learning.