Understanding Generalization from Pre-training Loss to Downstream Tasks

Overview

In this lecture, Tengyu Ma of Stanford University examines why pre-trained models generalize to downstream tasks. The talk analyzes how pre-training losses extract meaningful structural information from unlabeled data, focusing on the infinite-data regime. It shows that contrastive loss yields embeddings that capture the manifold distance between raw data points and the graph distance in the positive-pair graph, and that directions in the embedding space correspond to relationships between clusters of that graph. The lecture also covers recent results that incorporate the inductive bias of the architecture and demonstrate the implicit bias of optimizers in pre-training. Theoretical frameworks and empirical evidence are presented throughout to explain the behavior of practical pre-trained models.

Syllabus

Understanding Generalization from Pre-training Loss to Downstream Tasks

Taught by

Simons Institute
