Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

State of the Industry - Short-Circuiting Reward Functions

David Shapiro ~ AI via YouTube

Overview

Explore the complex landscape of AI safety and alignment in this 26-minute video lecture. Delve into the risks associated with learned optimization in advanced machine learning systems, examining the Inner Alignment Problem and the potential dangers of self-optimizing machines. Analyze the limitations of AI as presented in "The AI Alignment Problem" and consider the implications of world uncertainty on AI development. Investigate the importance of learning in AI systems, drawing parallels with the dangers of hyperpalatable foods in human behavior. Discuss the challenges of disentangling proximal and distal goals, and contemplate the potential objectives of a superintelligent AI. Evaluate the risks and implications of superintelligence, including its potential to intervene in goal information. Critically assess the flaws in current reasoning surrounding superintelligence containment, providing a comprehensive overview of key concerns in the field of AI safety.

Syllabus

- The risks of learned optimization in advanced machine learning systems
- The Inner Alignment Problem
- The dangers of self-optimizing machines
- The limitations of AI according to "The AI Alignment Problem"
- The Uncertainty of the World
- The importance of learning
- The dangers of hyperpalatable foods
- The cost of disentangling proximal from distal goals
- The goals of a superintelligence
- The possibility of an agent with a rich enough action space
- The dangers of a superintelligence
- The risks of a superintelligence intervening in goal information
- Flaws in the reasoning of superintelligence containment

Taught by

David Shapiro ~ AI

Reviews

Start your review of State of the Industry - Short-Circuiting Reward Functions

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.