Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Adapting Image-based Reinforcement Learning Policies via Predicted Reward Fine-Tuning

Discover AI via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn about groundbreaking research from Johns Hopkins University in a 16-minute video exploring Predicted Reward Fine-Tuning (PRFT), a novel solution for domain shift challenges in Image-based Reinforcement Learning. Dive into the innovative approach that enables effective sim-to-real transfer of AI intelligence through imitation learning and behavior cloning between AI agents. Master the core methodology of PRFT, which combines policy and reward prediction model training using Maximum Entropy RL algorithm, addressing the critical challenge of visual environment changes between training and deployment. Explore how PRFT outperforms traditional methods like data augmentation and domain randomization by leveraging imperfect predicted rewards as valuable signals for policy fine-tuning in target domains, demonstrating superior performance in both simulated and real-world scenarios with high-intensity visual distractions.

Syllabus

Domain Shift solved: Predicted Reward Fine-Tuning

Taught by

Discover AI

Reviews

Start your review of Adapting Image-based Reinforcement Learning Policies via Predicted Reward Fine-Tuning

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.