Combined Preference and Supervised Fine-Tuning with ORPO

Contents
- 1 Preference and Supervised Fine-tuning at the Same Time!
- 2 A short history of fine-tuning methods
- 3 Video Overview/Agenda
- 4 Differences between Unsupervised, Supervised, and Preference Fine-tuning
- 5 Understanding cross-entropy and odds ratio loss functions (the combined objective is written out after this list)
- 6 Why preference fine-tuning improves performance
- 7 Notebook demo of SFT and ORPO (a minimal training sketch follows this list)
- 8 Evaluation with lm-evaluation-harness (an evaluation sketch also follows)
- 9 Results: Comparing SFT and ORPO with gsm8k, arithmetic and mmlu
- 10 Evaluation with Carlini's practical benchmark
- 11 Is it worth doing ORPO? Yes!
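
Chapter 5 contrasts the two loss terms that ORPO combines. For reference, the objective from the ORPO paper (Hong et al., 2024) can be written out directly; $\lambda$ is the weight on the preference term (exposed as `beta` in some trainer implementations).

```latex
% ORPO objective: standard SFT cross-entropy on the chosen response,
% plus an odds-ratio penalty that prefers the chosen over the rejected one.
\[
\mathcal{L}_{\mathrm{ORPO}}
  = \mathbb{E}_{(x,\,y_w,\,y_l)}\!\left[\mathcal{L}_{\mathrm{SFT}}
      + \lambda\,\mathcal{L}_{\mathrm{OR}}\right],
\]
where $\mathcal{L}_{\mathrm{SFT}}$ is the usual cross-entropy
(negative log-likelihood) of the chosen response $y_w$, and
\[
\mathcal{L}_{\mathrm{OR}}
  = -\log \sigma\!\left(\log
      \frac{\mathrm{odds}_\theta(y_w \mid x)}{\mathrm{odds}_\theta(y_l \mid x)}\right),
\qquad
\mathrm{odds}_\theta(y \mid x)
  = \frac{P_\theta(y \mid x)}{1 - P_\theta(y \mid x)}.
\]
```

Because the odds ratio is computed from the policy's own probabilities, no frozen reference model is needed, which is what lets ORPO fold preference optimization into the same pass as supervised fine-tuning.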
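
Chapter 7 walks through a notebook demo. The video's own notebook is not reproduced here; as a rough stand-in, this is a minimal sketch using Hugging Face TRL's `ORPOTrainer`. The base model (`gpt2`), the dataset (`trl-lib/ultrafeedback_binarized`), and the hyperparameters are assumptions for illustration, not necessarily what the video uses, and argument names vary across TRL releases.

```python
# Minimal ORPO training sketch with Hugging Face TRL.
# Assumptions (not from the video): model "gpt2", dataset
# "trl-lib/ultrafeedback_binarized", and a recent TRL release.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

model_name = "gpt2"  # placeholder; the video likely uses a larger model
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default

# Preference data: rows with "prompt"/"chosen"/"rejected" fields.
train_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

config = ORPOConfig(
    output_dir="orpo-demo",
    beta=0.1,  # lambda in the ORPO paper: weight on the odds-ratio term
    per_device_train_batch_size=2,
    num_train_epochs=1,
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=train_dataset,
    processing_class=tokenizer,  # named `tokenizer=` in older TRL versions
)
trainer.train()
```

The same data and model with a plain `SFTTrainer` gives the SFT baseline the video compares against; ORPO only differs in consuming the rejected responses as well.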
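
Chapters 8 and 9 score the SFT and ORPO checkpoints with EleutherAI's lm-evaluation-harness on gsm8k, arithmetic, and mmlu. Below is a sketch of that kind of run via the harness's Python entry point, assuming lm-eval v0.4+ and a placeholder checkpoint path; the exact tasks and arguments used in the video may differ.

```python
# Sketch: scoring a fine-tuned checkpoint with lm-evaluation-harness
# (EleutherAI lm-eval, v0.4+ API). The checkpoint path is a placeholder.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",  # Hugging Face transformers backend
    model_args="pretrained=./orpo-demo",  # placeholder path from the sketch above
    tasks=["gsm8k", "mmlu"],  # the video also runs an arithmetic task group
    batch_size=8,
)

# Per-task metrics, e.g. exact_match for gsm8k and acc for mmlu.
for task, metrics in results["results"].items():
    print(task, metrics)
```

Running the same call against both the SFT and the ORPO checkpoints gives the side-by-side numbers the results chapter discusses.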