Test Time Compute: Sampling and Chain of Thought Techniques

Trelis Research via YouTube

Classroom Contents

  1. OpenAI o1-type techniques for scaling test time compute
  2. Video overview: temperature, chain of thought
  3. Training compute versus test time compute
  4. Why spend more compute on test time / inference?
  5. Using verifiers to select the best answers
  6. Exploring and critiquing/verifying answers during inference
  7. Understanding temperature for sampling
  8. Should you set temperature to zero?
  9. Beam search
  10. Problems with setting a non-zero temperature
  11. Using top p, top k, min p, and best of
  12. Recap on choosing temperature for sampling
  13. How to implement chain of thought
  14. Setup for notebook run-through on GSM8K and HotpotQA
  15. Using sampling and chain of thought on HotpotQA and GSM8K
  16. Running vLLM in a Jupyter notebook allows for batching
  17. Scoring / grading with OpenAI gpt-4o-mini using regex enforcement
  18. Multi-threading the scoring / grading for speed
  19. Running the dataset multiple times to get the mean and mean absolute deviation of correct answers
  20. Controlling sampling parameters: min p, top p, top k, beam search, temperature
  21. Running temperature / sampling ablations WITHOUT chain of thought
  22. Chain of thought setup
  23. Running ablations WITH chain of thought
  24. GSM8K results charts
  25. HotpotQA results charts
  26. Recommendations on sampling, temperature and chain of thought
  27. Video resources