Squeezing the Hardware to Make Performance Juice

NDC Conferences via YouTube

Vectorization: Vector Normalization

13 of 22

Classroom Contents

  1. Intro
  2. A Modern Processor
  3. Processor Organization: Intel Nehalem+
  4. Skylake Backend (sustained 4 uops/cycle)
  5. Intel Top-Down Approach
  6. Processor Behavior: Branch Prediction
  7. Processor Behavior: ILP
  8. Processor Behavior: Vector Units
  9. Processor Behavior: Parallelization
  10. Categories of Vector Instructions
  11. Instruction Performance
  12. Vectorization: Matrix Multiplication
  13. Vectorization: Vector Normalization
  14. Vectorization: AoS vs. SoA
  15. Vectorization: N-Body Simulation
  16. Vectorized strstr Illustrated
  17. Vectorization: strstr
  18. Vectorization: Sorted Set Intersection
  19. Cache Structure, typical i5 (no L4 EDRAM)
  20. Cache: Data Access Reordering
  21. Cache: Tiling
  22. Memory Bottleneck
