Overview
Syllabus
Intro
Introducing the SIMD features of your CPU
Scalar parallel computing based on concurrency
Vector parallel computing based on SIMD machine
Shape and species of a Vector
Creating vectors from arrays to sum them
Loading any array in a vector using masks
Avoiding masking when it is not supported
Parallel cross-lanes and lane-wise operations
Computing the norm of a vector in parallel
Computing the average of vector components in parallel
Filtering and compressing a vector in parallel
Reducing a vector in parallel
Wrapping up parallel computations using vectors
Examples, patterns and performances
Outro
Taught by
Java