Example gallery

Parallelization of a dot product with processes (joblib)

Compares matrix multiplication implementations with timeit

Parallelization of a dot product with processes (concurrent.futures)

Compares dot implementations (numpy, python, blas)

Compares filtering implementations (numpy, cython)

Compares dot implementations (numpy, c++, sse, openmp)

Measuring CPU performance with a parallelized vector sum

Measuring CUDA performance with a vector addition with streams

Compares dot implementations (numpy, cython, c++, sse)

Measuring CPU performance with a parallelized vector sum and AVX