These results show the performance of the GPU or host PC when calculating the
matrix left division
of an NxN
matrix with an Nx1
vector. The number of operations
is assumed to be 2/3*N^3 + 3/2*N^2
.
This calculation is usually compute-bound, i.e. the performance depends mainly on how fast the GPU or host PC can perform floating-point operations.