Entry-wise multiplication of a sparse matrix on GPU

Using a GPU, I need to multiply sparse matrices using the entry-wise 'times' (i.e. .*) operation. As far as I can tell, this is not currently supported for gpuArrays.
For example,
>> rand(5).*sparse(rand(5))
and
>> rand(5,'gpuArray').*rand(5,'gpuArray')
both work, but
>> rand(5,'gpuArray').*sparse(rand(5,'gpuArray'))
Error using .*
Sparse gpuArrays are not supported for this function.
Is there any way I can get around this without converting the matrices back to full (which would negate most/all of the advantage of using the GPU)?
Thanks

Answers (1)

Is the sparsity the same for both matrices?
[I, J, VA] = find(Asparse);
[~, ~, VB] = find(Bsparse);
C = sparse(I, J, VA.*VB);
If you just want to multiply it by itself you can use SPFUN:
Asqr = spfun(@(x)x.^2, Asparse);
If the sparsity is different then you have to merge the indices unfortunately. It's a horrible sequence of operations involving sorting and indexing, and not very efficient which is why it hasn't been implemented. I'm curious as to what your application is - maybe there's another solution that doesn't involve element-wise operations.
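To make that merge concrete, here is an illustrative CPU-only sketch (my own, not an efficient implementation, and not supported on gpuArrays). It relies on the fact that A.*B can only be nonzero at positions present in both sparsity patterns, so it intersects the two index lists:

```matlab
% Element-wise product of two same-sized sparse matrices A and B
% whose sparsity patterns differ (CPU sketch).
[IA, JA, VA] = find(A);
[IB, JB, VB] = find(B);
% Keep only the (row, col) pairs common to both patterns
[IJ, ka, kb] = intersect([IA JA], [IB JB], 'rows');
C = sparse(IJ(:,1), IJ(:,2), VA(ka).*VB(kb), size(A,1), size(A,2));
```

The sort inside intersect is the expensive part alluded to above.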

9 Comments

Thanks,
Unfortunately the matrices may be of different sparsity.
Any work-around I can think of seems to fall apart because of the inability to index sparse arrays on the gpu.
i.e.
>> Asparse(1,1)
Error using gpuArray/bsxfun
Sparse gpuArrays are not supported for this function.
If you will excuse a very naive question: why is this not possible for sparse arrays on a GPU?
---------------------------------------------------------
I am working on an ecosystem model. The main state variable matrix P is [ns x np], where np is the number of populations, and ns is the number of points in a spatial grid.
The main overhead is multiplication of P by a trophic interaction matrix G [np x np] that describes who is eating whom. This is the main cost of the model, but I can save a lot of time because P is sparse.
However, there are several other functions that require entry-wise multiplication.
For example, I need to multiply the growth rate mu by a temperature dependence function gamma by the resource affinity alpha by the resource concentration N by the population size P. (N and P are time-dependent state variables)
(mu*gamma).*(alpha*N).*P
with dimensions
mu:    np x 1
alpha: np x 1
gamma: 1 x ns
N:     1 x ns
P:     np x ns
I cannot see a way of avoiding the entry-wise product here. It is also worth noting that this is the simplest, linear form of the problem for which I could not find a way to avoid times. The actual equation is non-linear, but if you have any suggestions for this case, it might set me in the right direction.
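For what it's worth, this particular linear example factorizes: (mu*gamma).*(alpha*N) has entries mu(i)*alpha(i)*gamma(j)*N(j), i.e. it is the rank-1 outer product (mu.*alpha)*(gamma.*N). Since the result is then masked by sparse P, it can be evaluated at P's nonzero positions only. A CPU sketch (my own, assuming mu and alpha are np-by-1 columns and gamma and N are 1-by-ns rows):

```matlab
% C = (mu*gamma).*(alpha*N).*P evaluated only on the pattern of sparse P
[I, J, VP] = find(P);          % nonzero pattern and values of P
u = mu .* alpha;               % np x 1 row factor
v = (gamma .* N).';            % ns x 1 column factor
% C(i,j) = u(i) * v(j) * P(i,j), nonzero only where P is
C = sparse(I, J, u(I) .* v(J) .* VP, size(P,1), size(P,2));
```

This avoids forming any dense np-by-ns intermediate, though as noted above the indexing steps are not available for sparse gpuArrays.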
@Ben: The GPU libraries and the hardware are optimized for processing full matrices. Working with sparse matrices is a completely different job, because for full arrays the arrangement of the elements is determined by their position in the memory, while for sparse arrays there is an additional layer, which contains the indices. There are several different ways to implement sparse arrays, and not all of them might be implemented in the GPU libraries.
Because the GPU hardware is not optimized for sparse arrays, it is not certain that processing them there is faster than on the CPU at all.
For arrays of different sparsity, the following seems to work (in R2017a)...
[IA, JA, VA] = find(A);
[IB, JB, VB] = find(B);
[Ci, I, J] = intersect([IA JA], [IB JB], 'rows');
C = sparse(Ci(:,1), Ci(:,2), VA(I).*VB(J));
But it can be quite inefficient in some cases.
What about:
K = find(A);
[I, J, VA] = find(A);
C = sparse(I, J, VA .* B(K));
I would not expect a C-mex to be faster than the built-in libraries for sparse matrices. But the problem remains: the suggested methods do not work on a GPU. Even Asparse(1,1) fails, so B(K) is not an option either.
Hi Jan,
"Because the GPU hardware is not optimized for sparse arrays, it is not certain that processing them there is faster than on the CPU at all."...
This is what I was afraid of.
Interestingly, the GPU (with full matrices) does become more efficient relative to the CPUs (with sparse matrices) as the ecosystem grows and the P matrix becomes less sparse.
Being optimistic, NVIDIA do provide a sparse matrix library (cuSPARSE), so perhaps more functionality can be incorporated into future releases?
Thanks
@Ben: Ask Matlab directly for a feature enhancement. This is sometimes more useful than hoping :-)
Thanks for your answer Jan - I should have noticed that intersect already did all the things I needed!
However, Jan's answer says it all I think - it gives the right answer, but on my Kepler card it's 15x slower. It's not efficient on the GPU and so the workaround is just to do it on the CPU. Any native implementation we wrote would have to do much of what intersect and the three-input form of sparse are having to do, and it's none too efficient on a GPU. Nevertheless, I'm sure something better could be done.
I will take it as an enhancement request (TBH we already have one). When we added support for sparse gpuArrays we just put in the most useful functionality first. The most common use case for sparse arrays is solving linear systems, which is all matrix-matrix and matrix-vector multiplication. But it's great to have a real use case for this. times is actually one of the easier element-wise functions because the output sparsity is the intersection rather than the union.
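A tiny CPU illustration of that last point, comparing the output patterns of times and plus:

```matlab
% The pattern of A.*B is the intersection of the two patterns;
% the pattern of A+B is their union.
A = sparse([1 0; 0 2]);
B = sparse([3 0; 4 0]);
nnz(A.*B)   % 1: only (1,1) is nonzero in both A and B
nnz(A+B)    % 3: (1,1), (2,1) and (2,2) are nonzero in A or B
```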
You refer to "inability to extract individual elements from a sparse array on the gpu" - but of course this is possible using find. It's indexing that isn't supported.
It was hard to tell from your example equation but it looks like a lot of those operations were multiplication by a scalar - that is supported.
Hi Joss and Jan, Thanks both for your replies.
Joss, I've edited my initial reply to make it clearer for future readers. For reference, 'a real use case' can be found here...
https://en.wikipedia.org/wiki/Matrix_population_models
The core of these models is the mtimes operation, but entry-wise operations will frequently be essential as well. Adding this functionality would therefore be very useful in supporting this kind of research.
thanks again.
Ben
Thanks Ben. Just to reiterate, there's no guarantee a GPU implementation would be faster than the CPU. Don't convert your sparse matrices to dense to work around this issue; gather them to the CPU instead.



Asked: 8 Aug 2017
Commented: 12 Aug 2017
