Speed up this bottleneck line of code

Question

0 votes

Hello.

I have a big chunk of code, here is a part of it (arrays a, b and c, indeces ind1 and ind2 are actually different, its just an artificially constructed example):

n = 10000;
m = 5000;
k = 20;
a = rand(n,m);
b = rand(n,m);
c = rand(m+k,n)*1.1-0.1;
ind1 = repmat([ones(1,m/k),0]==1,1,k)';
ind2 = repmat([0,ones(1,m/k)]==1,1,k)';
tic;
d = m * sum(a .* b .* max(0, c(ind1,:) - c(ind2,:))',2);
toc

I am not satisfied with it's speed. After profiling my code i found that string

d = m * sum(a .* b .* max(0, c(ind1,:) - c(ind2,:))',2);

is a bottleneck of my code. Indeces ind1 and ind2 are almost entirely true.

Now im not sure now to improve from here, i've tried to google how to speed up .* operation or speed up indexing of matrix c but with no luck. Any advice would be great.

Thanks!

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Matt J on 22 Feb 2025

Edited: Matt J on 22 Feb 2025

Open in MATLAB Online

0 votes

If you can construct c in transposed form, there are some savings to be had,

n = 10000;
m = 5000;
k = 20;
a = rand(n,m);
b = rand(n,m);
c = rand(m+k,n)*1.1-0.1;
tic;
s=m/k+1;
[ind1,ind2]=deal((1:m+k)');
ind1(s:s:end)=[];
ind2(1:s:end)=[];
toc
Elapsed time is 0.004463 seconds.
tic;
d1 = m * sum(a .* b .* max(0, c(ind1,:)-c(ind2,:))',2);
toc
Elapsed time is 0.555621 seconds.
ct=c';
I=[(1:m)',(1:m)'];
S=repmat(m*[1,-1],m,1);
tic;
J=[ind1,ind2]; 
A=sparse(J,I,S,m+k,m);
d2 = sum( a.*b.*max(0,ct*A) ,2);
toc
Elapsed time is 0.347437 seconds.
percentDifference = norm(d1-d2)/norm(d1)*100
percentDifference = 4.0847e-14

2 Comments
Show None Hide None

Alex on 23 Feb 2025

Thanks!

It helps to cut off about 1/3 of time from this line!

Any suggestions about .* operator? Is it optimizable?

Matt J on 23 Feb 2025

Edited: Matt J on 23 Feb 2025

No, all the basic matrix operators are very well optimized by MathWorks.

Sign in to comment.

Answer 2

Walter Roberson on 22 Feb 2025

Open in MATLAB Online

0 votes

n = 10000;
m = 5000;
k = 20;
tic
a = rand(n,m);
b = rand(n,m);
c = rand(m+k,n)*1.1-0.1;
toc
Elapsed time is 1.545849 seconds.
ind1 = repmat([ones(1,m/k),0]==1,1,k)';
ind2 = repmat([0,ones(1,m/k)]==1,1,k)';
tic;
d = m * sum(a .* b .* max(0, c(ind1,:) - c(ind2,:))',2);
toc
Elapsed time is 0.560316 seconds.

Your bottleneck is in your generation of a, b, and c -- not in your calculation of d.

3 Comments
Show 1 older comment Hide 1 older comment

Walter Roberson on 22 Feb 2025

ind1 is equivalent to selecting 1:m/k . ind2 is equivalent to selecting 2:m/k+1. It might potentially be faster to use the numeric indices instead of logical indices.

Alex on 23 Feb 2025

What i mean by

"arrays a, b and c, indeces ind1 and ind2 are actually different, its just an artificially constructed example"

is that i dont post this part of code here, this code is actually different and optimized. These a, b, c, ind1 and ind2 are generated just to demonstrate calculations in row that is a bottleneck.

Sign in to comment.

Speed up this bottleneck line of code

0 Comments
Show -2 older comments Hide -2 older comments

Accepted Answer

2 Comments
Show None Hide None

More Answers (1)

3 Comments
Show 1 older comment Hide 1 older comment

Categories

Products

Release

Tags

Community Treasure Hunt

Speed up this bottleneck line of code

0 Comments Show -2 older comments Hide -2 older comments

Accepted Answer

2 Comments Show None Hide None

More Answers (1)

3 Comments Show 1 older comment Hide 1 older comment

Categories

Products

Release

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

2 Comments
Show None Hide None

3 Comments
Show 1 older comment Hide 1 older comment