The most efficient way to add a new column or new row to a sparse matrix

Question

Ming on 13 Jul 2017

0
Link

Direct link to this question

https://www.mathworks.com/matlabcentral/answers/348521-the-most-efficient-way-to-add-a-new-column-or-new-row-to-a-sparse-matrix

Answered: James Tursa on 14 Jul 2017

Hi, suppose I have a large scale sparse matrix A, i.e. size(A)=[100k,10k]. I am wondering, what is the most efficient way to append a new row/column at the end of matrix A?

1 Comment
Show -1 older commentsHide -1 older comments

Walter Roberson on 14 Jul 2017

The document http://www.mathworks.com/help/pdf_doc/otherdocs/simax.pdf talks about the implementation.

Sign in to comment.

Sign in to answer this question.

Answer 1

James Tursa on 14 Jul 2017

1
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/348521-the-most-efficient-way-to-add-a-new-column-or-new-row-to-a-sparse-matrix#answer_274100

Open in MATLAB Online

In general, every time you significantly change the size of a sparse matrix (so that you exceed the current nzmax of the matrix) all of the current data must be copied to a new memory block. If you do this repeatedly in your program, this can seriously drag your performance. So typically the most efficient way to change the size of the sparse matrix is to gather up all of the changes you wish to make and then apply them to the sparse matrix all at once. That way this copying of the current data only happens once.

If you have to add more data to the sparse matrix in a piecemeal fashion, then the best course of action would be to allocate enough extra memory up front to hold that additional data, and then add that data in as appended column data at the end. That way the existing sparse data would not need to be copied to a new memory block every time you added the new data. E.g., a simplistic example:

I pre-allocate space for three elements, but only have one non-zero element to begin with. As new non-zero elements are appended, the memory block stays the same (i.e., the pr data pointer points to the same memory block). So the current data is undisturbed and the only thing that happens is the new data gets appended to the end of the current data. All well and good. But then with the fourth element I exceed the three element nzmax limit. This causes the entire data set to be copied over into a new memory block (i.e. the pr pointer changes).

>> format debug
>> S = sparse(1,1,1,10,10,3)
S =
Structure address = 649a7b8 
m = 1
n = 1
pr = 1eda8d80 
pi = 0
   (1,1)        1
>> S(2,1) = 2
S =
Structure address = 649a7b8 
m = 2
n = 1
pr = 1eda8d80    <-- Same memory block
pi = 0
   (1,1)        1
   (2,1)        2
>> S(3,1) = 3
S =
Structure address = 649a7b8 
m = 3
n = 1
pr = 1eda8d80    <-- Same memory block 
pi = 0
   (1,1)        1
   (2,1)        2
   (3,1)        3
>> S(4,1) = 4
S =
Structure address = 649a7b8 
m = 4
n = 1
pr = 1edf1530   <-- New memory block
pi = 0
   (1,1)        1
   (2,1)        2
   (3,1)        3
   (4,1)        4

At this point you could keep appending data until the new nzmax(S) is exceeded (whatever that happens to be), at which point the entire data must be copied again into a newly allocated memory block.

Bottom line is that this incremental process of appending data can get very expensive to your runtime if you don't have that extra space pre-allocated ahead of time. But if you create your sparse matrix from the get-go with the extra memory and you append your new data as columns at the end, then the current data will not need to be recopied every time you append new data and things will go much faster.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Answer 2

Andrei Bobrov on 13 Jul 2017

0
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/348521-the-most-efficient-way-to-add-a-new-column-or-new-row-to-a-sparse-matrix#answer_273983

Open in MATLAB Online

% Adding a new row to the end of the matrix A:
A(end+1,[190 200]) = [7 89];
% Adding a new column to the end of the matrix A:
A([190 200],end+1) = [7 89];

1 Comment
Show -1 older commentsHide -1 older comments

Ming on 14 Jul 2017

why this way is the most efficient way?

Sign in to comment.

The most efficient way to add a new column or new row to a sparse matrix

1 Comment
Show -1 older commentsHide -1 older comments

Accepted Answer

0 Comments
Show -2 older commentsHide -2 older comments

More Answers (1)

1 Comment
Show -1 older commentsHide -1 older comments

See Also

Categories

Tags

Products

Community Treasure Hunt

The most efficient way to add a new column or new row to a sparse matrix

1 Comment Show -1 older commentsHide -1 older comments

Accepted Answer

0 Comments Show -2 older commentsHide -2 older comments

More Answers (1)

1 Comment Show -1 older commentsHide -1 older comments

See Also

Categories

Tags

Products

Community Treasure Hunt

1 Comment
Show -1 older commentsHide -1 older comments

0 Comments
Show -2 older commentsHide -2 older comments

1 Comment
Show -1 older commentsHide -1 older comments