MTIMESX - Fast Matrix Multiply with Multi-Dimensional Support - File Exchange

MTIMESX - Fast Matrix Multiply with Multi-Dimensional Support

Version 1.10.0.0 (256 KB) by James Tursa

Beats MATLAB 300% - 400% in some cases ... really!

4.6

(51)

19K Downloads

Updated 23 Feb 2011

View License

MTIMESX is a fast general purpose matrix and scalar multiply routine that has the following features:

- Supports multi-dimensional (nD, n>2) arrays directly
- Supports Transpose, Conjugate Transpose, and Conjugate pre-operations
- Supports singleton expansion
- Utilizes BLAS calls, custom C loop code, or OpenMP multi-threaded C loop code
- Can match MATLAB results exactly or approximately as desired
- Can meet or beat MATLAB for speed in most cases

MTIMESX has six basic operating modes:

- BLAS: Always uses BLAS library calls
- LOOPS: Always uses C loops if available
- LOOPSOMP: Always uses OpenMP multi-threaded C loops if available
- MATLAB: Fastest BLAS or LOOPS method that matches MATLAB exactly (default)
- SPEED: Fastest BLAS or LOOPS method even if it doesn't match MATLAB exactly
- SPEEDOMP: Fastest BLAS, LOOPS, or LOOPOMP method even if it doesn't match MATLAB exactly

MTIMESX inputs can be:

single
double
double sparse

The general syntax is (arguments in brackets [ ] are optional):

mtimesx( [directive] )
mtimesx( A [,transa] ,B [,transb] [,directive] )

Where transa, transb, and directive are the optional inputs:

transa = A character indicating a pre-operation on A:
transb = A character indicating a pre-operation on B:
The pre-operation can be any of:
'N' or 'n' = No pre-operation (the default if trans_ is missing)
'T' or 't' = Transpose
'C' or 'c' = Conjugate Transpose
'G' or 'g' = Conjugate (no transpose)
directive = One of the modes listed above, or other directives

Examples:

C = mtimesx(A,B) % performs the calculation C = A * B
C = mtimesx(A,'T',B) % performs the calculation C = A.' * B
C = mtimesx(A,B,'g') % performs the calculation C = A * conj(B)
C = mtimesx(A,'c',B,'C') % performs the calculation C = A' * B'
mtimesx('SPEEDOMP','OMP_SET_NUM_THREADS(4)') % sets SPEEDOMP mode with number of threads = 4

For nD cases, the first two dimensions specify the matrix multiply involved. The remaining dimensions are duplicated and specify the number of individual matrix multiplies to perform for the result. i.e., MTIMESX treats these cases as arrays of 2D matrices and performs the operation on the associated parings. For example:

If A is (2,3,4,5) and B is (3,6,4,5), then

mtimesx(A,B) would result in C(2,6,4,5), where C(:,:,i,j) = A(:,:,i,j) * B(:,:,i,j), i=1:4, j=1:5

which would be equivalent to the MATLAB m-code:

C = zeros(2,6,4,5);
for m=1:4
for n=1:5
C(:,:,m,n) = A(:,:,m,n) * B(:,:,m,n);
end
end

The first two dimensions must conform using the standard matrix multiply rules taking the transa and transb pre-operations into account, and dimensions 3:end must match exactly or be singleton (equal to 1). If a dimension is singleton then it is virtually expanded to the required size (i.e., equivalent to a repmat operation to get it to a conforming size but without the actual data copy). This is equivalent to a bsxfun capability for matrix multiplication.

Cite As

James Tursa (2024). MTIMESX - Fast Matrix Multiply with Multi-Dimensional Support (https://www.mathworks.com/matlabcentral/fileexchange/25977-mtimesx-fast-matrix-multiply-with-multi-dimensional-support), MATLAB Central File Exchange. Retrieved April 26, 2024.

MATLAB Release Compatibility

Created with R2007a

Compatible with any release

Platform Compatibility

Windows macOS Linux

Tags Add Tags

Acknowledgements

Inspired: Small size linear solver, frontal, Merton Structural Credit Model (Matrixwise Solver)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Version	Published	Release Notes
1.10.0.0	23 Feb 2011	Fixed typos in the dsyrk, dsyr2k, ssyrk, and ssyr2k BLAS function prototypes. (These typo fixes should not change any results)	Download
1.9.0.0	7 Oct 2010	Added OpenMP support for custom code Expanded sparse * single and sparse * nD support Fixed (nD complex scalar)C * (nD array) bug	Download
1.8.0.0	10 Aug 2010	Added capability for nD scalar multiplies (i.e., 1x1xN * MxKxN). Replaced the buggy mxRealloc API routine with custom code. Updated mtimesx_test_nd.m file.	Download
1.7.0.0	23 Feb 2010	Fixed a bug for some of the (row vector) * (matrix) and (matrix transposed) * (column vector) operations in MATLAB mode that involved incorrect dimensions in the dgemv and sgemv calls.	Download
1.6.0.0	16 Feb 2010	Fixed a typo in the build routine for 64-bit systems, changed -largearraydims to -largeArrayDims	Download
1.5.0.0	7 Jan 2010	Added the multi-dimensional test routine mtimesx_test_nd.m for speed and equality tests. Added its description to the pdf file.	Download
1.4.0.0	11 Dec 2009	Fixed a bug for empty transa or transb inputs. Now treats these the same as 'N'. Also simplified the nD singleton expansion code a bit.	Download
1.3.0.0	10 Dec 2009	Added singleton expansion capability for multi-dimensional matrix multiplies.	Download
1.2.0.0	7 Dec 2009	Fixed bug in scalar multiply code that was causing incorrect results & crashes.	Download
1.1.0.0	4 Dec 2009	Fixed bug for (scalar) * (sparse) when the scalar contains an inf or NaN (in which case the zeros do not stay zero). Slight update to pdf doc.	Download
1.0.0.0	30 Nov 2009		Download