gemm¶
Computes a matrix-matrix product with general matrices.
gemm
supports the following precisions.
Ts
Ta
Tb
Tc
float
half
half
float
half
half
half
half
float
float
float
float
double
double
double
double
std::complex<float>
std::complex<float>
std::complex<float>
std::complex<float>
std::complex<double>
std::complex<double>
std::complex<double>
std::complex<double>
Description
The gemm
routines compute a scalar-matrix-matrix product and add the
result to a scalar-matrix product, with general matrices. The
operation is defined as
C <- alpha*op(A)*op(B) + beta*C
where:
op(X)
is one of op(X) = X
, or op(X) = XT
, or
op(X) = XH
,
alpha
and beta
are scalars,
A
, B
and C
are matrices:
op(A)
is an m
-by-k
matrix,
op(B)
is a k
-by-n
matrix,
C
is an m
-by-n
matrix.
gemm (Buffer Version)¶
Syntax
-
void
onemkl::blas
::
gemm
(sycl::queue &queue, onemkl::transpose transa, onemkl::transpose transb, std::int64_t m, std::int64_t n, std::int64_t k, Ts alpha, sycl::buffer<Ta, 1> &a, std::int64_t lda, sycl::buffer<Tb, 1> &b, std::int64_t ldb, Ts beta, sycl::buffer<Tc, 1> &c, std::int64_t ldc)¶
Input Parameters
- queue
The queue where the routine should be executed.
- transa
Specifies the form of
op(A)
, the transposition operation applied toA
.- transb
Specifies the form of
op(B)
, the transposition operation applied toB
.- m
Specifies the number of rows of the matrix
op(A)
and of the matrixC
. The value of m must be at least zero.- n
Specifies the number of columns of the matrix
op(B)
and the number of columns of the matrixB
. The value of n must be at least zero.- k
Specifies the number of columns of the matrix
op(A)
and the number of rows of the matrixop(B)
. The value of k must be at least zero.- alpha
Scaling factor for the matrix-matrix product.
- a
The buffer holding the input matrix
A
. IfA
is not transposed,A
is anm
-by-k
matrix so the arraya
must have size at leastlda
*k
. IfA
is transposed,A
is ank
-by-m
matrix so the arraya
must have size at leastlda
*m
. See Matrix and Vector Storage for more details.- lda
The leading dimension of
A
. Must be at least m ifA
is not transposed, and at least k ifA
is transposed. It must be positive.- b
The buffer holding the input matrix
B
. IfB
is not transposed,B
is ank
-by-n
matrix so the arrayb
must have size at leastldb
*n
. IfB
is transposed,B
is ann
-by-k
matrix so the arrayb
must have size at leastldb
*k
. See Matrix and Vector Storage for more details.- ldb
The leading dimension of
B
. Must be at least k ifB
is not transposed, and at least n ifB
is transposed. It must be positive.- beta
Scaling factor for matrix
C
.- c
The buffer holding the input/output matrix
C
. It must have a size of at least ldc*n. See Matrix and Vector Storage for more details.- ldc
The leading dimension of
C
. It must be positive and at least the size of m.
Output Parameters
- c
The buffer, which is overwritten by
alpha*op(A)*op(B) + beta*C
.
Notes
If beta
= 0, matrix C
does not need to be initialized before
calling gemm
.
gemm (USM Version)¶
Syntax
-
sycl::event
onemkl::blas
::
gemm
(sycl::queue &queue, onemkl::transpose transa, onemkl::transpose transb, std::int64_t m, std::int64_t n, std::int64_t k, Ts alpha, const Ta *a, std::int64_t lda, const Tb *b, std::int64_t ldb, Ts beta, Tc *c, std::int64_t ldc, const sycl::vector_class<sycl::event> &dependencies = {})¶
Input Parameters
- queue
The queue where the routine should be executed.
- transa
Specifies the form of
op(A)
, the transposition operation applied toA
.- transb
Specifies the form of
op(B)
, the transposition operation applied toB
.- m
Specifies the number of rows of the matrix
op(A)
and of the matrixC
. The value of m must be at least zero.- n
Specifies the number of columns of the matrix
op(B)
and the number of columns of the matrixC
. The value of n must be at least zero.- k
Specifies the number of columns of the matrix
op(A)
and the number of rows of the matrixop(B)
. The value of k must be at least zero.- alpha
Scaling factor for the matrix-matrix product.
- a
Pointer to input matrix
A
. IfA
is not transposed,A
is anm
-by-k
matrix so the arraya
must have size at leastlda
*k
. IfA
is transposed,A
is ank
-by-m
matrix so the arraya
must have size at leastlda
*m
. See Matrix and Vector Storage for more details.- lda
The leading dimension of
A
. Must be at least m ifA
is not transposed, and at least k ifA
is transposed. It must be positive.- b
Pointer to input matrix
B
. IfB
is not transposed,B
is ank
-by-n
matrix so the arrayb
must have size at leastldb
*n
. IfB
is transposed,B
is ann
-by-k
matrix so the arrayb
must have size at leastldb
*k
. See Matrix and Vector Storage for more details.- ldb
The leading dimension of
B
. Must be at least k ifB
is not transposed, and at least n ifB
is transposed. It must be positive.- beta
Scaling factor for matrix
C
.- c
The pointer to input/output matrix
C
. It must have a size of at least ldc*n. See Matrix and Vector Storage for more details.- ldc
The leading dimension of
C
. It must be positive and at least the size of m.- dependencies
List of events to wait for before starting computation, if any. If omitted, defaults to no dependencies.
Output Parameters
- c
Pointer to the output matrix, overwritten by
alpha*op(A)*op(B) + beta*C
.
Notes
If beta
= 0, matrix C
does not need to be initialized
before calling gemm
.
Return Values
Output event to wait on to ensure computation is complete.
Parent topic: BLAS Level 3 Routines