Perform a quantized matrix multiplication of `a` by the matrix `b`.
tf.raw_ops.QuantizedMatMul(
a, b, min_a, max_a, min_b, max_b, Toutput=tf.dtypes.qint32, transpose_a=False,
transpose_b=False, Tactivation=tf.dtypes.quint8, name=None
)
The inputs must be two-dimensional matrices, and the inner dimension of `a` (its column count, after being transposed if `transpose_a` is true) must match the outer dimension of `b` (its row count, after being transposed if `transpose_b` is true).
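The shape rule above can be sketched in plain Python. The helper `quantized_matmul_output_shape` is a hypothetical name introduced here for illustration and is not part of TensorFlow; it only mirrors the dimension check described in the text.

```python
def quantized_matmul_output_shape(a_shape, b_shape,
                                  transpose_a=False, transpose_b=False):
    """Return (rows, cols) of the product, or raise ValueError.

    Hypothetical helper: it restates the documented shape rule and does
    not call TensorFlow.
    """
    if len(a_shape) != 2 or len(b_shape) != 2:
        raise ValueError("both inputs must be two-dimensional")

    # Apply the optional transposes before comparing dimensions.
    a_rows, a_cols = (a_shape[1], a_shape[0]) if transpose_a else tuple(a_shape)
    b_rows, b_cols = (b_shape[1], b_shape[0]) if transpose_b else tuple(b_shape)

    # Inner dimension of a must match outer dimension of b.
    if a_cols != b_rows:
        raise ValueError(
            f"inner dimension of a ({a_cols}) does not match "
            f"outer dimension of b ({b_rows})"
        )
    return (a_rows, b_cols)


print(quantized_matmul_output_shape((2, 3), (3, 4)))
print(quantized_matmul_output_shape((2, 3), (4, 3), transpose_b=True))
```

For example, a 2x3 `a` and a 4x3 `b` are only compatible when `transpose_b` is true, because the transpose turns `b` into a 3x4 matrix.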
Args | |
---|---|
`a` | A `Tensor`. Must be one of the following types: `qint8`, `quint8`, `qint32`, `qint16`, `quint16`. Must be a two-dimensional tensor. |
`b` | A `Tensor`. Must be one of the following types: `qint8`, `quint8`, `qint32`, `qint16`, `quint16`. Must be a two-dimensional tensor. |
`min_a` | A `Tensor` of type `float32`. The float value that the lowest quantized `a` value represents. |
`max_a` | A `Tensor` of type `float32`. The float value that the highest quantized `a` value represents. |
`min_b` | A `Tensor` of type `float32`. The float value that the lowest quantized `b` value represents. |
`max_b` | A `Tensor` of type `float32`. The float value that the highest quantized `b` value represents. |
`Toutput` | An optional `tf.DType` from: `tf.qint8`, `tf.quint8`, `tf.qint32`, `tf.qint16`, `tf.quint16`. Defaults to `tf.qint32`. |
`transpose_a` | An optional `bool`. Defaults to `False`. If true, `a` is transposed before multiplication. |
`transpose_b` | An optional `bool`. Defaults to `False`. If true, `b` is transposed before multiplication. |
`Tactivation` | An optional `tf.DType` from: `tf.qint8`, `tf.quint8`, `tf.qint32`, `tf.qint16`, `tf.quint16`. Defaults to `tf.quint8`. The type of output produced by the activation function following this operation. |
`name` | A name for the operation (optional). |
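The `min_*` and `max_*` arguments give the float values that the lowest and highest quantized codes represent. A minimal sketch of that relationship follows, assuming a purely linear mapping between the two endpoints; the function `dequantize_value` and its defaults (the `quint8` code range 0 to 255) are illustrative assumptions, and TensorFlow's own kernels may differ in rounding details.

```python
def dequantize_value(q, min_val, max_val, q_lowest=0, q_highest=255):
    """Map a quantized code to the float it represents.

    Assumption: a linear mapping in which q_lowest represents min_val and
    q_highest represents max_val. The defaults describe a quint8 code range.
    """
    scale = (max_val - min_val) / (q_highest - q_lowest)
    return min_val + (q - q_lowest) * scale


# With min_a = -1.0 and max_a = 1.0, the lowest code maps to the minimum
# and the highest code maps (up to floating-point rounding) to the maximum.
print(dequantize_value(0, -1.0, 1.0))
print(dequantize_value(255, -1.0, 1.0))
```

Under this assumption, a wider `[min, max]` range covers larger values at the cost of a coarser step between adjacent quantized codes.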
Returns | |
---|---|
A tuple of `Tensor` objects (`out`, `min_out`, `max_out`). | |
`out` | A `Tensor` of type `Toutput`. |
`min_out` | A `Tensor` of type `float32`. |
`max_out` | A `Tensor` of type `float32`. |
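The following is a hedged usage sketch showing how the returned tuple can be unpacked. It assumes TensorFlow 2.x in eager mode, uses `tf.quantization.quantize` to produce the quantized inputs, and assumes an input range of [0, 1]; the matrix values are made up for illustration. Note that `tf.raw_ops` functions are called with keyword arguments.

```python
import tensorflow as tf

# Small float matrices with values in [0, 1]; shapes are 2x3 and 3x2.
a_float = tf.constant([[0.0, 0.5, 1.0], [0.25, 0.75, 1.0]], dtype=tf.float32)
b_float = tf.constant([[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]], dtype=tf.float32)

# Quantize to quint8 over the assumed range [0, 1]. Each call returns the
# quantized tensor together with the float range that was actually used.
a, min_a, max_a = tf.quantization.quantize(a_float, 0.0, 1.0, tf.quint8)
b, min_b, max_b = tf.quantization.quantize(b_float, 0.0, 1.0, tf.quint8)

# Multiply the quantized matrices; Toutput is left at its default.
out, min_out, max_out = tf.raw_ops.QuantizedMatMul(
    a=a, b=b, min_a=min_a, max_a=max_a, min_b=min_b, max_b=max_b
)

# out holds the quantized product; min_out and max_out are float32 scalars.
print(out.dtype, out.shape)
print(float(min_out), float(max_out))
```

The `min_out` and `max_out` values need to be kept alongside `out`, since later quantized ops or a dequantization step take them as inputs.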