From 96d6da4e252b06dcfdc041e7df23e86161c33007 Mon Sep 17 00:00:00 2001 From: rihab kouki Date: Tue, 28 Jul 2020 11:24:49 +0100 Subject: Official ARM version: v5.6.0 --- docs/DSP/html/group__CmplxMatrixMult.html | 63 +++++++++++++++++-------------- 1 file changed, 35 insertions(+), 28 deletions(-) (limited to 'docs/DSP/html/group__CmplxMatrixMult.html') diff --git a/docs/DSP/html/group__CmplxMatrixMult.html b/docs/DSP/html/group__CmplxMatrixMult.html index 8e11533..e7a253b 100644 --- a/docs/DSP/html/group__CmplxMatrixMult.html +++ b/docs/DSP/html/group__CmplxMatrixMult.html @@ -32,7 +32,7 @@ Logo

CMSIS-DSP - Version 1.5.2 + Version 1.7.0

CMSIS DSP Software Library

@@ -127,7 +127,12 @@ Functions

Description

Complex Matrix multiplication is only defined if the number of columns of the first matrix equals the number of rows of the second matrix. Multiplying an M x N matrix with an N x P matrix results in an M x P matrix. When matrix size checking is enabled, the functions check: (1) that the inner dimensions of pSrcA and pSrcB are equal; and (2) that the size of the output matrix equals the outer dimensions of pSrcA and pSrcB.

When matrix size checking is enabled, the functions check:

that the inner dimensions of pSrcA and pSrcB are equal;
that the size of the output matrix equals the outer dimensions of pSrcA and pSrcB.

Function Documentation

@@ -161,15 +166,17 @@ Functions

Floating-point, complex, matrix multiplication.

Parameters

- - - + + +

[in]	*pSrcA	points to the first input complex matrix structure
[in]	*pSrcB	points to the second input complex matrix structure
[out]	*pDst	points to output complex matrix structure
[in]	pSrcA	points to first input complex matrix structure
[in]	pSrcB	points to second input complex matrix structure
[out]	pDst	points to output complex matrix structure

Returns: The function returns either ARM_MATH_SIZE_MISMATCH or ARM_MATH_SUCCESS based on the outcome of size checking.

- -

References ARM_MATH_SIZE_MISMATCH, ARM_MATH_SUCCESS, arm_matrix_instance_f32::numCols, arm_matrix_instance_f32::numRows, arm_matrix_instance_f32::pData, and status.

Returns

execution status

ARM_MATH_SUCCESS : Operation successful
ARM_MATH_SIZE_MISMATCH : Matrix size check failed

@@ -211,21 +218,20 @@ Functions

Q15, complex, matrix multiplication.

Parameters

- - - - + + + +

[in]	*pSrcA	points to the first input complex matrix structure
[in]	*pSrcB	points to the second input complex matrix structure
[out]	*pDst	points to output complex matrix structure
[in]	*pScratch	points to the array for storing intermediate results
[in]	pSrcA	points to first input complex matrix structure
[in]	pSrcB	points to second input complex matrix structure
[out]	pDst	points to output complex matrix structure
[in]	pScratch	points to an array for storing intermediate results

Returns: The function returns either ARM_MATH_SIZE_MISMATCH or ARM_MATH_SUCCESS based on the outcome of size checking.

Returns

execution status

ARM_MATH_SUCCESS : Operation successful
ARM_MATH_SIZE_MISMATCH : Matrix size check failed

Conditions for optimum performance: Input, output and state buffers should be aligned by 32-bit

Restrictions: If the silicon does not support unaligned memory access enable the macro UNALIGNED_SUPPORT_DISABLE In this case input, output, scratch buffers should be aligned by 32-bit

Scaling and Overflow Behavior:

: The function is implemented using a 64-bit internal accumulator. The inputs to the multiplications are in 1.15 format and multiplications yield a 2.30 result. The 2.30 intermediate results are accumulated in a 64-bit accumulator in 34.30 format. This approach provides 33 guard bits and there is no risk of overflow. The 34.30 result is then truncated to 34.15 format by discarding the low 15 bits and then saturated to 1.15 format.

: Refer to arm_mat_mult_fast_q15() for a faster but less precise version of this function.

- -

References __SIMD32, __SMUADX(), __SMUSD(), ARM_MATH_SIZE_MISMATCH, ARM_MATH_SUCCESS, arm_matrix_instance_q15::numCols, arm_matrix_instance_q15::numRows, arm_matrix_instance_q15::pData, and status.

Scaling and Overflow Behavior: The function is implemented using an internal 64-bit accumulator. The inputs to the multiplications are in 1.15 format and multiplications yield a 2.30 result. The 2.30 intermediate results are accumulated in a 64-bit accumulator in 34.30 format. This approach provides 33 guard bits and there is no risk of overflow. The 34.30 result is then truncated to 34.15 format by discarding the low 15 bits and then saturated to 1.15 format.

@@ -261,17 +267,18 @@ Functions

Q31, complex, matrix multiplication.

Parameters

- - - + + +

[in]	*pSrcA	points to the first input complex matrix structure
[in]	*pSrcB	points to the second input complex matrix structure
[out]	*pDst	points to output complex matrix structure
[in]	pSrcA	points to first input complex matrix structure
[in]	pSrcB	points to second input complex matrix structure
[out]	pDst	points to output complex matrix structure

Returns: The function returns either ARM_MATH_SIZE_MISMATCH or ARM_MATH_SUCCESS based on the outcome of size checking.

Scaling and Overflow Behavior:

: The function is implemented using an internal 64-bit accumulator. The accumulator has a 2.62 format and maintains full precision of the intermediate multiplication results but provides only a single guard bit. There is no saturation on intermediate additions. Thus, if the accumulator overflows it wraps around and distorts the result. The input signals should be scaled down to avoid intermediate overflows. The input is thus scaled down by log2(numColsA) bits to avoid overflows, as a total of numColsA additions are performed internally. The 2.62 accumulator is right shifted by 31 bits and saturated to 1.31 format to yield the final result.

- -

References ARM_MATH_SIZE_MISMATCH, ARM_MATH_SUCCESS, clip_q63_to_q31(), arm_matrix_instance_q31::numCols, arm_matrix_instance_q31::numRows, arm_matrix_instance_q31::pData, and status.

Returns

execution status

ARM_MATH_SUCCESS : Operation successful
ARM_MATH_SIZE_MISMATCH : Matrix size check failed

Scaling and Overflow Behavior: The function is implemented using an internal 64-bit accumulator. The accumulator has a 2.62 format and maintains full precision of the intermediate multiplication results but provides only a single guard bit. There is no saturation on intermediate additions. Thus, if the accumulator overflows it wraps around and distorts the result. The input signals should be scaled down to avoid intermediate overflows. The input is thus scaled down by log2(numColsA) bits to avoid overflows, as a total of numColsA additions are performed internally. The 2.62 accumulator is right shifted by 31 bits and saturated to 1.31 format to yield the final result.

@@ -280,7 +287,7 @@ Functions