Add accelerated functions used in Synet Framework. More...

Functions
SIMD_API void *	SimdSynetAdd16bInit (const size_t aShape, size_t aCount, SimdTensorDataType aType, const size_t bShape, size_t bCount, SimdTensorDataType bType, SimdTensorDataType dstType, SimdTensorFormatType format)
	Initilizes add algorithm. More...

SIMD_API void	SimdSynetAdd16bForward (void context, const uint8_t a, const uint8_t b, uint8_t dst)
	Performs forward propagation of add algorithm. More...

SIMD_API void	SimdSynetAddBias (const float bias, size_t channels, size_t spatial, float dst, SimdTensorFormatType format)
	Adds a bias to given vector. More...

SIMD_API void	SimdSynetAdd8i (const uint8_t aData, const float aScale, const float aShift, const uint8_t bData, const float bScale, const float bShift, uint8_t cData, const float cScale, const float *cShift, size_t batch, size_t channels, size_t spatial, SimdTensorFormatType format, SimdSynetCompatibilityType compatibility)
	Adds two INT8 tensors. More...

Detailed Description

Add accelerated functions used in Synet Framework.

Function Documentation

◆ SimdSynetAdd16bInit()

void * SimdSynetAdd16bInit	(	const size_t *	aShape,
		size_t	aCount,
		SimdTensorDataType	aType,
		const size_t *	bShape,
		size_t	bCount,
		SimdTensorDataType	bType,
		SimdTensorDataType	dstType,
		SimdTensorFormatType	format
	)

Initilizes add algorithm.

Parameters

[in]	aShape	- a pointer to shape of input A tensor.
[in]	aCount	- a count of dimensions of input A tensor.
[in]	aType	- a type of input A tensor. Can be FP32 of BF16.
[in]	bShape	- a pointer to shape of input B tensor.
[in]	bCount	- a count of dimensions of input B tensor.
[in]	bType	- a type of input B tensor. Can be FP32 of BF16.
[in]	dstType	- a type of output tensor. Can be FP32 of BF16.
[in]	format	- a format of input / output tensors.

Returns: a pointer to add context. On error it returns NULL. It must be released with using of function SimdRelease. This pointer is used in function SimdSynetAdd16bForward.

◆ SimdSynetAdd16bForward()

void SimdSynetAdd16bForward	(	void *	context,
		const uint8_t *	a,
		const uint8_t *	b,
		uint8_t *	dst
	)

Performs forward propagation of add algorithm.

Parameters

[in]	context	- a pointer to add context. It must be created by function SimdSynetAdd16bInit and released by function SimdRelease.
[in]	a	- a pointer to input A tensor.
[in]	b	- a pointer to input B tensor.
[out]	dst	- a pointer to output tensor.

◆ SimdSynetAddBias()

void SimdSynetAddBias	(	const float *	bias,
		size_t	channels,
		size_t	spatial,
		float *	dst,
		SimdTensorFormatType	format
	)

Adds a bias to given vector.

Algorithm's details (example for NCHW tensor format):

for(c = 0; c < channels; ++c)
    for(j = 0; j < spatial; ++j)
         dst[c*spatial + s] += bias[c];

Note: This function is used in Synet Framework.

Parameters

[in]	bias	- a pointer to the 32-bit float array with bias coefficients. The size of the array is eqial to channels.
[in]	channels	- a number of channels in the image tensor.
[in]	spatial	- a spatial size of image tensor.
[in,out]	dst	- a pointer to cumulative 32-bit image tensor. The size of the array is equal to channels * spatial.
[in]	format	- a format of image tensor.

◆ SimdSynetAdd8i()

void SimdSynetAdd8i	(	const uint8_t *	aData,
		const float *	aScale,
		const float *	aShift,
		const uint8_t *	bData,
		const float *	bScale,
		const float *	bShift,
		uint8_t *	cData,
		const float *	cScale,
		const float *	cShift,
		size_t	batch,
		size_t	channels,
		size_t	spatial,
		SimdTensorFormatType	format,
		SimdSynetCompatibilityType	compatibility
	)

Adds two INT8 tensors.

Algorithm's details (example for NCHW tensor format):

for(b = 0; b < batch; ++b)
    for(c = 0; c < channels; ++c)
        for(s = 0; s < spatial; ++s)
        {
             offs = (b*channels + c)*spatial + s;
             A = aData[offs]*aScale[c] + aShift[c]; 
             B = bData[offs]*bScale[c] + bShift[c];
             cData[offs] = round((A + B)*cScale[c] + cShift[c]);
        }

Note: This function is used in Synet Framework.

Parameters

[in]	aData	- a pointer to the first input 8-bit integer tensor.
[in]	aScale	- a pointer to the 32-bit float array with scale coefficients of the first input tensor.
[in]	aShift	- a pointer to the 32-bit float array with shift coefficients of the first input tensor.
[in]	bData	- a pointer to the second input 8-bit integer tensor.
[in]	bScale	- a pointer to the 32-bit float array with scale coefficients of the second input tensor.
[in]	bShift	- a pointer to the 32-bit float array with shift coefficients of the second input tensor.
[out]	cData	- a pointer to the output 8-bit integer tensor.
[in]	cScale	- a pointer to the 32-bit float array with scale coefficients of the output tensor.
[in]	cShift	- a pointer to the 32-bit float array with shift coefficients of the output tensor.
[in]	batch	- a batch size of input and output image tensors.
[in]	channels	- a number of channels in input and output image tensors.
[in]	spatial	- a spatial size of input and output image tensors.
[in]	format	- a format of input and output image tensors.
[in]	compatibility	- a flags of calculation compatibility.

Simd Library Documentation.

Functions

Detailed Description

Function Documentation

◆ SimdSynetAdd16bInit()

◆ SimdSynetAdd16bForward()

◆ SimdSynetAddBias()

◆ SimdSynetAdd8i()