Simd Library Release Notes (2023).

2024 | 2023 | 2022 | 2021 | 2020 | 2019 | 2018 | 2017 | 2016 | 2015 | 2014 | 2013

December 2, 2023 (version 5.4.132)

Algorithms

New features
  • Support of RGBA-32 input image format in base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function SynetSetInput.
Bug fixing
  • Wrong order of SIMD_DEPRECATED macro.
  • Error in AVX-512BW optimizations of function SynetSoftmaxLayerForward.

Python wrapper

New features
  • Wrapper for function SimdVersion.
  • Wrapper for function SimdRelease.
  • Wrapper for function SimdCpuDesc.
  • Wrapper for function SimdCpuInfo.
  • Wrapper for enumeration SimdCpuDescType.
  • Wrapper for enumeration SimdCpuInfoType.
  • Wrapper for function SimdPerformanceStatistic.
  • Wrapper for function SimdAllocate.
  • Wrapper for function SimdFree.
  • Wrapper for function SimdAlign.
  • Wrapper for function SimdAlignment.
  • Wrapper for function SimdGetThreadNumber.
  • Wrapper for function SimdSetThreadNumber.
  • Wrapper for function SimdEmpty.
  • Wrapper for function SimdGetFastMode.
  • Wrapper for function SimdSetFastMode.
  • Wrapper for enumeration SimdPixelFormatType.
  • Class Simd.Image.
  • Wrapper for function SimdCrc32.
  • Wrapper for function SimdCrc32c.
  • Wrapper for enumeration SimdImageFileType.
  • Wrapper for function SimdImageSaveToFile.
  • Wrapper for function SimdImageLoadFromFile.
  • Wrapper for enumeration Simd::View::Position.
  • Method Simd.Image.Region.
  • Method Simd.Image.RegionAt.
  • Wrapper for enumeration Simd.ResizeMethod.
  • Wrapper for enumeration Simd.ResizeChannel.
  • Wrapper for function SimdResizerInit.
  • Wrapper for function SimdResizerRun.
  • Function Simd.Resize.
  • Function Simd.Resized.
  • Wrapper for function SimdSynetSetInput.
  • Function Simd.SynetSetInput.
  • Wrapper for enumeration Simd.TensorFormat.
  • Wrapper for enumeration Simd.TensorData.
  • Wrapper for enumeration Simd.YuvType.
  • Wrapper for function SimdFillPixel.
  • Function Simd.FillPixel.

Infrastructure

New features
  • SimdPy MSVS project.

Documentation

New features
  • Doxygen generation of documentation for Python wrapper.

November 1, 2023 (version 5.3.131)

Algorithms

New features
  • NEON optimizations of function DescrIntCosineDistance.
  • NEON optimizations of function DescrIntCosineDistancesMxNa.
  • NEON optimizations of function DescrIntCosineDistancesMxNp.
Improving
  • NEON optimizations of function DescrIntDecode32f.
  • NEON optimizations of function CorrelationSum.
  • Base implementation and SSE4.1 optimizations of ImageJpegLoader class.
Bug fixing
  • Error in definition of SIMD_CPP_2011_ENABLE macro for Visual Studio.
  • Error in definition of SIMD_CPP_2014_ENABLE macro for Visual Studio.
  • Error in definition of SIMD_CPP_2017_ENABLE macro for Visual Studio.
  • Compiler warning in method Detection::InitLevels.
  • Compiler warning in method Detection::FillLevels.

Infrastructure

New features
  • CMake SIMD_UNINSTALL option.
  • CMake SIMD_INSTALL option.

Usage examples

New features
  • An example of how to use Simd::ImageMatcher to find image duplicates.

October 2, 2023 (version 5.3.130)

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function BgrToYuv420pV2.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function BgrToYuv422pV2.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function BgrToYuv444pV2.
  • Parameter yuvType in function Simd::BgrToYuv420p.
  • Parameter yuvType in function Simd::BgrToYuv422p.
  • Parameter yuvType in function Simd::BgrToYuv444p.
  • The mark of function SimdBgrToYuv420p as deprecated.
  • The mark of function SimdBgrToYuv422p as deprecated.
  • The mark of function SimdBgrToYuv444p as deprecated.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function Yuv420pToRgbV2.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function Yuv422pToRgbV2.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function Yuv444pToRgbV2.
  • Parameter yuvType in function Simd::Yuv420pToRgb.
  • Parameter yuvType in function Simd::Yuv422pToRgb.
  • Parameter yuvType in function Simd::Yuv444pToRgb.
  • The mark of function SimdYuv420pToRgb as deprecated.
  • The mark of function SimdYuv422pToRgb as deprecated.
  • The mark of function SimdYuv444pToRgb as deprecated.
  • NEON optimizations of function AlphaBlendingBgraToYuv420p.
  • NEON optimizations of function DescrIntEncode32f.
  • NEON optimizations of function DescrIntEncode16f.
  • NEON optimizations of function DescrIntDecode32f.
  • NEON optimizations of function DescrIntDecode16f.
Bug fixing
  • Error in AVX-512BW optimizations of function SynetSoftmaxLayerForward.
  • Error in AVX2 optimizations of class ResizerByteArea2x2 (internal buffer overflow).
  • Error in function Simd::BgraToYuv420p.
  • Error in function Simd::BgraToYuv422p.
  • Error in function Simd::BgraToYuv444p.
  • Error in NEON optimizations of class MergedConvolution32fCd.

Test framework

New features
  • Tests for verifying functionality of function BgrToYuv420pV2.
  • Tests for verifying functionality of function BgrToYuv422pV2.
  • Tests for verifying functionality of function BgrToYuv444pV2.
  • Tests for verifying functionality of function Yuv420pToRgbV2.
  • Tests for verifying functionality of function Yuv422pToRgbV2.
  • Tests for verifying functionality of function Yuv444pToRgbV2.

September 4, 2023 (version 5.3.129)

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function Yuv420pToBgrV2.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function Yuv422pToBgrV2.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function Yuv444pToBgrV2.
  • Parameter yuvType in function Simd::Yuv420pToBgr.
  • Parameter yuvType in function Simd::Yuv422pToBgr.
  • Parameter yuvType in function Simd::Yuv444pToBgr.
  • The mark of function SimdYuv420pToBgr as deprecated.
  • The mark of function SimdYuv422pToBgr as deprecated.
  • The mark of function SimdYuv444pToBgr as deprecated.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function Yuv422pToBgraV2.
  • Parameter yuvType in function Simd::Yuv420pToBgra.
  • Parameter yuvType in function Simd::Yuv422pToBgra.
  • Parameter yuvType in function Simd::Yuv444pToBgra.
  • The mark of function SimdYuv420pToBgra as deprecated.
  • The mark of function SimdYuv422pToBgra as deprecated.
  • The mark of function SimdYuv444pToBgra as deprecated.
  • Parameter yuvType in function Simd::BgraToYuv420p.
  • Parameter yuvType in function Simd::BgraToYuv422p.
  • Parameter yuvType in function Simd::BgraToYuv444p.
  • Parameter yuvType in function Simd::BgraToYuva420p.
  • The mark of function SimdBgraToYuv420p as deprecated.
  • The mark of function SimdBgraToYuv422p as deprecated.
  • The mark of function SimdBgraToYuv444p as deprecated.
  • The mark of function SimdBgraToYuva420p as deprecated.
  • The mark of function SimdResizeBilinear as deprecated.
  • The mark of function Simd::ResizeBilinear as deprecated.
  • The mark of function Simd::ResizeAreaGray as deprecated.
  • The mark of function Simd::ResizeArea as deprecated.
  • The mark of function Simd::InterferenceIncrement as deprecated.
  • The mark of function Simd::InterferenceIncrementMasked as deprecated.
  • The mark of function Simd::InterferenceDecrement as deprecated.
  • The mark of function Simd::InterferenceDecrementMasked as deprecated.
  • The mark of function SimdSynetFusedLayerForward0 as deprecated.
  • The mark of function SimdSynetFusedLayerForward1 as deprecated.
  • The mark of function SimdSynetFusedLayerForward2 as deprecated.
  • The mark of function SimdSynetFusedLayerForward3 as deprecated.
  • The mark of function SimdSynetFusedLayerForward4 as deprecated.
  • The mark of function SimdSynetFusedLayerForward8 as deprecated.
  • The mark of function SimdSynetFusedLayerForward9 as deprecated.
Bug fixing
  • Error in NEON optimizations of Resizer engine.
  • Out-of-bounds read of input array in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntDecode32f.
  • Out-of-bounds read of input array in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntDecode16f.
  • Out-of-bounds read of input array in Base implementation, SSE4.1, AVX2 optimizations of function DescrIntCosineDistance.
  • Out-of-bounds read of input array in Base implementation, SSE4.1, AVX2 optimizations of function DescrIntCosineDistancesMxNa.
  • Out-of-bounds read of input array in Base implementation, SSE4.1, AVX2 optimizations of function DescrIntCosineDistancesMxNp.
  • Error in AVX-512BW optimizations of function DescrIntEncode32f.
  • Error in AVX-512BW optimizations of function DescrIntEncode16f.
  • Compiler error in function Simd::ResizeArea.
  • Error in SSE4.1, AVX, AVX2, AVX-512BW, NEON optimizations of class MergedConvolution32fCd.

Test framework

New features
  • Tests for verifying functionality of function Yuv420pToBgrV2.
  • Tests for verifying functionality of function Yuv422pToBgrV2.
  • Tests for verifying functionality of function Yuv444pToBgrV2.
  • Tests for verifying functionality of function Yuv422pToBgraV2.
  • Special test for verifying functionality of function Simd::ResizeAreaGray.
  • Special test for verifying functionality of function Simd::ResizeArea.

August 1, 2023 (version 5.3.128)

Algorithms

New features
  • Support of SimdCpuInfoRam in function SimdCpuInfo.
  • Support of SimdCpuInfoRam in function Simd::PrintInfo.
  • Base implementation of function SimdCpuDesc.
  • Base implementation of SynetGridSample2dRef class.
  • Base implementation, SSE4.1, AVX2 optimizations of SynetGridSample2d32fBlZ class.
Bug fixing
  • Error in AVX-512VNNI optimizations of class SynetMergedConvolution8iCdc.
  • Error in AVX-512VNNI optimizations of class SynetMergedConvolution8iCd.
  • Error in AVX-512VNNI optimizations of class SynetMergedConvolution8iDc.
  • Error (assert) in Base implementation of class ResizerNearest.
  • Error in SSE4.1, AVX, AVX2, AVX-512BW optimizations of class SynetMergedConvolution32fCd.

Test framework

New features
  • Tests for verifying functionality of SynetGridSample2d engine.
Improving
  • WIN32 performance report.

Infrastructure

New features
  • GitHub Actions script for CMake (build and test for GCC-13 (instead of GCC-12), Linux).

Documentation

Bug fixing
  • Wrong description of function SimdDescrIntInit.

July 4, 2023 (version 5.3.127)

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntEncode16f.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntDecode16f.
  • Support of 4-bit and 5-bit depth in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntEncode32f.
  • Support of 4-bit and 5-bit depth in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntEncode16f.
  • Support of 4-bit and 5-bit depth in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntDecode32f.
  • Support of 4-bit and 5-bit depth in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntDecode16f.
  • Support of 4-bit and 5-bit depth in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntCosineDistance.
  • Support of 4-bit and 5-bit depth in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI optimizations of function DescrIntCosineDistancesMxNp.
  • Support of 4-bit and 5-bit depth in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI optimizations of function DescrIntCosineDistancesMxNa.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function SynetNormalizeLayerForwardV3.
Improving
  • SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntCosineDistancesMxNp for 4-, 5-, 6-, and 7-bit depth.
  • SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntCosineDistancesMxNa for 4-, 5-, 6-, and 7-bit depth.
Bug fixing
  • Compiler error in file SimdYuvToBgr.h.
Renaming
  • Function DescrIntEncode to DescrIntEncode32f.
  • Function DescrIntDecode to DescrIntDecode32f.

Test framework

New features
  • Tests for verifying functionality of function DescrIntEncode16f.
  • Tests for verifying functionality of function DescrIntDecode16f.
  • Tests for verifying functionality of function SynetNormalizeLayerForwardV3.
Improving
  • WIN32 exception handling.

Infrastructure

Improving
  • Host Properties step in GitHub Actions script for MSBuild.

June 5, 2023 (version 5.3.126)

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntEncode.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntDecode.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntCosineDistance.
  • Base implementation of function DescrIntVectorNorm.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntCosineDistancesMxNp.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function DescrIntCosineDistancesMxNa.
  • SimdSynetUnaryOperation32fRcp member in enumeration SimdSynetUnaryOperationType.
  • Support of SimdSynetUnaryOperation32fRcp in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function SynetUnaryOperation32f.
  • SimdSynetUnaryOperation32fNot member in enumeration SimdSynetUnaryOperationType.
  • Support of SimdSynetUnaryOperation32fNot in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function SynetUnaryOperation32f.
  • Helper function Simd::InvertAffineTransform.
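
For readers who want to see what inverting an affine transform involves, here is a minimal sketch. It assumes a 2x3 matrix stored row-major as {a, b, tx, c, d, ty}; the name InvertAffine and this layout are illustrative assumptions, not the signature of Simd::InvertAffineTransform.

```cpp
#include <array>
#include <cassert>
#include <cmath>

// Hypothetical helper (not the Simd API): inverts a 2x3 affine matrix stored
// row-major as {a, b, tx, c, d, ty}, which maps (x, y) to
// (a*x + b*y + tx, c*x + d*y + ty).
inline std::array<double, 6> InvertAffine(const std::array<double, 6>& m)
{
    const double det = m[0] * m[4] - m[1] * m[3];
    assert(std::fabs(det) > 1e-12); // a singular transform has no inverse

    // Inverse of the 2x2 linear part.
    const double ia = m[4] / det, ib = -m[1] / det;
    const double ic = -m[3] / det, id = m[0] / det;

    // The inverse translation is -(A^-1 * t).
    return { ia, ib, -(ia * m[2] + ib * m[5]),
             ic, id, -(ic * m[2] + id * m[5]) };
}
```

Applying the result to a point produced by the original matrix should return the starting point, up to rounding.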
Improving
  • SSE4.1, AVX, AVX-512BW optimizations of method SynetDeconvolution32fGemmNN::RowToImg.
  • AVX-512BW optimizations of function SynetUnaryOperation32f (case of SimdSynetUnaryOperation32fLog, SimdSynetUnaryOperation32fExp).
  • AVX-512BW optimizations of function SynetSoftplus32f.
Bug fixing
  • Error in AVX2 optimizations of function SynetSoftmaxLayerForward.
  • GCC compiler error in file SimdDrawing.hpp (Windows, MinGW).
  • GCC compiler error in function Test::FileExists (Windows, MinGW).
  • Crash in function SimdSynetDeconvolution32fForward (Linux, GCC-12, GCC-13).
  • Crash in function Base::AlgCacheL3 (Windows, MinGW).
  • Use of _WIN32 macro instead of WIN32.

Test framework

New features
  • Tests for verifying functionality of function DescrIntEncode.
  • Tests for verifying functionality of function DescrIntDecode.
  • Tests for verifying functionality of function DescrIntCosineDistance.
  • Tests for verifying functionality of function DescrIntCosineDistancesMxNp.
  • Tests for verifying functionality of function DescrIntCosineDistancesMxNa.
  • Test command line argument '-cc' to run CheckCpp tests.
Bug fixing
  • Test log messages after test error stoppage (multithreaded run).
  • Error in CheckCpp tests.

Infrastructure

New features
  • GitHub Actions script for CMake (build and test for MinGW, Windows).
  • GitHub Actions script for CMake (build and test for GCC-12, Linux).

May 1, 2023 (version 5.2.125)

Algorithms

New features
  • AVX-512BW, NEON optimizations of function SynetGelu32f.
  • SimdConvolutionActivationGelu member in enumeration SimdConvolutionActivationType.
  • Support of SynetUnaryOperation32fErf in NEON optimizations of function SynetUnaryOperation32f.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetConvolution32fGemmNN.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetConvolution32fWinograd.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetConvolution32fGemmNT.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetConvolution32fDirectNchw.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetConvolution32fDirectNhwc.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetConvolution32fDepthwiseDotProduct.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetConvolution32fNhwcDirect.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16, AMX-BF16 optimizations of class SynetConvolution32fBf16Gemm.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16, AMX-BF16 optimizations of class SynetConvolution32fBf16Nhwc.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetDeconvolution32fGemmNN.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of class SynetDeconvolution32fNhwcDirect2x2.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8, NEON optimizations of class SynetConvolution8iGemmNN.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8, NEON optimizations of class SynetConvolution8iNhwcDirect.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8, NEON optimizations of class SynetConvolution8iNhwcDepthwise.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX, AVX2, AVX-512BW, NEON optimizations of class SynetMergedConvolution32fCd.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX, AVX2, AVX-512BW, NEON optimizations of class SynetMergedConvolution32fCdc.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX, AVX2, AVX-512BW, NEON optimizations of class SynetMergedConvolution32fDc.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Cd.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Cdc.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Dc.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetMergedConvolution8iCd.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetMergedConvolution8iCdc.
  • Support of SimdConvolutionActivationGelu in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetMergedConvolution8iDc.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function SynetNormalizeLayerForwardV2.
Improving
  • SSE4.1, AVX2, AVX-512BW, NEON optimizations of function Erf.
  • Performance of function Simd::Parallel.
  • Use of resize method SimdResizeMethodArea in ImageMatcher::Create (more precise than SimdResizeMethodBilinear).
Bug fixing
  • Compiler error in SimdImageMatcher.hpp (using of internal functions).
  • Wrong API of function ImageMatcher.Skip().

Test framework

New features
  • Tests for verifying functionality of function SynetNormalizeLayerForwardV2.

Infrastructure

Bug fixing
  • GitHub Actions script for MSBuild (Restore NuGet packages timeout exit).

April 3, 2023 (version 5.2.124)

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function BgraToYuv422pV2.
  • NEON optimizations of function BgraToYuv444pV2.
  • NEON optimizations of function BgraToYuv420pV2.
  • NEON optimizations of function Float32ToBFloat16.
  • NEON optimizations of function BFloat16ToFloat32.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of function BgraToYuva420pV2.
  • Support of SynetUnaryOperation32fErf in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function SynetUnaryOperation32f.
  • Base implementation, SSE4.1, AVX2 optimizations of function SynetGelu32f.
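
As a point of reference for the GELU and Erf items above, here is a scalar sketch of the erf-based GELU definition, gelu(x) = x * (1 + erf(x / sqrt(2))) / 2. The names GeluRef and GeluRefArray are hypothetical, and the sketch is only assumed to match what SynetGelu32f computes; the library's optimized kernels may use approximations with slightly different rounding.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

// Scalar reference (hypothetical name): erf-based GELU,
// gelu(x) = x * (1 + erf(x / sqrt(2))) / 2.
inline float GeluRef(float x)
{
    const float invSqrt2 = 0.70710678f; // 1 / sqrt(2)
    return 0.5f * x * (1.0f + std::erf(x * invSqrt2));
}

// Hypothetical array form; the parameter order is illustrative only.
inline void GeluRefArray(const float* src, std::size_t size, float* dst)
{
    assert(src != nullptr && dst != nullptr);
    for (std::size_t i = 0; i < size; ++i)
        dst[i] = GeluRef(src[i]);
}
```

A scalar reference like this is mainly useful for checking vectorized results within a tolerance.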
Improving
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function SynetSoftmaxLayerForward.
Bug fixing
  • Error in method View::ToOcv.
Removing
  • Support of all formats besides NHWC and NCHW for function SynetAddBias.
  • Support of all formats besides NHWC and NCHW for function SynetLrnLayerCrossChannels.
  • Support of all formats besides NHWC and NCHW for function SynetPreluLayerForward.
  • Support of all formats besides NHWC and NCHW for function SynetScaleLayerForward.
  • Support of all formats besides NHWC and NCHW for function SynetFusedLayerForward0.
  • Support of all formats besides NHWC and NCHW for function SynetFusedLayerForward1.
  • Support of all formats besides NHWC and NCHW for function SynetFusedLayerForward2.
  • Support of all formats besides NHWC and NCHW for function SynetFusedLayerForward3.
  • Support of all formats besides NHWC and NCHW for function SynetFusedLayerForward4.
  • Support of all formats besides NHWC and NCHW for function SynetFusedLayerForward8.
  • Support of all formats besides NHWC and NCHW for function SynetFusedLayerForward9.
  • Function SynetReorderFilter.
  • Function SynetReorderImage.
  • Function SynetTensorAlignment.
  • Function SynetSpecifyTensorFormat.
  • Support of all formats besides NHWC and NCHW for enumeration SimdTensorFormatType.
Renaming
  • Function from SynetUnaryOperation32fLayerForward to SynetUnaryOperation32f.

Test framework

New features
  • Test command line argument '-ts' to print statistics of time of tests execution.
  • Tests for verifying functionality of function BgraToYuv422pV2.
  • Tests for verifying functionality of function BgraToYuva420pV2.
  • Improved header of performance report.
  • Tests for verifying functionality of function SynetGelu32f.
Bug fixing
  • Error in test SynetUnaryOperation32fLayerForward.

March 9, 2023 (version 5.2.123)

Algorithms

Bug fixing
  • MSVS-2022 compiler errors in AmxBf16 project.
  • Clang compiler error in method Array::Release.
  • MSVS-2022 compiler warnings in file SimdSse41ResizerNearest.cpp.
  • MSVS-2022 compiler warnings in file SimdAvx512bwResizerNearest.cpp.
  • Error in SSE4.1, AVX, AVX2, AVX-512BW kernels of ConvolutionNhwcDirect_2 (fixed type).
  • MSVS-2022 compiler warnings in file SimdAvx2RecursiveBilateralFilter.cpp.
  • Error in file SimdFrame.hpp (function Simd::Convert).
  • Error in AVX2 optimizations of class RecursiveBilateralFilterFast (x86 only).
  • MSVS-2022 compiler error in file SimdInit.h (ARM64).
  • MSVS-2022 compiler errors in file SimdLog.h (ARM64).
  • MSVS-2022 compiler errors in file SimdConversion.h (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonYuvToHue.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonAbsDifferenceSum.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonDetection.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonHog.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonLaplace.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonMeanFilter3x3.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonNeural.cpp (ARM64).
  • MSVS-2022 compiler warnings in file SimdNeonNeuralConvolution.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonSobel.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonSynetConversion.cpp (ARM64).
  • MSVS-2022 compiler errors in file SimdNeonSynetConvolution8i.cpp (ARM64).
  • Wrong assert in AVX-512BW optimization of function BgraToYuv420pV2.
  • MSVS-2017 compiler errors in AVX-512BW optimizations of function AlphaBlendingBgraToYuv420p.
  • MSVS-2017 compiler errors in AVX-512BW optimizations of WarpAffine engine.
  • MSVS-2015 compiler errors in file SimdFmadd.h (Win32).
  • MSVS-2015 compiler errors in SSE4.1 and AVX2 optimizations of WarpAffine engine (Win32).
  • Crash in AVX2 optimizations of function CosineDistance16f.
  • Crash in SSE4.1, AVX2, AVX-512BW, NEON optimizations of function HogLiteResizeFeatures.
  • Clang compiler warnings in file SimdBaseRecursiveBilateralFilter.cpp.
  • Clang compiler warnings in file SimdSse41RecursiveBilateralFilter.cpp.
  • Clang linker error in method Motion::Detector::GenerateSearchRegionScanlines.
  • Crashes in AVX-512BW optimizations of WarpAffine engine (MSVS-2022, Release).
  • Internal compiler error in file SimdAvx512bwSynetConvolution32f.cpp (MSVS-2017, Release).
  • Error in SSE4.1, AVX2 optimizations of function BgraToYuv444pV2 (MSVS-2015, Release, Win32).
  • Error in SSE4.1, AVX2 optimizations of function BgraToYuv420pV2 (MSVS-2015, Release, Win32).
  • Error in AVX2 optimizations of function Yuva444pToBgraV2 (MSVS-2015, Release, Win32).
  • Error in AVX2 optimizations of function Yuv444pToBgraV2 (MSVS-2015, Release, Win32).
  • Error in AVX2 optimizations of function Yuv420pToBgraV2 (MSVS-2015, Release, Win32).
  • Error in AVX2 optimizations of function AlphaBlendingBgraToYuv420p (MSVS-2015, Release, Win32).
  • Error in AVX2 optimizations of class ResizerByteArea2x2 (MSVS-2015, Release, Win32).
  • Error in AVX2 optimizations of function Uyvy422ToBgr (MSVS-2015, Release, Win32).

Test framework

New features
  • Handling of Windows exceptions in AutoTest.
Bug fixing
  • Error in test Nv12SaveAsJpegToMemoryAutoTest.
  • Error in test SynetAdd8iAutoTest.
  • Error in test SynetConvolution8iForwardAutoTest.
  • Error in test SynetScale8iForwardAutoTest.
  • Error in test WarpAffineAutoTest.
  • Error in test ResizeBilinearAutoTest.
  • Error in test SynetMergedConvolution8iForwardAutoTest.
  • Error in function MakeAutoTests (multithreaded environment).
  • Error in test Float32ToBFloat16AutoTest.
  • Error in test SynetConvert32fTo8uAutoTest (MSVS-2015 and MSVS-2017, Release, Win32).
  • Error in test SynetMergedConvolution32fForwardAutoTest (MSVS-2015 and MSVS-2017, Release, Win32).
  • Error in test CosineDistancesMxNp16fAutoTest.
  • Error in test VectorNormNp16fAutoTest.

Infrastructure

New features
  • Script BuildAll.cmd to build MSVS solution for all configurations and platforms.
  • GitHub Actions script for CMake (build and test for x86_x64, Linux).
  • GitHub Actions script for CMake (cross platform build for ARM, Linux).
  • GitHub Actions script for CMake (cross platform build for ARM64, Linux).
  • GitHub Actions script for CMake (cross platform build for PowerPC, Linux).
  • GitHub Actions script for CMake (build and test for clang, Linux).
  • GitHub Actions script for MSBuild (build and test for Visual Studio 2022, Windows).
  • Script GetThreadCount.cmd.
  • GitHub Actions script for MSBuild (build and test for Visual Studio 2019, Windows).
  • GitHub Actions script for MSBuild (build and test for Visual Studio 2017, Windows).
  • GitHub Actions script for MSBuild (build and test for Visual Studio 2015, Windows).
Renaming
  • Script TestVisualStudio.cmd to TestAll.cmd.

Documentation

Bug fixing
  • Wrong description of CMake parameters.

February 1, 2023 (version 5.2.122)

Algorithms

New features
  • New API of function Avx512bw::TileZero (AMX emulation).
  • New API of function Avx512bw::TileLoad (AMX emulation).
  • New API of function Avx512bw::TileStore (AMX emulation).
  • New API of function Avx512bw::TileMatMulBf16 (AMX emulation).
  • New API of function Avx512bw::TileMatMul8u8i (AMX emulation).
  • Function Avx512bw::TileMatMulFp16 (AMX emulation).
  • The mark of function SimdInterferenceIncrement as deprecated.
  • The mark of function SimdInterferenceIncrementMasked as deprecated.
  • The mark of function SimdInterferenceDecrement as deprecated.
  • The mark of function SimdInterferenceDecrementMasked as deprecated.
  • The mark of function SimdSynetReorderImage as deprecated.
  • The mark of function SimdSynetReorderFilter as deprecated.
  • SimdTensorData16f (16-bit floating point) tensor type.
  • The mark of function SimdSynetSpecifyTensorFormat as deprecated.
  • The mark of function SimdSynetTensorAlignment as deprecated.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of SynetPermute engine.
  • NEON optimizations of function Yuva444pToBgraV2.
  • NEON optimizations of function AlphaBlending2x.
  • SSE4.1, AVX2, AVX-512BW optimizations of function BgraToYuv444pV2.
  • SSE4.1, AVX2, AVX-512BW optimizations of function BgraToYuv420pV2.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function AlphaBlendingBgraToYuv420p.
  • Parameter 'copy' to View::Capture.
  • Method Array::Release.
Improving
  • NEON optimizations of function TransformImage.
Bug fixing
  • Clang compiler error in function Simd::WarpAffine.
  • MSVS-2022 compiler warnings in file SimdBaseRecursiveBilateralFilter.cpp.
  • MSVS-2022 compiler warnings in file SimdSse41RecursiveBilateralFilter.cpp.
  • MSVS-2015 compiler error in file SimdAvx2RecursiveBilateralFilter.cpp.
  • Error in method MergConvParam32f::Valid.
  • Crash in constructor of Simd::TileConf.
  • Crash in AVX and AVX2 optimizations of function SynetInnerProductLayerForward.
  • MSVS-2022 compiler error in file SimdAvx2RecursiveBilateralFilter.cpp (Win32 target).
  • GCC compiler error in file SimdParallel.hpp (for AVX2 optimizations).

Test framework

New features
  • Tests for verifying functionality of SynetPermute engine.
  • Tests for verifying functionality of function AlphaBlendingBgraToYuv420p.
Bug fixing
  • Crash in test GaussianBlurAutoTest.

Infrastructure

New features
  • Install target in CMake.
  • Uninstall target in CMake.
Renaming
  • Project Amx to AmxBf16.

January 3, 2023 (version 5.2.121)

Algorithms

New features
  • SIMD_DEPRECATED macro.
  • The mark of function SimdSvmSumLinear as deprecated.
  • SSE4.1, AVX2, AVX-512BW optimizations of function SynetNormalizeLayerForward.
  • Enumeration SimdWarpAffineFlags.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of class WarpAffineNearest.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of class WarpAffineBilinear.
  • Multi-threaded optimizations of class WarpAffineNearest.
  • Multi-threaded optimizations of class WarpAffineBilinear.
  • Function Simd::WarpAffine.
  • Function Simd::Mean.
  • Function Simd::OtsuThreshold.
  • Function Simd::RecursiveBilateralFilter.
  • The mark of function SimdEdgeBackgroundGrowRangeSlow as deprecated.
  • The mark of function SimdEdgeBackgroundGrowRangeFast as deprecated.
  • The mark of function SimdEdgeBackgroundIncrementCount as deprecated.
  • The mark of function SimdEdgeBackgroundAdjustRange as deprecated.
  • The mark of function SimdEdgeBackgroundAdjustRangeMasked as deprecated.
  • The mark of function SimdEdgeBackgroundShiftRange as deprecated.
  • The mark of function SimdEdgeBackgroundShiftRangeMasked as deprecated.
  • The mark of function Simd::EdgeBackgroundGrowRangeSlow as deprecated.
  • The mark of function Simd::EdgeBackgroundGrowRangeFast as deprecated.
  • The mark of function Simd::EdgeBackgroundIncrementCount as deprecated.
  • The mark of function Simd::EdgeBackgroundAdjustRange as deprecated.
  • The mark of function Simd::EdgeBackgroundAdjustRangeMasked as deprecated.
  • The mark of function Simd::EdgeBackgroundShiftRange as deprecated.
  • The mark of function Simd::EdgeBackgroundShiftRangeMasked as deprecated.
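
For readers unfamiliar with the technique behind Simd::OtsuThreshold, here is a minimal sketch of Otsu's method on a 256-bin grayscale histogram. The name OtsuThresholdRef and its signature are hypothetical; they do not reproduce the Simd API, which may take an image and return a differently defined value.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical helper (not the Simd API): returns the threshold t in [0, 255]
// that maximizes the between-class variance, where pixels with value <= t
// form the first class and the remaining pixels form the second.
inline int OtsuThresholdRef(const std::array<std::uint32_t, 256>& histogram)
{
    std::uint64_t total = 0;
    double sumAll = 0.0;
    for (int i = 0; i < 256; ++i)
    {
        total += histogram[i];
        sumAll += double(i) * histogram[i];
    }
    assert(total > 0); // an empty histogram has no meaningful threshold

    std::uint64_t countLow = 0;
    double sumLow = 0.0, best = -1.0;
    int threshold = 0;
    for (int t = 0; t < 256; ++t)
    {
        countLow += histogram[t];
        sumLow += double(t) * histogram[t];
        const std::uint64_t countHigh = total - countLow;
        if (countLow == 0 || countHigh == 0)
            continue; // one class is empty, so the variance is undefined
        const double diff = sumLow / countLow - (sumAll - sumLow) / countHigh;
        // Proportional to the between-class variance (constant factor dropped).
        const double variance = double(countLow) * double(countHigh) * diff * diff;
        if (variance > best)
        {
            best = variance;
            threshold = t;
        }
    }
    return threshold;
}
```

For a histogram with two well-separated peaks, the returned threshold falls between them.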
Bug fixing
  • Wrong assert in AVX-512BW optimizations of function BgrToRgb.
  • MSVS compiler bug (Windows, Arm64).
  • Error in function Simd::DrawLine.

Test framework

New features
  • Tests for verifying functionality of WarpAffine engine.
  • Special tests for verifying functionality of WarpAffine engine.

Infrastructure

New features
  • SIMD_OPENCV CMake option to test Simd with OpenCV support.

Documentation

Improving
  • Usage example in description of function RecursiveBilateralFilterInit.