Simd Library Release Notes (2021).

Home | Release Notes | Download | Documentation | Issues | GitHub

2021 | 2020 | 2019 | 2018 | 2017 | 2016 | 2015 | 2014 | 2013

July X, 2021 (version X.X.103)

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW and NEON optimizations of class ResizerShortBilinear.
  • Base implementation, AVX2, AVX-512BW and NEON optimizations of function VectorNormNa16f.
  • Base implementation, AVX2, AVX-512BW and NEON optimizations of function VectorNormNp16f.
  • Parameter of ROI mask in Motion::Model.
  • SSE2, AVX-512BW and NEON optimizations of function AbsDifference.
  • NEON optimizations of function AlphaUnpremultiply.
  • NEON optimizations of function AlphaPremultiply.
Bug fixing
  • Linker warning in file SimdImageLoad.h (MSVS).
Replacing
  • Replace SSE optimizations to SSE2 for function SvmSumLinear.
  • Replace SSE optimizations to SSE2 for function Fill32f.
  • Replace SSE optimizations to SSE2 for function CosineDistance32f.

Tests

New features
  • Tests to verify functionality function of VectorNormNa16f.
  • Tests to verify functionality function of VectorNormNp16f.
Home

June 2, 2021 (version 4.7.102)

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function ValueSquareSums.
Improving
  • Performance of AVX2, AVX-512F and NEON optimizations of SynetConvolution32fGemmNN class.
  • Performance of Neural::FullyConnectedLayer::Forward method.
Bug fixing
  • Error in class SynetMergedConvolution32fDc (large weights case).
  • Compiler error in file SimdAvx2SynetConversion.cpp (MSVS-2015, Win32).
  • Error in SSSE3 optimization of ImageTransform function.
  • Compiler error in file SimdImageSaveJpeg.h (Clang, Mac mini).
  • Compiler warnings (Clang).
  • Error in function ImagePngLoader::ReadTransparency (test tbbn0g04.png).
  • Error in Base implementation, SSE4.1 optimization of class ImagePngLoader (test basn0g16.png).
  • Error in SSE4.1 optimization of class ImagePngLoader (test s02i3p01.png).

Tests

New features
  • Tests to verify functionality function of ValueSquareSums.
Improving
  • Header of performance report table.
Bug fixing
  • Compiler error in file TestFile.h (Clang, Mac mini).
Home

May 3, 2021 (version 4.7.101)

Algorithms

New features
  • Parameter a in function DeinterleaveBgra can be NULL.
  • Simd::DeinterleaveBgra C++ wrapper.
  • Simd::DeinterleaveRgb C++ wrapper.
  • Simd::DeinterleaveRgba C++ wrappers.
  • Method View::Load (from memory).
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of ImageJpegSaver class.
  • Base implementation of ImageJpegLoader class.
  • Base implementation of ImagePngLoader class.
  • NEON optimizations of ImagePngSaver class.
  • SIMD_SYNET_DISABLE macro.
  • Base implementation, AVX2, AVX-512BW, NEON optimizations of function СosineDistancesMxNp16f.
Bug fixing
  • Error in NEON optimizations of function СosineDistancesMxNa16f.

Tests

New features
  • Parameter '-ri' to set real image name in runtime.
  • Tests to verify functionality function of СosineDistancesMxNp16f.
  • Special tests for verifying functionality of function ImageLoadFromMemory.
Bug fixing
  • Error in saving of output log.

Infrastructure

New features
  • Real images to test encoding/decoding algorithms.
  • SIMD_SYNET cmake option.
  • SIMD_HIDE cmake option.
Removing
  • Project files of Microsoft Visual Studio 2017 (for Android).

Documentation

New features
  • Description of Cmake parameters.
Home

April 1, 2021 (version 4.6.100)

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of ImagePngSaver class.
  • SynetInnerProduct32f framework.
  • Base implementation, SSE4.1, AVX, AVX2, AVX-512F optimizations of SynetInnerProduct32fGemm class.
  • Base implementation, SSE4.1, AVX, AVX2, AVX-512F optimizations of SynetInnerProduct32fProd class.
  • Rgba32 format in View structure.
  • Pixel::Rgba32 structure.
  • Simd::RgbToBgr C++ wrapper.
  • Simd::GrayToRgb C++ wrapper.
  • Simd::GrayToRgba C++ wrapper.
  • Simd::BgrToRgba C++ wrapper.
  • Simd::RgbaToRgb C++ wrapper.
  • Base implementation, SSE2, AVX2, AVX-512BW, NEON optimizations of function RgbaToGray.
  • Base implementation, SSSE3, AVX2, AVX-512BW, NEON optimizations of function BgraToRgba.
  • Simd::RgbToRgba C++ wrapper.
  • Simd::RgbaToBgra C++ wrapper.
  • Rgba32 format in Convert function.
  • Rgba32 format in function ImageSave.
Improving
  • Reduce memory allocations in Simd::ContourDetector.
Bug fixing
  • Assert in function Avx::SynetMergedConvolution32fCdc::SynetMergedConvolution32fCdc.
  • Assert in function Avx::SynetMergedConvolution32fCd::SynetMergedConvolution32fCd.
  • Assert in function Avx::SynetMergedConvolution32fDc::SynetMergedConvolution32fDc.
  • Freezes in function SynetConvolution32fNhwcDirect::OldReorderWeight (ARMv7 architecture).
  • Freezes in file SimdGemm.h (ARMv7 architecture).

Tests

New features
  • Tests for verifying functionality of SynetInnerProduct32f framework.
  • Performance report use milliseconds or microseconds (choosing in runtime).
  • Special test to verify functionality function of Simd::Convert.
  • Tests to verify functionality function of RgbaToGray.
  • Tests to verify functionality function of BgraToRgba.
Bug fixing
  • Crash in test BgrToRgbAutoTest.
  • Error in test of SynetMergedConvolution8i.

Infrastructure

Removing
  • Remove project files of Microsoft Visual Studio 2013.
Home

March 1, 2021 (version 4.6.99)

Algorithms

New features
  • SimdImageFileType enumeration.
  • ImageSaveToFile function.
  • ImageSaveToMemory function.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of ImagePgmTxtSaver class.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of ImagePgmBinSaver class.
  • Change order of parameters in function BgrToRgb.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of ImagePpmBinSaver class.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of ImagePpmTxtSaver class.
  • Additional parameters in function View::Save.
  • Method View::Release.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of ImagePgmTxtLoader class.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of ImagePgmBinLoader class.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of ImagePpmTxtLoader class.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of ImagePpmBinLoader class.
  • Additional parameter in function View::Load.
  • Base implementation of Crc32 function.
Bug fixing
  • Crash in Simd::Detection on Python (using of std::unique_ptr).

Tests

New features
  • Possibility to write output video in UseFaceDetection.cpp example.
  • Test parameter '-o=' to write annotated output video.
  • Tests for verifying functionality of function ImageSaveToFile.
  • Tests for verifying functionality of function ImageSaveToMemory.
  • Tests for verifying functionality of function ImageLoadFromMemory.
  • Tests for verifying functionality of function Crc32.

Documentation

New features
  • Example of use into description of Font.
Bug fixing
  • Errors in Simd Library description.
Home

February 1, 2021 (version 4.6.98)

Algorithms

New features
  • Add parameter epsilon to GaussianBlur engine.
  • Add function SynetConvolution32fInfo.
  • Add function SynetConvolution8iInfo.
  • Add function SynetDeconvolution32fInfo.
  • Add function SynetMergedConvolution32fInfo.
  • Add function SynetMergedConvolution8iInfo.
Improving
  • Performance of SynetConvolution8iNhwcDirect class (case of horizontal padding of small image).
Renaming
  • GaussianBlur engine parameter from radius to sigma.
Bug fixing
  • Error in GaussianBlur engine (case of small images).
  • Performance degradation of AVX-512VNNI optimization of SynetConvolution8i framework.
  • Performance degradation of AVX-512VNNI optimization of SynetMergedConvolution8i framework.
  • Error in GaussianBlur engine (wrong processing of last rows).
  • Error in trajectory averaging algorithm in Motion::Detector.

Tests

New features
  • Possibility to write output video in UseMotionDetector.cpp example.
Bug fixing
  • Error in files: TestVideo.cpp, UseMotionDetector.cpp, UseFaceDetector.cpp (MSVS-2019, OpenCV enabled).

Documentation

Improving
  • Description of GaussianBlur engine.
  • Description of Motion::Detector.

Infrastructure

New feature
  • Ocv.prop.default for Visual Studio 2019.
Renaming
  • Cmake parameter from LIBRARY to SIMD_SHARED.
  • Cmake parameter from CHECK_VERSION to SIMD_GET_VERSION.
  • Cmake parameter from TOOLCHAIN to SIMD_TOOLCHAIN.
  • Cmake parameter from TARGET to SIMD_TARGET.
Home

January 4, 2021 (version 4.6.97)

Algorithms

New features
  • Base implementation, SSE2, AVX2, AVX-512F and NEON optimizations of function SynetMish32f.
  • Support of Mish activation function in SynetConvolution32f framework.
  • Support of Mish activation function in SynetMergedConvolution32f framework.
  • Support of Mish activation function in SynetConvolution8i framework.
  • Support of Mish activation function in SynetMergedConvolution8i framework.
  • Support of Mish activation function in SynetDeconvolution32f framework.
  • Base implementation, SSE4.1, AVX2, AVX-512BW and NEON optimizations of GaussianBlur engine.
Improving
  • AVX-512F optimization of SynetConvolution32fNhwcDirect class.
  • AVX-512F optimization of SynetConvolution32fGemmNN class.
  • AVX-512F optimization of SynetConvolution32fWinograd class.
  • AVX-512F optimization of function Gemm32fNN.
Bug fixing
  • Error in Base implementation of SynetMergedConvolution32f (type=CDC, add=1).
  • Error in function SimdAlignment.
  • Visual Studio 2017 compiler error in files SimdAvx512bwSynet.cpp, SimdAvx512bwSynetScale.cpp, SimdAvx512bwAlphaBlending.cpp.

Test framework

New features
  • Tests for verifying functionality of function SynetMish32f.
  • Tests for verifying functionality of GaussianBlur engine.
Home
2021 | 2020 | 2019 | 2018 | 2017 | 2016 | 2015 | 2014 | 2013