Simd v4.7.102

@ermig1979 ermig1979 released this 02 Jun 11:02
· 1325 commits to master since this release

Algorithms

New features
  • Base implementation and SSE4.1, AVX2, AVX-512BW optimizations of function ValueSquareSums.
Improving
  • Performance of AVX2, AVX-512F and NEON optimizations of SynetConvolution32fGemmNN class.
  • Performance of Neural::FullyConnectedLayer::Forward method.
Bug fixing
  • Error in class SynetMergedConvolution32fDc (large weights case).
  • Compiler error in file SimdAvx2SynetConversion.cpp (MSVS-2015, Win32).
  • Error in SSSE3 optimization of ImageTransform function.
  • Compiler error in file SimdImageSaveJpeg.h (Clang, Mac mini).
  • Compiler warnings (Clang).
  • Error in function ImagePngLoader::ReadTransparency (test tbbn0g04.png).
  • Error in Base implementation and SSE4.1 optimization of class ImagePngLoader (test basn0g16.png).
  • Error in SSE4.1 optimization of class ImagePngLoader (test s02i3p01.png).
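
For readers unfamiliar with the new ValueSquareSums function, the following is a minimal scalar sketch of what such a function plausibly computes: the per-channel sum of values and sum of squared values over an interleaved 8-bit image. The name ValueSquareSumsSketch, the parameter list and the interleaved layout are assumptions made for illustration. This is not the library's source or its confirmed signature.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical scalar reference, NOT taken from the Simd sources.
// Assumes an interleaved 8-bit image: 'channels' bytes per pixel,
// 'stride' bytes between the starts of consecutive rows.
// Writes one value sum and one square sum per channel.
inline void ValueSquareSumsSketch(const uint8_t* src, size_t stride,
    size_t width, size_t height, size_t channels,
    uint64_t* valueSums, uint64_t* squareSums)
{
    // Clear the per-channel accumulators.
    for (size_t c = 0; c < channels; ++c)
    {
        valueSums[c] = 0;
        squareSums[c] = 0;
    }
    for (size_t y = 0; y < height; ++y)
    {
        // Row start; bytes past width * channels are padding and are skipped.
        const uint8_t* row = src + y * stride;
        for (size_t x = 0; x < width; ++x)
        {
            for (size_t c = 0; c < channels; ++c)
            {
                // Widen to 64 bits before squaring to avoid overflow.
                uint64_t v = row[x * channels + c];
                valueSums[c] += v;
                squareSums[c] += v * v;
            }
        }
    }
}
```

The SSE4.1, AVX2 and AVX-512BW versions listed under New features would be expected to produce the same totals while processing many bytes per instruction. A scalar routine like this one is the kind of baseline such optimizations are usually checked against.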

Tests

New features
  • Tests to verify functionality of function ValueSquareSums.
Improving
  • Header of performance report table.
Bug fixing
  • Compiler error in file TestFile.h (Clang, Mac mini).