Skip to content

Simd v6.1.139

Latest
Compare
Choose a tag to compare
@ermig1979 ermig1979 released this 01 Jul 14:09
· 27 commits to master since this release

Algorithms

New features
  • API of SynetInnerProduct16b framework.
  • Base implementation of class SynetInnerProduct16bRef.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetInnerProduct16bGemmNN.
Bug fixing
  • Error in AVX-512BF16 optimizations of class SynetConvolution16bNhwcDirect.
  • Error in Base implementation of class SynetConvolution16bNhwcGemm.
  • Error in SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of function Convert16bNhwcDirect.
  • Error in SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of function Reorder16bNhwcDirect.
  • Error in Base implementation of class SynetMergedConvolution16bCdc.
  • Error in Base implementation of class SynetMergedConvolution16bDc.
  • Error in Base implementation of class SynetMergedConvolution16bCd.
  • Error in AMX-BF16 optimizations of class SynetMergedConvolution16bDc.

Test framework

New features
  • Tests for verifying functionality of SynetInnerProduct16b framework.