Skip to content

Simd v6.1.142

Latest
Compare
Choose a tag to compare
@ermig1979 ermig1979 released this 01 Oct 07:21
· 8 commits to master since this release

Algorithms

New features
  • Base implementation of class SynetDeconvolution16bGemm.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetDeconvolution16bNhwcGemm.
  • AMX-BF16 (AVX-512VBMI) optimizations of function DeinterleaveUv.
  • AMX-BF16 (AVX-512VBMI) optimizations of function DeinterleaveBgr.
  • AMX-BF16 (AVX-512VBMI) optimizations of function DeinterleaveBgra.
Improving
  • AVX-512BW optimizations of function ConvolutionDirectNhwcConvolutionBiasActivationDepthwise.
Removing
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetConvolution32fBf16NhwcGemm.
  • Base implementation of class SynetConvolution32fBf16Gemm.
  • Parameter 'compatibility' from function SynetConvolution32fInit.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Cdc.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Cd.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AMX-BF16 optimizations of class SynetMergedConvolution32fBf16Dc.
  • Base implementation of class SynetMergedConvolution32fBf16.
  • Parameter 'compatibility' from function SynetMergedConvolution32fInit.

Test framework

New features
  • Tests for verifying functionality of SynetDeconvolution16b framework.