Skip to content

Simd v4.3.75

Compare
Choose a tag to compare
@ermig1979 ermig1979 released this 07 Mar 11:12
· 2187 commits to master since this release

Algorithms

New features
  • Base implementation, SSE2, SSSE3, AVX2 and AVX-512BW optimizations of function BgraToYuva420p.
  • NEON optimization of function NeuralSigmoid.
  • NEON optimization of function NeuralTanh.
  • NEON optimization of function NeuralPow.
  • NEON version of functions GetFlushToZero and SetFlushToZero.
  • NEON optimization of function Fill32f.
  • NEON optimization of function AlphaFilling.
  • NEON optimization of function CosineDistance16f.
  • NEON optimization of function CosineDistance32f.
  • NEON optimization of function Gemm32fNN.
  • NEON optimization of function Gemm32fNT.
  • NEON optimization of function FillPixel.
  • NEON optimization of function ReduceColor2x2.
  • NEON optimization of function BayerToBgra.
  • NEON optimization of function BayerToBgr.
  • NEON optimization of function TransformImage.
  • NEON optimization of function BgraToYuva420p.
  • NEON optimization of function Yuva420pToBgra.
  • NEON optimization of function Resizer.
  • NEON optimization of function HogLiteFindMax7x7.
  • NEON optimization of function HogLiteCreateMask.
  • NEON optimization of function HogLiteFilterSeparable.
  • NEON optimization of function HogLiteCompressFeatures.
  • NEON optimization of function HogLiteResizeFeatures.
  • NEON optimization of function HogLiteFilterFeatures.
  • NEON optimization of function HogLiteExtractFeatures.
  • NEON optimization of function Winograd2x3SetFilter.
  • NEON optimization of function Winograd4x3SetFilter.
  • NEON optimization of function Winograd2x3SetInput.
  • NEON optimization of function Winograd2x3SetOutput.
  • NEON optimization of function SynetAddBias.
  • NEON optimization of function SynetEltwiseLayerForward.
  • NEON optimization of function SynetPoolingForwardMax.
  • NEON optimization of function SynetFusedLayerForward0.
  • NEON optimization of function SynetFusedLayerForward1.
  • NEON optimization of function SynetFusedLayerForward2.
  • NEON optimization of function SynetFusedLayerForward3.
  • NEON optimization of function SynetFusedLayerForward4.
  • NEON optimization of function SynetInnerProductLayerForward.
  • NEON optimization of function SynetLrnLayerCrossChannels.
  • NEON optimization of function SynetPreluLayerForward.
  • NEON optimization of function SynetRestrictRange.
  • NEON optimization of function SynetScaleLayerForward.
  • NEON optimization of function SynetSoftmaxLayerForward.
  • NEON optimization of function ConvolutionForward.
Improving
  • AVX, AVX2 and AVX-512F optimizations of function ConvolutionForward.
  • SSE, AVX, AVX2 and AVX-512F optimizations of function Resizer.
Bug fixing
  • Error in AVX-512BW optimization of function ChangeColors.
  • Error in AVX-512BW optimization of function NormalizeHistogram.
  • Error in AVX-512F optimization of function NeuralConvolutionForward.
  • Error in NEON optimization of function Uint8ToFloat32.
  • Error in NEON optimization of function SquaredDifferenceSum16f.
  • Error in SSE version of functions GetFlushToZero.
  • Error in Base implementation of function SynetFusedLayerForward0.

Test framework

New features
  • Tests for verifying functionality of function BgraToYuva420p.
  • Tests for verifying NEON optimization of of function NeuralSigmoid.
  • Tests for verifying NEON optimization of of function NeuralTanh.
  • Tests for verifying NEON optimization of of function NeuralPow.
  • Tests for verifying NEON optimization of of function Fill32f.
  • Tests for verifying NEON optimization of of function AlphaFilling.
  • Tests for verifying NEON optimization of of function CosineDistance16f.
  • Tests for verifying NEON optimization of of function CosineDistance32f.
  • Tests for verifying NEON optimization of of function Gemm32fNN.
  • Tests for verifying NEON optimization of of function Gemm32fNT.
  • Tests for verifying NEON optimization of of function FillPixel.
  • Tests for verifying NEON optimization of of function ReduceColor2x2.
  • Tests for verifying NEON optimization of of function BayerToBgra.
  • Tests for verifying NEON optimization of of function BayerToBgr.
  • Tests for verifying NEON optimization of of function TransformImage.
  • Tests for verifying NEON optimization of of function BgraToYuva420p.
  • Tests for verifying NEON optimization of of function Yuva420pToBgra.
  • Tests for verifying NEON optimization of of function Resizer.
  • Tests for verifying NEON optimization of of function HogLiteFindMax7x7.
  • Tests for verifying NEON optimization of of function HogLiteCreateMask.
  • Tests for verifying NEON optimization of of function HogLiteFilterSeparable.
  • Tests for verifying NEON optimization of of function HogLiteCompressFeatures.
  • Tests for verifying NEON optimization of of function HogLiteResizeFeatures.
  • Tests for verifying NEON optimization of of function HogLiteFilterFeatures.
  • Tests for verifying NEON optimization of of function HogLiteExtractFeatures.
  • Tests for verifying NEON optimization of of function Winograd2x3SetFilter.
  • Tests for verifying NEON optimization of of function Winograd4x3SetFilter.
  • Tests for verifying NEON optimization of of function Winograd2x3SetInput.
  • Tests for verifying NEON optimization of of function Winograd2x3SetOutput.
  • Tests for verifying NEON optimization of of function SynetAddBias.
  • Tests for verifying NEON optimization of of function SynetEltwiseLayerForward.
  • Tests for verifying NEON optimization of of function SynetPoolingForwardMax.
  • Tests for verifying NEON optimization of of function SynetFusedLayerForward0.
  • Tests for verifying NEON optimization of of function SynetFusedLayerForward1.
  • Tests for verifying NEON optimization of of function SynetFusedLayerForward2.
  • Tests for verifying NEON optimization of of function SynetFusedLayerForward3.
  • Tests for verifying NEON optimization of of function SynetFusedLayerForward4.
  • Tests for verifying NEON optimization of of function SynetInnerProductLayerForward.
  • Tests for verifying NEON optimization of of function SynetLrnLayerCrossChannels.
  • Tests for verifying NEON optimization of of function SynetPreluLayerForward.
  • Tests for verifying NEON optimization of of function SynetRestrictRange.
  • Tests for verifying NEON optimization of of function SynetScaleLayerForward.
  • Tests for verifying NEON optimization of of function SynetSoftmaxLayerForward.
  • Tests for verifying NEON optimization of of function ConvolutionForward.
Bug fixing
  • Error (at 32-bit OS) in test of function HogLiteFindMax7x7.