Simd v4.3.75
Algorithms
New features
- Base implementation, SSE2, SSSE3, AVX2 and AVX-512BW optimizations of function BgraToYuva420p.
- NEON optimization of function NeuralSigmoid.
- NEON optimization of function NeuralTanh.
- NEON optimization of function NeuralPow.
- NEON version of functions GetFlushToZero and SetFlushToZero.
- NEON optimization of function Fill32f.
- NEON optimization of function AlphaFilling.
- NEON optimization of function CosineDistance16f.
- NEON optimization of function CosineDistance32f.
- NEON optimization of function Gemm32fNN.
- NEON optimization of function Gemm32fNT.
- NEON optimization of function FillPixel.
- NEON optimization of function ReduceColor2x2.
- NEON optimization of function BayerToBgra.
- NEON optimization of function BayerToBgr.
- NEON optimization of function TransformImage.
- NEON optimization of function BgraToYuva420p.
- NEON optimization of function Yuva420pToBgra.
- NEON optimization of function Resizer.
- NEON optimization of function HogLiteFindMax7x7.
- NEON optimization of function HogLiteCreateMask.
- NEON optimization of function HogLiteFilterSeparable.
- NEON optimization of function HogLiteCompressFeatures.
- NEON optimization of function HogLiteResizeFeatures.
- NEON optimization of function HogLiteFilterFeatures.
- NEON optimization of function HogLiteExtractFeatures.
- NEON optimization of function Winograd2x3SetFilter.
- NEON optimization of function Winograd4x3SetFilter.
- NEON optimization of function Winograd2x3SetInput.
- NEON optimization of function Winograd2x3SetOutput.
- NEON optimization of function SynetAddBias.
- NEON optimization of function SynetEltwiseLayerForward.
- NEON optimization of function SynetPoolingForwardMax.
- NEON optimization of function SynetFusedLayerForward0.
- NEON optimization of function SynetFusedLayerForward1.
- NEON optimization of function SynetFusedLayerForward2.
- NEON optimization of function SynetFusedLayerForward3.
- NEON optimization of function SynetFusedLayerForward4.
- NEON optimization of function SynetInnerProductLayerForward.
- NEON optimization of function SynetLrnLayerCrossChannels.
- NEON optimization of function SynetPreluLayerForward.
- NEON optimization of function SynetRestrictRange.
- NEON optimization of function SynetScaleLayerForward.
- NEON optimization of function SynetSoftmaxLayerForward.
- NEON optimization of function ConvolutionForward.
Improving
- AVX, AVX2 and AVX-512F optimizations of function ConvolutionForward.
- SSE, AVX, AVX2 and AVX-512F optimizations of function Resizer.
Bug fixing
- Error in AVX-512BW optimization of function ChangeColors.
- Error in AVX-512BW optimization of function NormalizeHistogram.
- Error in AVX-512F optimization of function NeuralConvolutionForward.
- Error in NEON optimization of function Uint8ToFloat32.
- Error in NEON optimization of function SquaredDifferenceSum16f.
- Error in SSE version of functions GetFlushToZero.
- Error in Base implementation of function SynetFusedLayerForward0.
Test framework
New features
- Tests for verifying functionality of function BgraToYuva420p.
- Tests for verifying NEON optimization of of function NeuralSigmoid.
- Tests for verifying NEON optimization of of function NeuralTanh.
- Tests for verifying NEON optimization of of function NeuralPow.
- Tests for verifying NEON optimization of of function Fill32f.
- Tests for verifying NEON optimization of of function AlphaFilling.
- Tests for verifying NEON optimization of of function CosineDistance16f.
- Tests for verifying NEON optimization of of function CosineDistance32f.
- Tests for verifying NEON optimization of of function Gemm32fNN.
- Tests for verifying NEON optimization of of function Gemm32fNT.
- Tests for verifying NEON optimization of of function FillPixel.
- Tests for verifying NEON optimization of of function ReduceColor2x2.
- Tests for verifying NEON optimization of of function BayerToBgra.
- Tests for verifying NEON optimization of of function BayerToBgr.
- Tests for verifying NEON optimization of of function TransformImage.
- Tests for verifying NEON optimization of of function BgraToYuva420p.
- Tests for verifying NEON optimization of of function Yuva420pToBgra.
- Tests for verifying NEON optimization of of function Resizer.
- Tests for verifying NEON optimization of of function HogLiteFindMax7x7.
- Tests for verifying NEON optimization of of function HogLiteCreateMask.
- Tests for verifying NEON optimization of of function HogLiteFilterSeparable.
- Tests for verifying NEON optimization of of function HogLiteCompressFeatures.
- Tests for verifying NEON optimization of of function HogLiteResizeFeatures.
- Tests for verifying NEON optimization of of function HogLiteFilterFeatures.
- Tests for verifying NEON optimization of of function HogLiteExtractFeatures.
- Tests for verifying NEON optimization of of function Winograd2x3SetFilter.
- Tests for verifying NEON optimization of of function Winograd4x3SetFilter.
- Tests for verifying NEON optimization of of function Winograd2x3SetInput.
- Tests for verifying NEON optimization of of function Winograd2x3SetOutput.
- Tests for verifying NEON optimization of of function SynetAddBias.
- Tests for verifying NEON optimization of of function SynetEltwiseLayerForward.
- Tests for verifying NEON optimization of of function SynetPoolingForwardMax.
- Tests for verifying NEON optimization of of function SynetFusedLayerForward0.
- Tests for verifying NEON optimization of of function SynetFusedLayerForward1.
- Tests for verifying NEON optimization of of function SynetFusedLayerForward2.
- Tests for verifying NEON optimization of of function SynetFusedLayerForward3.
- Tests for verifying NEON optimization of of function SynetFusedLayerForward4.
- Tests for verifying NEON optimization of of function SynetInnerProductLayerForward.
- Tests for verifying NEON optimization of of function SynetLrnLayerCrossChannels.
- Tests for verifying NEON optimization of of function SynetPreluLayerForward.
- Tests for verifying NEON optimization of of function SynetRestrictRange.
- Tests for verifying NEON optimization of of function SynetScaleLayerForward.
- Tests for verifying NEON optimization of of function SynetSoftmaxLayerForward.
- Tests for verifying NEON optimization of of function ConvolutionForward.
Bug fixing
- Error (at 32-bit OS) in test of function HogLiteFindMax7x7.