You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm seeing a lot of code examples which make use of the flexibility of determining the architecture or setting it at compile time. I was curious if there was a way to hint or specify the size of register that I'm expecting the api to be able to use instead.
This is namely to do with the swizzle operations. I've got an algorithm that only really needs to work with the 128 bit wide registers.
//so it seems a bit strange to do:
xsimd::batch<uint8_t, xsimd::ssse3> mask;
//to potentially make use oftemplate <classA>
XSIMD_INLINE batch<uint8_t, A> swizzle(batch<uint8_t, A> const& self, batch<uint8_t, A> mask, requires_arch<ssse3>) noexcept
{
return_mm_shuffle_epi8(self, mask);
}
//when I'd also want to make sure that this algorithm works on other architectures
I'm imagining something like this would make sense?
I'm seeing a lot of code examples which make use of the flexibility of determining the architecture or setting it at compile time. I was curious if there was a way to hint or specify the size of register that I'm expecting the api to be able to use instead.
This is namely to do with the
swizzle
operations. I've got an algorithm that only really needs to work with the 128 bit wide registers.I'm imagining something like this would make sense?
xsimd::batch<uint8_t, xsimd::register_width_128> mask;
Let me know if I'm missing something in the library that'd let me do this.
The text was updated successfully, but these errors were encountered: