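I suggest using the [GNU C Vector Extensions](https://gcc.gnu.org/onlinedocs/gcc/Vector-Extensions.html). They are not tied to a particular instruction set: the compiler maps them to the target's SIMD instructions where it can and synthesizes equivalent code from narrower operations where it cannot. Both GCC and Clang support them.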
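As a rough illustration, here is a minimal sketch of what that can look like. The type name `v4si` and the helper `add4` are made up for this example; only the `vector_size` attribute and the element-wise `+` on vector types come from the extension itself.

```c
#include <stdint.h>
#include <string.h>

/* Four 32-bit ints packed into one 16-byte vector (GNU C extension).
 * The name v4si is only a convention chosen for this example. */
typedef int32_t v4si __attribute__((vector_size(16)));

/* Hypothetical helper: out[i] = a[i] + b[i] for i in 0..3. */
void add4(const int32_t *a, const int32_t *b, int32_t *out)
{
    v4si va, vb, vc;

    /* memcpy keeps the loads and stores valid for unaligned pointers. */
    memcpy(&va, a, sizeof va);
    memcpy(&vb, b, sizeof vb);

    /* Ordinary operators work element-wise on vector types. */
    vc = va + vb;

    memcpy(out, &vc, sizeof vc);
}
```

On a target with 128-bit SIMD this should typically compile down to a single vector add, but that depends on the compiler, flags, and target, so it is worth checking the generated assembly.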