Merge pull request #102 from szabadka/master Restrict the ARM optimizations to little endian architectures.