Introduction - If you have any usage issues, please Google them yourself
sboxlib automatically deals with endianness. Its solution
is not maxmimally efficient for little-endian processors, but
sboxlib s tasks should be so I/O dominated that this will not
be an issue. (In an ideal world, compilers would recognize
the standard assemble-bytes-into-integers sequences and output
efficient code for them.)