arm64: blake2s using scalar instructions All inline rotates omitted. Faster on TX2 but not on A57 Signed-off-by: Ard Biesheuvel <ardb@kernel.org>