Add superscalar fletcher4

This is the Fletcher4 algorithm implemented in pure C, but using multiple counters using algorithms identical to those used for SSE/NEON and AVX2. This allows for faster execution on core with strong superscalar capabilities but weak SIMD capabilities. Reviewed-by: Brian Behlendorf <[email protected]> Signed-off-by: Romain Dolbeau <[email protected]> Closes #5317
author: Romain Dolbeau <[email protected]> 2016-11-04 18:53:03 +0100
committer: Brian Behlendorf <[email protected]> 2016-11-04 10:53:03 -0700
commit: 7f3194932d22c667026aff1b263ceaa1ebd012ee (patch)
tree: b68eee7abd34de2410db38208c35c184af2ddd70 /module/zcommon/zfs_fletcher_aarch64_neon.c
parent: ace1eae84cca8579596f46262d99df19f6d7e963 (diff)
1 files changed, 1 insertions, 1 deletions
diff --git a/module/zcommon/zfs_fletcher_aarch64_neon.c b/module/zcommon/zfs_fletcher_aarch64_neon.c
index f8991b171..b72c1bafd 100644
--- a/module/zcommon/zfs_fletcher_aarch64_neon.c
+++ b/module/zcommon/zfs_fletcher_aarch64_neon.c
@@ -2,7 +2,7 @@
  * Implement fast Fletcher4 with NEON instructions. (aarch64)
  *
  * Use the 128-bit NEON SIMD instructions and registers to compute
- * Fletcher4 in four incremental 64-bit parallel accumulator streams,
+ * Fletcher4 in two incremental 64-bit parallel accumulator streams,
  * and then combine the streams to form the final four checksum words.
  * This implementation is a derivative of the AVX SIMD implementation by
  * James Guilford and Jinshan Xiong from Intel (see zfs_fletcher_intel.c).
author	Romain Dolbeau <[email protected]>	2016-11-04 18:53:03 +0100
committer	Brian Behlendorf <[email protected]>	2016-11-04 10:53:03 -0700
commit	7f3194932d22c667026aff1b263ceaa1ebd012ee (patch)
tree	b68eee7abd34de2410db38208c35c184af2ddd70 /module/zcommon/zfs_fletcher_aarch64_neon.c
parent	ace1eae84cca8579596f46262d99df19f6d7e963 (diff)