diff options
author | Alexander Motin <[email protected]> | 2019-06-14 17:07:34 -0400 |
---|---|---|
committer | Brian Behlendorf <[email protected]> | 2019-06-14 14:07:33 -0700 |
commit | c1b5801bb5af0055e5f3d263beaa07026103e212 (patch) | |
tree | 63e8e3b65273d1546b3352d06e5c9c85461feb8f /include/sys/dmu_traverse.h | |
parent | b1b4ac27082aede8522e479c87897026519f1dd7 (diff) |
Minimize aggsum_compare(&arc_size, arc_c) calls.
For busy ARC situation when arc_size close to arc_c is desired. But
then it is quite likely that aggsum_compare(&arc_size, arc_c) will need
to flush per-CPU buckets to find exact comparison result. Doing that
often in a hot path penalizes whole idea of aggsum usage there, since it
replaces few simple atomic additions with dozens of lock acquisitions.
Replacing aggsum_compare() with aggsum_upper_bound() in code increasing
arc_p when ARC is growing (arc_size < arc_c) according to PMC profiles
allows to save ~5% of CPU time in aggsum code during sequential write
to 12 ZVOLs with 16KB block size on large dual-socket system.
I suppose there some minor arc_p behavior change due to lower precision
of the new code, but I don't think it is a big deal, since it should
affect only very small window in time (aggsum buckets are flushed every
second) and in ARC size (buckets are limited to 10 average ARC blocks
per CPU).
Reviewed-by: Chris Dunlop <[email protected]>
Reviewed-by: Richard Elling <[email protected]>
Reviewed-by: George Melikov <[email protected]>
Reviewed-by: Allan Jude <[email protected]>
Reviewed-by: Brian Behlendorf <[email protected]>
Signed-off-by: Alexander Motin <[email protected]>
Closes #8901
Diffstat (limited to 'include/sys/dmu_traverse.h')
0 files changed, 0 insertions, 0 deletions