path: root/tools/perf/scripts/python/netdev-times.py
author: Will Deacon <[email protected]> 2016-02-02 12:46:25 +0000
committer: Catalin Marinas <[email protected]> 2016-02-16 15:12:33 +0000
commit: 223e23e8aa26b0bb62c597637e77295e14f6a62c (patch)
tree: 264cb0aa4882664aba7561bcde45537b42935aa5 /tools/perf/scripts/python/netdev-times.py
parent: d5370f754875460662abe8561388e019d90dd0c4 (diff)
arm64: lib: improve copy_page to deal with 128 bytes at a time
We want to avoid lots of different copy_page implementations, settling
for something that is "good enough" everywhere and hopefully easy to
understand and maintain whilst we're at it.

This patch reworks our copy_page implementation based on discussions
with Cavium on the list and benchmarking on Cortex-A processors so that:

  - The loop is unrolled to copy 128 bytes per iteration

  - The reads are offset so that we read from the next 128-byte block
    in the same iteration that we store the previous block

  - Explicit prefetch instructions are removed for now, since they hurt
    performance on CPUs with hardware prefetching

  - The loop exit condition is calculated at the start of the loop

Signed-off-by: Will Deacon <[email protected]>
Tested-by: Andrew Pinski <[email protected]>
Signed-off-by: Catalin Marinas <[email protected]>
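
The loop shape described in the commit message can be sketched in portable C. This is an illustration only, not the arm64 assembly from the patch: the name copy_page_sketch, the SKETCH_PAGE_SIZE value of 4096 and the use of memcpy for each 128-byte block are all assumptions made for the example. It shows the first block being loaded ahead of the loop, each iteration storing the previous block while loading the next one, and the exit condition being computed at the top of the iteration but tested at the bottom.

```c
#include <stddef.h>
#include <string.h>

/* Assumed values for illustration; the real PAGE_SIZE depends on kernel config. */
#define SKETCH_PAGE_SIZE 4096u
#define SKETCH_BLOCK     128u

/* Hypothetical C model of a software-pipelined, 128-byte-unrolled page copy. */
static void copy_page_sketch(void *dst, const void *src)
{
    unsigned char *d = dst;
    const unsigned char *s = src;
    unsigned char block[SKETCH_BLOCK];  /* stands in for the register set */
    size_t remaining = SKETCH_PAGE_SIZE - SKETCH_BLOCK;
    int more;

    /* Prime the pipeline: load the first block before entering the loop. */
    memcpy(block, s, SKETCH_BLOCK);
    s += SKETCH_BLOCK;

    do {
        /* Exit condition is computed at the start of the iteration... */
        remaining -= SKETCH_BLOCK;
        more = remaining > 0;

        /* Store the previous block, then load the next one. */
        memcpy(d, block, SKETCH_BLOCK);
        memcpy(block, s, SKETCH_BLOCK);
        d += SKETCH_BLOCK;
        s += SKETCH_BLOCK;

        /* ...and only tested here, at the bottom. */
    } while (more);

    /* Drain the pipeline: store the last block that was loaded. */
    memcpy(d, block, SKETCH_BLOCK);
}
```

The sketch reads exactly one page from the source, because the final iteration loads the last block and the store after the loop writes it out. It makes no attempt to model the absence of prefetch instructions, which has no direct equivalent in portable C.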
Diffstat (limited to 'tools/perf/scripts/python/netdev-times.py')
0 files changed, 0 insertions, 0 deletions