aboutsummaryrefslogtreecommitdiff
path: root/tools/perf/scripts/python
diff options
context:
space:
mode:
authorChristophe Leroy <[email protected]>2015-09-22 16:34:32 +0200
committerScott Wood <[email protected]>2016-03-04 23:03:45 -0600
commitf867d556dd8525fe6ff0d22a34249528e590f994 (patch)
tree32ebba9cfc1b00d1f394b480d5cfab443382864e /tools/perf/scripts/python
parent48821a34b1bdc5d89505cb814b3f7c166940f200 (diff)
powerpc32: optimise csum_partial() loop
On the 8xx, load latency is 2 cycles and taking branches also takes 2 cycles. So let's unroll the loop. This patch improves csum_partial() speed by around 10% on both: * 8xx (single issue processor with parallel execution) * 83xx (superscalar 6xx processor with dual instruction fetch and parallel execution) Signed-off-by: Christophe Leroy <[email protected]> Signed-off-by: Scott Wood <[email protected]>
Diffstat (limited to 'tools/perf/scripts/python')
0 files changed, 0 insertions, 0 deletions