aboutsummaryrefslogtreecommitdiff
path: root/tools/perf/scripts/python
diff options
context:
space:
mode:
authorKAMEZAWA Hiroyuki <[email protected]>2010-10-27 15:34:15 -0700
committerLinus Torvalds <[email protected]>2010-10-27 18:03:13 -0700
commit478735e38887077ac77a9756121b6ce0cb956e2f (patch)
tree7b42110f71b1711f56cbbc0a01f00894b221337d /tools/perf/scripts/python
parentf2c66cd8eeddedb440f33bc0f5cec1ed7ae376cb (diff)
/proc/stat: fix scalability of irq sum of all cpu
In /proc/stat, the number of per-IRQ event is shown by making a sum each irq's events on all cpus. But we can make use of kstat_irqs(). kstat_irqs() do the same calculation, If !CONFIG_GENERIC_HARDIRQ, it's not a big cost. (Both of the number of cpus and irqs are small.) If a system is very big and CONFIG_GENERIC_HARDIRQ, it does for_each_irq() for_each_cpu() - look up a radix tree - read desc->irq_stat[cpu] This seems not efficient. This patch adds kstat_irqs() for CONFIG_GENRIC_HARDIRQ and change the calculation as for_each_irq() look up radix tree for_each_cpu() - read desc->irq_stat[cpu] This reduces cost. A test on (4096cpusp, 256 nodes, 4592 irqs) host (by Jack Steiner) %time cat /proc/stat > /dev/null Before Patch: 2.459 sec After Patch : .561 sec [[email protected]: unexport kstat_irqs, coding-style tweaks] [[email protected]: fix unused variable 'per_irq_sum'] Signed-off-by: KAMEZAWA Hiroyuki <[email protected]> Tested-by: Jack Steiner <[email protected]> Acked-by: Jack Steiner <[email protected]> Cc: Yinghai Lu <[email protected]> Cc: Ingo Molnar <[email protected]> Cc: Thomas Gleixner <[email protected]> Signed-off-by: Andrew Morton <[email protected]> Signed-off-by: Linus Torvalds <[email protected]>
Diffstat (limited to 'tools/perf/scripts/python')
0 files changed, 0 insertions, 0 deletions