autonuma: reduce cache footprint when scanning page tables - blaster4385/linux-IllusionX - Linux kernel with personal config changes for arch linux

diff options

author	Huang Ying <[email protected]>	2019-11-30 17:57:32 -0800
committer	Linus Torvalds <[email protected]>	2019-12-01 12:59:09 -0800
commit	a818f5363a0eba04bcff986c64c919d3f44b8017 (patch)
tree	fc05add8627a44fe3082cfe47933f9c2a096bb3a /tools/perf/scripts/python/stackcollapse.py
parent	bfe9d006c971a5daefe7a8b27819ccd497090fd8 (diff)

autonuma: reduce cache footprint when scanning page tables

In auto NUMA balancing page table scanning, if the pte_protnone() is true, the PTE needs not to be changed because it's in target state already. So other checking on corresponding struct page is unnecessary too. So, if we check pte_protnone() firstly for each PTE, we can avoid unnecessary struct page accessing, so that reduce the cache footprint of NUMA balancing page table scanning. In the performance test of pmbench memory accessing benchmark with 80:20 read/write ratio and normal access address distribution on a 2 socket Intel server with Optance DC Persistent Memory, perf profiling shows that the autonuma page table scanning time reduces from 1.23% to 0.97% (that is, reduced 21%) with the patch. Link: http://lkml.kernel.org/r/[email protected] Signed-off-by: "Huang, Ying" <[email protected]> Acked-by: Mel Gorman <[email protected]> Cc: Michal Hocko <[email protected]> Cc: Rik van Riel <[email protected]> Cc: Peter Zijlstra <[email protected]> Cc: Ingo Molnar <[email protected]> Cc: Dave Hansen <[email protected]> Cc: Dan Williams <[email protected]> Cc: Fengguang Wu <[email protected]> Signed-off-by: Andrew Morton <[email protected]> Signed-off-by: Linus Torvalds <[email protected]>

Diffstat (limited to 'tools/perf/scripts/python/stackcollapse.py')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: