mm/rmap: minimize folio->_nr_pages_mapped updates when batching PTE (un)mapping - blaster4385/linux-IllusionX - Linux kernel with personal config changes for arch linux

diff options

author	David Hildenbrand <[email protected]>	2024-08-07 13:55:15 +0200
committer	Andrew Morton <[email protected]>	2024-09-01 20:26:04 -0700
commit	43c9074e6f093d304d55c43638732c402be75e2b (patch)
tree	cc9fa5c56e36395a8f4fcfc3b01dc933f0aa4415 /tools/perf/scripts/python
parent	67203f3f2a63d429272f0c80451e5fcc469fdb46 (diff)

mm/rmap: minimize folio->_nr_pages_mapped updates when batching PTE (un)mapping

It is not immediately obvious, but we can move the folio->_nr_pages_mapped update out of the loop and reduce the number of atomic ops without affecting the stats. The important point to realize is that only removing the last PMD mapping will result in _nr_pages_mapped going below ENTIRELY_MAPPED, not the individual atomic_inc_return_relaxed() calls. Concurrent races with removal of PMD mappings should be handled as expected, just like when we would have such races right now on a single mapcount update. In a simple munmap() microbenchmark [1] on 1 GiB of memory backed by the same PTE-mapped folio size (only mapped by a single process such that they will get completely unmapped), this change results in a speedup (positive is good) per folio size on a x86-64 Intel machine of roughly (a bit of noise expected): * 16 KiB: +10% * 32 KiB: +15% * 64 KiB: +17% * 128 KiB: +21% * 256 KiB: +22% * 512 KiB: +22% * 1024 KiB: +23% * 2048 KiB: +27% [1] https://gitlab.com/davidhildenbrand/scratchspace/-/blob/main/pte-mapped-folio-benchmarks.c Link: https://lkml.kernel.org/r/[email protected] Signed-off-by: David Hildenbrand <[email protected]> Signed-off-by: Andrew Morton <[email protected]>

Diffstat (limited to 'tools/perf/scripts/python')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: