diff options
| author | Joonsoo Kim <[email protected]> | 2015-02-10 14:09:35 -0800 | 
|---|---|---|
| committer | Linus Torvalds <[email protected]> | 2015-02-10 14:30:30 -0800 | 
| commit | ccaafd7fd039aebc9359a9799f8558b01f1c2adc (patch) | |
| tree | c4a32ede5bb661489da8846cfe947bcb251f6c11 /drivers/scsi/mpt3sas/mpi/mpi2_raid.h | |
| parent | 9aabf810a67cd97e2d1a48f0bab338b7680f1929 (diff) | |
mm: don't use compound_head() in virt_to_head_page()
compound_head() is implemented with assumption that there would be race
condition when checking tail flag.  This assumption is only true when we
try to access arbitrary positioned struct page.
The situation that virt_to_head_page() is called is different case.  We
call virt_to_head_page() only in the range of allocated pages, so there
is no race condition on tail flag.  In this case, we don't need to
handle race condition and we can reduce overhead slightly.  This patch
implements compound_head_fast() which is similar with compound_head()
except tail flag race handling.  And then, virt_to_head_page() uses this
optimized function to improve performance.
I saw 1.8% win in a fast-path loop over kmem_cache_alloc/free, (14.063
ns -> 13.810 ns) if target object is on tail page.
Signed-off-by: Joonsoo Kim <[email protected]>
Acked-by: Christoph Lameter <[email protected]>
Cc: Pekka Enberg <[email protected]>
Cc: David Rientjes <[email protected]>
Cc: Joonsoo Kim <[email protected]>
Cc: Jesper Dangaard Brouer <[email protected]>
Signed-off-by: Andrew Morton <[email protected]>
Signed-off-by: Linus Torvalds <[email protected]>
Diffstat (limited to 'drivers/scsi/mpt3sas/mpi/mpi2_raid.h')
0 files changed, 0 insertions, 0 deletions