diff options
| author | Somnath Kotur <[email protected]> | 2024-04-02 02:37:49 -0700 |
|---|---|---|
| committer | Jakub Kicinski <[email protected]> | 2024-04-04 09:13:19 -0700 |
| commit | fba2e4e5dbab399eb8801801471ac69f9baeba98 (patch) | |
| tree | 4adf6f69876da975d7f2d1a5310147d446e7f6ca /tools/perf/scripts/python/failed-syscalls-by-pid.py | |
| parent | 8635ae8e99a670b38198b7561c6c57b13418f108 (diff) | |
bnxt_en: Allocate page pool per numa node
Driver's Page Pool allocation code looks at the node local
to the PCIe device to determine where to allocate memory.
In scenarios where the core count per NUMA node is low (< default rings)
it makes sense to exhaust page pool allocations on
Node 0 first and then moving on to allocating page pools
for the remaining rings from Node 1.
With this patch, and the following configuration on the NIC
$ ethtool -L ens1f0np0 combined 16
(core count/node = 12, first 12 rings on node#0, last 4 rings node#1)
and traffic redirected to a ring on node#1 , we see a performance
improvement of ~20%
Signed-off-by: Somnath Kotur <[email protected]>
Reviewed-by: Andy Gospodarek <[email protected]>
Reviewed-by: Michael Chan <[email protected]>
Signed-off-by: Pavan Chebbi <[email protected]>
Acked-by: Paolo Abeni <[email protected]>
Link: https://lore.kernel.org/r/[email protected]
Signed-off-by: Jakub Kicinski <[email protected]>
Diffstat (limited to 'tools/perf/scripts/python/failed-syscalls-by-pid.py')
0 files changed, 0 insertions, 0 deletions