author: Oded Gabbay <[email protected]> 2022-04-06 12:07:19 +0300
committer: Oded Gabbay <[email protected]> 2022-09-18 13:29:49 +0300
commit: cd6b0cea89862a5b3411246a2410881a988d5b0f
tree: 13d4dac9ec6ee80bf9de34db99d0f30ad7fe55ed /tools/perf/scripts/python/sched-migration.py
parent: 913bd4179b82adfeece29243711ccaf4330772b6
habanalabs/gaudi: increase default cs timeout to 10 minutes
To improve scalability and reduce host overhead, increase the default TDR timeout of Gaudi1 from 30 seconds to 10 minutes. This allows DL frameworks (e.g. PyTorch, TensorFlow) to remove the host sync they currently use, which improves overall performance in scale-out training.

Note that the timeout can always be set to a custom value via a kernel module parameter given at driver load.

Signed-off-by: Oded Gabbay <[email protected]>
Diffstat (limited to 'tools/perf/scripts/python/sched-migration.py')
0 files changed, 0 insertions, 0 deletions