drm/amd/display: Optimize gamma calculations - blaster4385/linux-IllusionX - Linux kernel with personal config changes for arch linux

diff options

author	Krunoslav Kovac <[email protected]>	2019-06-18 17:38:43 -0400
committer	Alex Deucher <[email protected]>	2019-07-18 14:27:25 -0500
commit	e752058b8671c6c87e484cff144c5c6309a37253 (patch)
tree	aa7835b4ae24d597f31f33b5947324d109b1f4f5 /tools/perf/scripts/python/net_dropmonitor.py
parent	c43f89f81cc0c54c7c460f1d6365445939069c83 (diff)

drm/amd/display: Optimize gamma calculations

[Why&How] 1. Stack usage is pretty high as fixed31_32 struct is 8 bytes and we have functions with >30 vars on the stack. 2. Optimize gamma calculation by reducing number of calls to dc_fixpt_pow Our X points are divided into 32 regions wth 16 pts each. Each region is 2x the previous, meaning x[i] = 2*x[i-16] for i>=16. Using (2x)^gamma = 2^gamma * x^gamma, we can recursively compute powers of gamma, we just need first 16 pts to start it up. dc_fixpt_pow() is expensive, it computes x^y by doing exp(y*logx) Exp is done by Taylor series approximation, and log by Newton-like approximation that also uses exp internally. In short, it's significantly heavier than run-of-the-mill addition/subtraction/multiply. Signed-off-by: Krunoslav Kovac <[email protected]> Reviewed-by: Anthony Koo <[email protected]> Acked-by: Aric Cyr <[email protected]> Acked-by: Leo Li <[email protected]> Signed-off-by: Alex Deucher <[email protected]>

Diffstat (limited to 'tools/perf/scripts/python/net_dropmonitor.py')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: