summaryrefslogtreecommitdiffstats
path: root/src/compiler/glsl/ir_expression_flattening.cpp
diff options
context:
space:
mode:
authorKenneth Graunke <[email protected]>2016-04-29 22:06:37 -0700
committerKenneth Graunke <[email protected]>2016-06-06 14:14:31 -0700
commit0756e3a25ce5e81773a54942d737ba4643e2f7dc (patch)
tree97e136540ef99ab55219f5b295aadbaa93c8d791 /src/compiler/glsl/ir_expression_flattening.cpp
parent08ddfe7b2fa9f577ba00c8c05c5604460942f5a8 (diff)
glsl: Make opt_copy_propagation actually propagate into loops.
We've had a FINISHME here since Eric originally wrote the code in 2010. This patch implements his suggested approach, which makes us actually able to copy propagate into the loops, at the unfortunate cost of making this pass even more expensive. The shader-db statistics are not terribly impressive: total instructions in shared programs: 9008589 -> 9008613 (0.00%) instructions in affected programs: 4293 -> 4317 (0.56%) helped: 0 HURT: 10 total cycles in shared programs: 78550978 -> 78575760 (0.03%) cycles in affected programs: 655426 -> 680208 (3.78%) helped: 75 HURT: 88 GAINED: 2 Most of the "regressions" appear to be us successfully copy propagating uniforms, which i965 is loading as pull constants instead of push, so we occasionally have two pulls instead of one. That doesn't seem like this pass's job - it's propagating correctly, and we should be smarter about pull loads in the backend. This patch is also useful for a couple of reasons: 1. It can clean up copies created by varying packing (previously, we couldn't if the uses were inside a loop). This fixes a bug when interpolateAt*() is used on a packed varying inside a loop: glsl_to_nir struggles to see through the extra copy and mistakenly believed the variable was not an input. 2. It will help propagate uniform array access created by lower_const_array_to_uniforms(). Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
Diffstat (limited to 'src/compiler/glsl/ir_expression_flattening.cpp')
0 files changed, 0 insertions, 0 deletions