diff options
author | Francisco Jerez <[email protected]> | 2020-01-03 15:58:05 -0800 |
---|---|---|
committer | Francisco Jerez <[email protected]> | 2020-01-17 13:22:39 -0800 |
commit | 9c9e80103ca2477d71e96750232da26028b4973d (patch) | |
tree | e9fb4cdd774f875db61465f8c0c1791aa3441ae8 /src/intel | |
parent | 0dd18d70aecd3bafb7dcde15430a024942fe243c (diff) |
intel/fs/gen6: Use SEL instead of bashing thread payload for unlit centroid workaround.
This prevents regressions on SNB due to the redundant MOVs lying
around in cases where fetch_payload_reg() returns a VGRF (currently
only in SIMD32 but soon in pretty much all cases). The MOVs can't be
register-coalesced due to their source being a FIXED_GRF, and they
can't be copy-propagated either due to the unlit centroid workaround
partial writes. They can be copy-propagated just fine into a SEL
instruction though.
On SNB this prevents the following shader-db regressions (including
SIMD32 programs) in combination with the interpolation rework part of
this series:
total instructions in shared programs: 13996898 -> 14001982 (0.04%)
instructions in affected programs: 197461 -> 202545 (2.57%)
helped: 0
HURT: 1251
Reviewed-by: Kenneth Graunke <[email protected]>
Diffstat (limited to 'src/intel')
-rw-r--r-- | src/intel/compiler/brw_fs_visitor.cpp | 13 |
1 files changed, 8 insertions, 5 deletions
diff --git a/src/intel/compiler/brw_fs_visitor.cpp b/src/intel/compiler/brw_fs_visitor.cpp index ce51268ec8d..951b9f49e48 100644 --- a/src/intel/compiler/brw_fs_visitor.cpp +++ b/src/intel/compiler/brw_fs_visitor.cpp @@ -351,14 +351,17 @@ fs_visitor::emit_interpolation_setup_gen6() if (!(centroid_modes & (1 << i))) continue; + const fs_reg centroid_delta_xy = delta_xy[i]; const fs_reg &pixel_delta_xy = delta_xy[i - 1]; - for (unsigned q = 0; q < dispatch_width / 8; q++) { - for (unsigned c = 0; c < 2; c++) { + delta_xy[i] = bld.vgrf(BRW_REGISTER_TYPE_F, 2); + + for (unsigned c = 0; c < 2; c++) { + for (unsigned q = 0; q < dispatch_width / 8; q++) { const unsigned idx = c + (q & 2) + (q & 1) * dispatch_width / 8; - set_predicate_inv( - BRW_PREDICATE_NORMAL, true, - bld.half(q).MOV(horiz_offset(delta_xy[i], idx * 8), + set_predicate(BRW_PREDICATE_NORMAL, + bld.half(q).SEL(horiz_offset(delta_xy[i], idx * 8), + horiz_offset(centroid_delta_xy, idx * 8), horiz_offset(pixel_delta_xy, idx * 8))); } } |