glsl: Use array deref for access to vector components

We've assumed that we could lower per-component vector access from vec[i] = scalar to vec = ir_triop_vector_insert(vec, scalar, i) but with SSBOs (and compute shader SLM and tesselation outputs) this is no longer valid. If a vector is "externally visible", multiple threads can write independent components simultaneously. With lowering to ir_triop_vector_insert, each thread read the entire vector, changes one component, then writes out the entire vector. This is racy. Instead of generating a ir_binop_vector_extract when we see v[i], we generate ir_dereference_array. We then add a lowering pass to lower the ir_dereference_array to ir_binop_vector_extract for rvalues and for to vector_insert for lvalues in a separate lowering pass. The resulting IR is the same as before, but we now have a window between ast->ir conversion and the lowering pass where v[i] appears in the IR as an array deref. This lets us run lowering passes that lower the vector access to I/O (eg for SSBO load/store) before we lower the per-component access to full vector writes. Reviewed-by: Jordan Justen <[email protected]> Signed-off-by: Kristian Høgsberg Kristensen <[email protected]>
author: Kristian Høgsberg Kristensen <[email protected]> 2015-11-04 14:58:54 -0800
committer: Kristian Høgsberg Kristensen <[email protected]> 2015-11-10 12:02:46 -0800
commit: 96b22fb080894ba1840af2372f28a46cc0f40c76 (patch)
tree: 197f2454ecfd1778eeea2d81146682ff35fce01e /src/glsl/ir_optimization.h
parent: 60dd5287ff8dbbbe0dbe76bdff6d13c7a5ea9ef0 (diff)
1 files changed, 1 insertions, 0 deletions
diff --git a/src/glsl/ir_optimization.h b/src/glsl/ir_optimization.h
index 6d19a6ca476..2fee81c09c2 100644
--- a/src/glsl/ir_optimization.h
+++ b/src/glsl/ir_optimization.h
@@ -129,6 +129,7 @@ void lower_packed_varyings(void *mem_ctx,
                            unsigned locations_used, ir_variable_mode mode,
                            unsigned gs_input_vertices, gl_shader *shader);
 bool lower_vector_insert(exec_list *instructions, bool lower_nonconstant_index);
+bool lower_vector_derefs(gl_shader *shader);
 void lower_named_interface_blocks(void *mem_ctx, gl_shader *shader);
 bool optimize_redundant_jumps(exec_list *instructions);
 bool optimize_split_arrays(exec_list *instructions, bool linked);
author	Kristian Høgsberg Kristensen <[email protected]>	2015-11-04 14:58:54 -0800
committer	Kristian Høgsberg Kristensen <[email protected]>	2015-11-10 12:02:46 -0800
commit	96b22fb080894ba1840af2372f28a46cc0f40c76 (patch)
tree	197f2454ecfd1778eeea2d81146682ff35fce01e /src/glsl/ir_optimization.h
parent	60dd5287ff8dbbbe0dbe76bdff6d13c7a5ea9ef0 (diff)