summaryrefslogtreecommitdiffstats
path: root/src/gallium/drivers
diff options
context:
space:
mode:
authorNicolai Hähnle <[email protected]>2016-05-06 11:52:17 -0500
committerNicolai Hähnle <[email protected]>2016-05-09 11:52:46 -0500
commitfe102f7677547a48e2985b78ff6671a2ac9da9c4 (patch)
tree10886e839dad78a674a70fca0fd1ef02babc7190 /src/gallium/drivers
parentd8f3e8e6263214caa9daf914487595e6bd5aa0df (diff)
radeonsi: workaround for tesselation on SI
We request more than 32KB of LDS here, which SI doesn't have. Since LLVM recently started checking the size of declared LDS allocations, all shaders involved in tesselation fail to compile on SI. Note that the entire calculation here seems wrong, given how we calculate indices for generic attributes, so the number ends up wrong on CI+ as well. A proper solution is clearly needed, but this patch should serve as a band-aid for SI in the meantime. Also note that the real size of the LDS allocation in hardware is independent from what we tell LLVM, so this is really more of a "cosmetic" change. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95198 Cc: "11.2" <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
Diffstat (limited to 'src/gallium/drivers')
-rw-r--r--src/gallium/drivers/radeonsi/si_shader.c8
1 files changed, 8 insertions, 0 deletions
diff --git a/src/gallium/drivers/radeonsi/si_shader.c b/src/gallium/drivers/radeonsi/si_shader.c
index 211db9f6f2f..12ccbab04e1 100644
--- a/src/gallium/drivers/radeonsi/si_shader.c
+++ b/src/gallium/drivers/radeonsi/si_shader.c
@@ -4952,6 +4952,14 @@ static void declare_tess_lds(struct si_shader_context *ctx)
unsigned patch_dw_size = vertex_data_dw_size*2 + patch_data_dw_size;
unsigned lds_dwords = patch_dw_size;
+ if (ctx->screen->b.chip_class <= SI) {
+ /* This is a horrible temporary workaround to make tesselation
+ * not be completely broken on SI now that LLVM checks that
+ * the declared LDS size fits into the device maximum of 32KB.
+ */
+ lds_dwords = 8 * 1024;
+ }
+
/* The actual size is computed outside of the shader to reduce
* the number of shader variants. */
ctx->lds =