path: root/doxygen/math_subset.doxy
author Marek Olšák <[email protected]> 2017-05-29 00:40:39 +0200
committer Marek Olšák <[email protected]> 2017-06-07 20:14:15 +0200
commit 2b8b9a56efc24cc0f27469bf1532c288cdca2076 (patch)
tree 338aae3dc064c1ff78befc7c8d26ff3d21bf780c /doxygen/math_subset.doxy
parent 2c4ec3f93fcab3fddcbe132200b210e7def1facc (diff)
radeonsi: move PSIZE and CLIPDIST unique IO indices after GENERIC
Heaven LDS usage for LS+HS is below. The masks are "outputs_written"
for LS and HS. Note that 32K is the maximum size.

Before:
  heaven_x64: ls=1f1 tcs=1f1, lds=32K
  heaven_x64: ls=31 tcs=31, lds=24K
  heaven_x64: ls=71 tcs=71, lds=28K

After:
  heaven_x64: ls=3f tcs=3f, lds=24K
  heaven_x64: ls=7 tcs=7, lds=13K
  heaven_x64: ls=f tcs=f, lds=17K

All other apps have a similar decrease in LDS usage, because
the "outputs_written" masks are similar.

Also, most apps don't write POSITION in these shader stages, so there is
room for improvement. (tight per-component input/output packing might
help even more)

It's unknown whether this improves performance.

Tested-by: Edmondo Tommasina <[email protected]>
Tested-by: Dieter Nützel <[email protected]>
Reviewed-by: Nicolai Hähnle <[email protected]>
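To make the masks above easier to read, here is a minimal sketch that decodes the hexadecimal "outputs_written" values quoted in the message. It assumes, which the commit message does not state, that the per-vertex LDS footprint tracks the index of the highest set bit in the mask rather than the number of set bits. The helper names `slots_spanned` and `slots_written` are hypothetical and are not part of the driver.

```python
# Illustrative only: decode the "outputs_written" masks quoted in the
# commit message. The helper names are hypothetical, not driver code.

def slots_spanned(mask: int) -> int:
    """Index of the highest set bit plus one (0 for an empty mask)."""
    return mask.bit_length()


def slots_written(mask: int) -> int:
    """Number of set bits, i.e. outputs actually written."""
    return bin(mask).count("1")


# Masks taken verbatim from the Before/After lists in the message.
BEFORE = [0x1F1, 0x31, 0x71]
AFTER = [0x3F, 0x7, 0xF]

for before, after in zip(BEFORE, AFTER):
    print(
        f"{before:#x}: spans {slots_spanned(before)}, "
        f"writes {slots_written(before)} -> "
        f"{after:#x}: spans {slots_spanned(after)}, "
        f"writes {slots_written(after)}"
    )
```

For all three mask pairs the number of written slots is unchanged, while the span of slots shrinks by three. Under the stated assumption, that smaller span would account for the lower LDS figures in the "After" list.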
Diffstat (limited to 'doxygen/math_subset.doxy')
0 files changed, 0 insertions, 0 deletions