diff options
author | Tim Rowley <[email protected]> | 2016-10-04 12:59:30 -0500 |
---|---|---|
committer | Tim Rowley <[email protected]> | 2016-10-11 11:22:04 -0500 |
commit | 2966d9c691fd0cd51d83204cac6b3194b9dcb878 (patch) | |
tree | 066bd6639329f655ffd19301d500bff2d3ef75f6 /src/gallium/drivers/swr/rasterizer/common | |
parent | 6b3691c8762320df5afc8a7e79b9da09e272695b (diff) |
swr: [rasterizer core] align Macrotile FIFO memory to SIMD size
Align and use streaming store instructions for BE fifo queues.
Provides slightly faster enqueue and doesn't pollute the caches.
Add appropriate memory fences to ensure streaming writes are
globally visible.
Signed-off-by: Tim Rowley <[email protected]>
Diffstat (limited to 'src/gallium/drivers/swr/rasterizer/common')
-rw-r--r-- | src/gallium/drivers/swr/rasterizer/common/simdintrin.h | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/src/gallium/drivers/swr/rasterizer/common/simdintrin.h b/src/gallium/drivers/swr/rasterizer/common/simdintrin.h index 4a91e95f7ff..3ad37de3d49 100644 --- a/src/gallium/drivers/swr/rasterizer/common/simdintrin.h +++ b/src/gallium/drivers/swr/rasterizer/common/simdintrin.h @@ -114,6 +114,7 @@ OSALIGNSIMD(union) simdvector #define _simd_round_ps _mm256_round_ps #define _simd_castpd_ps _mm256_castpd_ps #define _simd_broadcast_ps(a) _mm256_broadcast_ps((const __m128*)(a)) +#define _simd_stream_ps _mm256_stream_ps #define _simd_load_sd _mm256_load_sd #define _simd_movemask_pd _mm256_movemask_pd |