mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	nir: rename nir_var_function to nir_var_function_temp	Karol Herbst	2019-01-19	1	-1/+1
\| \| \| \| \| \| \| \|	Signed-off-by: Karol Herbst <[email protected]> Acked-by: Jason Ekstrand <[email protected]> Reviewed-by: Eric Anholt <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	freedreno: make cmdstream bo's read-only to GPU	Rob Clark	2019-01-10	2	-5/+11
\| \| \| \| \| \| \| \| \| \| \| \| \|	If nothing else, this will make problems with cmdstream getting blit over with pixels easier to track down (ie. faults when it first happens rather than strange failures later from corrupted cmdstream when a stateobj is later reused). (NOTE this somewhat depends on the kernel supporting the flag, and the iommu implementation. But the worst case is just that the cmdstream ends up writeable as before.) Signed-off-by: Rob Clark <[email protected]>
*	freedreno: Move register constant files to src/freedreno.	Bas Nieuwenhuizen	2019-01-08	10	-2/+22471
\| \| \| \| \| \| \| \|	This way they can be shared. Build tested with meson, but not too sure on the autotools stuff though. Reviewed-by: Dylan Baker <[email protected]> Acked-by: Rob Clark <[email protected]>
*	freedreno/drm: sync uapi again	Chia-I Wu	2019-01-08	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	"pad" was missing in Mesa's msm_drm.h. sizeof(drm_msm_gem_info) remains the same, but now the compiler initializes the field to zero. Buffer allocation results in EINVAL without this for me. Cc: Rob Clark <[email protected]> Cc: Kristian Høgsberg <[email protected]> Signed-off-by: Chia-I Wu <[email protected]> Reviewed-by: Kristian H. Kristensen <[email protected]>
*	nir: rename global/local to private/function memory	Karol Herbst	2019-01-08	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	the naming is a bit confusing no matter how you look at it. Within SPIR-V "global" memory is memory accessible from all threads. glsl "global" memory normally refers to shader thread private memory declared at global scope. As we already use "shared" for memory shared across all thrads of a work group the solution where everybody could be happy with is to rename "global" to "private" and use "global" later for memory usually stored within system accessible memory (be it VRAM or system RAM if keeping SVM in mind). glsl "local" memory is memory only accessible within a function, while SPIR-V "local" memory is memory accessible within the same workgroup. v2: rename local to function as well v3: rename vtn_variable_mode_local as well Signed-off-by: Karol Herbst <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	freedreno/ir3: don't treat all inputs/outputs as vec4	Rob Clark	2018-12-22	2	-14/+38
\| \| \| \| \| \| \| \| \|	This was a hold-over from the early TGSI days, and mostly not needed with NIR. This avoids burning an entire 4 consecutive scalar regs for vec3 outputs, for example. Which fixes a few places that we were doing worse that we should on register usage. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: fix fallout of extra assert	Rob Clark	2018-12-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixes the following crash that happened after d6110d4d The problem happens if we first compile a "vanilla" shader with nothing lowered in NIR, which perform the final lowering passes on so->shader-> nir (including nir_lower_locals_to_regs()), and then later we have compile a shader with some lowering. The second time through we would have already done nir_lower_locals_to_regs(). Arguably this was already a bug, just one we hadn't noticed yet. Fixes: d6110d4d547 intel/compiler: move nir_lower_bool_to_int32 before nir_lower_locals_to_regs Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: Handle GL_NONE in get_num_components_for_glformat()	Eduardo Lima Mitev	2018-12-19	1	-3/+8
\| \| \| \| \| \| \| \| \| \|	An earlier patch that introduced the function failed to handle the case where an image format layout qualifier is not specified, which is allowed on desktop GL profiles. In these cases, nir_variable's image format is GL_NONE, and we don't need to print a debug message for those. Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	freedreno/ir3: Make imageStore use num components from image format	Eduardo Lima Mitev	2018-12-18	1	-2/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	emit_intrinsic_store_image() is always using 4 components when collecting registers for the value. When image has less than 4 components (e.g, r32f, rg32i, etc) this results in extra mov instructions. This patch uses the actual number of components from the image format. For example, in a shader like: layout (r32f, binding=0) writeonly uniform imageBuffer u_image; ... void main(void) { ... imageStore (u_image, some_offset, vec4(1.0)); ... } instruction count is reduced in at least 3 instructions (note image format is r32f, 1 component only). This obviously reduces register pressure as well. v2: - Added support for image formats from NV_image_format extension (Ilia Mirkin). - Return 4 components by default instead of asserting. (Rob Clark). v3: Added more missing formats (Ilia Mirkin). v4: Added a debug message for unknown image formats (Rob Clark). Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>
*	nir/opt_peephole_select: Don't peephole_select expensive math instructions	Ian Romanick	2018-12-17	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On some GPUs, especially older Intel GPUs, some math instructions are very expensive. On those architectures, don't reduce flow control to a csel if one of the branches contains one of these expensive math instructions. This prevents a bunch of cycle count regressions on pre-Gen6 platforms with a later patch (intel/compiler: More peephole select for pre-Gen6). v2: Remove stray #if block. Noticed by Thomas. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Thomas Helland <[email protected]> Reviewed-by: Lionel Landwerlin <[email protected]>
*	nir/opt_peephole_select: Don't try to remove flow control around indirect loads	Ian Romanick	2018-12-17	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	That flow control may be trying to avoid invalid loads. On at least some platforms, those loads can also be expensive. No shader-db changes on any Intel platform (even with the later patch "intel/compiler: More peephole select"). v2: Add a 'indirect_load_ok' flag to nir_opt_peephole_select. Suggested by Rob. See also the big comment in src/intel/compiler/brw_nir.c. v3: Use nir_deref_instr_has_indirect instead of deref_has_indirect (from nir_lower_io_arrays_to_elements.c). v4: Fix inverted condition in brw_nir.c. Noticed by Lionel. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Lionel Landwerlin <[email protected]>
*	nir: Add a bool to int32 lowering pass	Jason Ekstrand	2018-12-16	1	-0/+1
\| \| \| \| \| \| \| \|	We also enable it in all of the NIR drivers. Reviewed-by: Eric Anholt <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]> Tested-by: Bas Nieuwenhuizen <[email protected]>
*	nir: Rename Boolean-related opcodes to include 32 in the name	Jason Ekstrand	2018-12-16	1	-11/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a squash of a bunch of individual changes: nir/builder: Generate 32-bit bool opcodes transparently nir/algebraic: Remap Boolean opcodes to the 32-bit variant Use 32-bit opcodes in the NIR producers and optimizations Generated with a little hand-editing and the following sed commands: sed -i 's/nir_op_ball_fequal/nir_op_b32all_fequal/g' */.c sed -i 's/nir_op_bany_fnequal/nir_op_b32any_fnequal/g' */.c sed -i 's/nir_op_ball_iequal/nir_op_b32all_iequal/g' */.c sed -i 's/nir_op_bany_inequal/nir_op_b32any_inequal/g' */.c sed -i 's/nir_op_$[fiu]lt$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]ge$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]ne$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]eq$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fi]$ne32g/nir_op_\1neg/g' */.c sed -i 's/nir_op_bcsel/nir_op_b32csel/g' */.c Use 32-bit opcodes in the NIR back-ends Generated with a little hand-editing and the following sed commands: sed -i 's/nir_op_ball_fequal/nir_op_b32all_fequal/g' */.c sed -i 's/nir_op_bany_fnequal/nir_op_b32any_fnequal/g' */.c sed -i 's/nir_op_ball_iequal/nir_op_b32all_iequal/g' */.c sed -i 's/nir_op_bany_inequal/nir_op_b32any_inequal/g' */.c sed -i 's/nir_op_$[fiu]lt$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]ge$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]ne$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fiu]eq$/nir_op_\132/g' */.c sed -i 's/nir_op_$[fi]$ne32g/nir_op_\1neg/g' */.c sed -i 's/nir_op_bcsel/nir_op_b32csel/g' */.c Reviewed-by: Eric Anholt <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]> Tested-by: Bas Nieuwenhuizen <[email protected]>
*	freedreno/ir3: don't remove unused input components	Rob Clark	2018-12-13	1	-1/+7
\| \| \| \| \|	Fixes: 0d240c22141 freedreno/ir3: don't fetch unused tex components Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: fix crash	Rob Clark	2018-12-13	1	-14/+8
\| \| \| \| \| \| \|	Fixes a crash in dEQP-GLES3.functional.shaders.fragdepth.compare.fragcoord_z Fixes: 0d240c22141 freedreno/ir3: don't fetch unused tex components Signed-off-by: Rob Clark <[email protected]>
*	freedreno: debug GEM obj names	Rob Clark	2018-12-13	5	-4/+72
\| \| \| \| \| \| \|	With a recent enough kernel, set debug names for GEM BOs, which will show up in $debugfs/gem Signed-off-by: Rob Clark <[email protected]>
*	freedreno/drm: sync uapi and enable softpin	Rob Clark	2018-12-13	6	-25/+30
\| \| \| \| \| \| \| \| \|	Pull in updated UAPI and use kernel API version to enable softpin. Since MSM_SUBMIT_BO_DUMP flag was added at same time, use that to signal to kernel that cmdstream buffers are useful to dump for debugging/cmdstream-traces. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: Add .dir-locals to the common directory	Neil Roberts	2018-12-11	1	-0/+8
\| \| \| \| \| \| \| \|	The commit aa0fed10d35 moved a bunch of Freedreno code to a common directory. The previous directory had a .dir-locals file for Emacs. This patch copies it to the new directory as well. Reviewed-by: Kristian H. Kristensen <[email protected]>
*	freedreno: Fix the Makefile.am fix	Kristian H. Kristensen	2018-12-10	1	-3/+0
\| \| \| \| \| \| \| \| \| \| \| \|	Commit b028ce29f090938d12b0999fe4b0e712d2adc431 fixed a typo in src/freedreno/Makefile.am, but ended up breaking the build for freedreno. The typo inadvertently made things work, as we were not supposed to link with libnir or libmesautil to begin with. Those come in through libmesagallium and the typo prevented the duplicated linkage. Fixes: b028ce29f ("freedreno: add the missing _la in libfreedreno_ir3_la") Cc: Emil Velikov <[email protected]>
*	freedreno: add the missing _la in libfreedreno_ir3_la	Emil Velikov	2018-12-10	1	-1/+1
\| \| \| \| \|	Fixes: aa0fed10d35 ("freedreno: move ir3 to common location") Signed-off-by: Emil Velikov <[email protected]>
*	freedreno: drop duplicate MKDIR_GEN declaration	Emil Velikov	2018-12-10	1	-1/+0
\| \| \| \| \|	Fixes: aa0fed10d35 ("freedreno: move ir3 to common location") Signed-off-by: Emil Velikov <[email protected]>
*	freedreno/drm: fix memory leak	Rob Clark	2018-12-07	1	-0/+3
\| \| \| \| \| \| \| \|	Fix an emberrasing memory leak with the non-softpin submit/rb implementation. Fixes: f3cc0d27475 freedreno: import libdrm_freedreno + redesign submit Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: track max flow control depth for a5xx/a6xx	Rob Clark	2018-12-07	3	-0/+33
\| \| \| \| \| \|	Rather than just hard-coding BRANCHSTACK size. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: code-motion	Rob Clark	2018-12-07	6	-838/+942
\| \| \| \| \| \| \|	Split up ir3_compiler_nir.c a bit before starting to add new stuff for a6xx SSBO/image instructions. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: sync instr/disasm	Rob Clark	2018-12-07	3	-24/+131
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: don't fetch unused tex components	Rob Clark	2018-12-07	2	-0/+29
\| \| \| \| \| \| \|	Detect when a component of an (for example) texture fetch is unused and propagate the updated wrmask back to the parent instruction. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/drm: fix relocs in nested stateobjs	Rob Clark	2018-12-07	1	-3/+15
\| \| \| \| \| \| \| \|	If we have an reloc from stateobjA to stateobjB, we would previously leave stateobjB's bos out of the submit's bos table. Handle this case by copying into stateobjA's reloc_bos table. Signed-off-by: Rob Clark <[email protected]>
*	nir: Make boolean conversions sized just like the others	Jason Ekstrand	2018-12-05	1	-4/+7
\| \| \| \| \| \| \| \| \|	Instead of a single i2b and b2i, we now have i2b32 and b2iN where N is one if 8, 16, 32, or 64. This leads to having a few more opcodes but now everything is consistent and booleans aren't a weird special case anymore. Reviewed-by: Connor Abbott <[email protected]>
*	freedreno: use MSM_BO_SCANOUT with scanout buffers	Jonathan Marek	2018-11-27	2	-0/+4
\| \| \| \|	Signed-off-by: Jonathan Marek <[email protected]>
*	freedreno: move ir3 to common location	Rob Clark	2018-11-27	24	-1/+13731
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Move (most of) the ir3 compiler to src/freedreno/ir3 so that it can be re-used by some future vulkan driver. The parts that are gallium specific have been refactored out and remain in the gallium driver. Getting the move done now so that it can happen before further refactoring to support a6xx specific instructions. NOTE also removes ir3_cmdline compiler tool from autotools build since that was easier than fixing it and I normally use meson build. Waiting patiently for the day that we can remove everything from the autotools build. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: move drm to common location	Rob Clark	2018-11-27	19	-0/+3826
	So that we can re-use at least parts of it for vulkan driver, and so that we can move ir3 to a common location (which uses fd_bo to allocate storage for shaders) Signed-off-by: Rob Clark <[email protected]>