mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	softpipe: fix a warning due to an incorrect enum comparison	Marek Olšák	2016-04-22	1	-1/+1
\| \| \| \| \| \|	no change in behavior, because both are defined the same Acked-by: Jose Fonseca <[email protected]>
*	gallium: remove helpers converting to/from TGSI_PROCESSOR_*	Marek Olšák	2016-04-22	4	-37/+3
\| \| \| \|	Acked-by: Jose Fonseca <[email protected]>
*	gallium: use PIPE_SHADER_* everywhere, remove TGSI_PROCESSOR_*	Marek Olšák	2016-04-22	43	-314/+305
\| \| \| \|	Acked-by: Jose Fonseca <[email protected]>
*	gallium: merge PIPE_SWIZZLE_* and UTIL_FORMAT_SWIZZLE_*	Marek Olšák	2016-04-22	60	-449/+439
\| \| \| \| \| \| \| \|	Use PIPE_SWIZZLE_* everywhere. Use X/Y/Z/W/0/1 instead of RED, GREEN, BLUE, ALPHA, ZERO, ONE. The new enum is called pipe_swizzle. Acked-by: Jose Fonseca <[email protected]>
*	gallium: use enums in p_shader_tokens.h (v2)	Marek Olšák	2016-04-22	1	-139/+164
\| \| \| \| \| \| \| \|	Reviewed-by: Edward O'Callaghan <[email protected]> (v1) Reviewed-by: Roland Scheidegger <[email protected]> (v1) Acked-by: Jose Fonseca <[email protected]> (v1) v2: name enums
*	gallium: use enums in p_defines.h (v2)	Marek Olšák	2016-04-22	1	-173/+205
\| \| \| \| \| \| \| \| \| \|	and remove number assignments which are consecutive Reviewed-by: Edward O'Callaghan <[email protected]> (v1) Reviewed-by: Roland Scheidegger <[email protected]> (v1) Acked-by: Jose Fonseca <[email protected]> (v1) v2: name enums
*	radeonsi: remove the shader parameter from si_set_ring_buffer	Marek Olšák	2016-04-22	3	-15/+11
\| \| \| \| \| \| \| \|	not used anymore this is a follow-up to the RW buffer cleanup. Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radeonsi: decrease GS copy shader user SGPRs to 2	Marek Olšák	2016-04-22	2	-3/+3
\| \| \| \| \| \| \| \|	const buffers are no longer used since the clip plane const buffer was moved to RW buffers Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: shorten slot masks to 32 bits	Marek Olšák	2016-04-22	4	-63/+61
\| \| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: clean up shader resource limit definitions	Marek Olšák	2016-04-22	3	-23/+12
\| \| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: move default tess level constant buffer to RW buffers	Marek Olšák	2016-04-22	5	-10/+35
\| \| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: move sample positions constant buffer to RW buffers	Marek Olšák	2016-04-22	3	-4/+5
\| \| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: move clip plane constant buffer to RW buffers	Marek Olšák	2016-04-22	4	-14/+12
\| \| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: rework polygon stippling to use constant buffer instead of texture	Marek Olšák	2016-04-22	6	-101/+55
\| \| \| \| \| \| \| \| \|	add it to the RW_BUFFERS descriptor array now the slot masks don't have to have 64 bits Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: generalize si_set_constant_buffer	Marek Olšák	2016-04-22	1	-10/+17
\| \| \| \| \| \| \|	this will be used in the next commit Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: make RW buffer descriptor array global, not per shader stage	Marek Olšák	2016-04-22	2	-51/+43
\| \| \| \| \| \| \|	v2: also simplify invalidation of RW buffer bindings (squashed) Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: rename and rearrange RW buffer slots	Marek Olšák	2016-04-22	4	-30/+39
\| \| \| \| \| \| \| \| \|	- use an enum - use a unique slot number regardless of the shader stage (the per-stage slots will go away for RW buffers) Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallivm: fix bogus argument order to lp_build_sample_mipmap function	Roland Scheidegger	2016-04-21	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Screwed up since 0753b135f6e83b171d8a1b08aea967374f3542bc. (Only an issue with different min/mag filters, and then only in some cases, which is probably why it went unnoticed for quite a while. The effect should have simply been nearest mip filter instead of linear, iff min was nearest, mag was linear, and all pixels hit the mignifying path.) Fixes a bunch of dEQP failures. Reviewed-by: Jose Fonseca <[email protected]> Cc: "11.1 11.2" <[email protected]>
*	radeonsi: Add config parameter to si_shader_apply_scratch_relocs.	Bas Nieuwenhuizen	2016-04-21	4	-3/+5
\| \| \| \| \| \| \|	shader->config is not updated for compute kernels. Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Tom Stellard <[email protected]>
*	swr: add PIPE_CAP_FRAMEBUFFER_NO_ATTACHMENT to get_param	Tim Rowley	2016-04-21	1	-0/+1
\| \| \| \|	Reviewed-by: Ilia Mirkin <[email protected]>
*	st/dri: add 32-bit RGBX/RGBA formats	Rob Herring	2016-04-21	2	-0/+10
\| \| \| \| \| \| \| \|	Add support for 32-bit RGBX/RGBA formats which are preferred for Android. Signed-off-by: Rob Herring <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	gallium/radeon: Silence possibly uninitialized variable warning.	Bas Nieuwenhuizen	2016-04-21	1	-1/+1
\| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	winsys/amdgpu: Silence possibly uninitialized variable warning.	Bas Nieuwenhuizen	2016-04-21	1	-0/+3
\| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: Enable loading into CE RAM.	Bas Nieuwenhuizen	2016-04-21	3	-0/+14
\| \| \| \| \| \| \| \| \| \|	We need to enable a bit in the CONTEXT_CONTROL packet for the loads to work. v2: Style issues. Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: Use defines for CONTEXT_CONTROL instead of magic values.	Bas Nieuwenhuizen	2016-04-21	2	-2/+5
\| \| \| \| \| \| \| \|	v2: Use field names provided by Nicolai. v3: Updated to use CONTEXT_CONTROL prefix. Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	winsys/amdgpu: fix preamble IB size	Thomas Hindoe Paaboel Andersen	2016-04-21	1	-0/+1
\| \| \| \| \| \| \| \| \|	The missing break caused the IB size to be overwritten with the size of IB_CONST. This was introduced in: 7201230582e060aa2eb79c825d3188b437ef7bb8 Signed-off-by: Marek Olšák <[email protected]>
*	gk110/ir: make use of IMUL32I for all immediates	Samuel Pitoiset	2016-04-20	1	-1/+1
\| \| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: "11.1 11.2" <[email protected]>
*	gk110/ir: do not overwrite def value with zero for EXCH ops	Samuel Pitoiset	2016-04-20	1	-6/+15
\| \| \| \| \| \| \| \| \| \|	This is only valid for other atomic operations (including CAS). This fixes an invalid opcode error from dmesg. While we are it, make sure to initialize global addr to 0 for other atomic operations. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: "11.1 11.2" <[email protected]>
*	nir: rename nir_foreach_block() to nir_foreach_block_call()	Connor Abbott	2016-04-20	5	-5/+5
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	nvc0: avoid tex read fault from compute shaders on GK110	Samuel Pitoiset	2016-04-20	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	After some investigation, it seems like that disabling the UNK02C4 command avoid a read fault with texelFetch() from a compute shader. I have no clue on what this method actually does, but this avoid the GPU to hang with basic-texelFetch.shader_test without introducing any compute-related regressions. Signed-off-by: Samuel Pitoiset <[email protected]> Acked-by: Ilia Mirkin <[email protected]>
*	swr: fix resource backed constant buffers	Tim Rowley	2016-04-20	2	-7/+7
\| \| \| \| \| \| \| \| \| \|	Code was using an incorrect address for the base pointer. v2: use swr_resource_data() utility function. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94979 Reviewed-by: Bruce Cherniak <[email protected]> Tested-by: Markus Wick <[email protected]>
*	nouveau: codegen: Add support for OpenCL global memory buffers	Hans de Goede	2016-04-20	1	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for OpenCL global memory buffers, note this has only been tested with regular load and stores and likely needs more work for e.g. atomic ops. Tested with piglet on a gf119 and a gk107: ./piglit run -o shader -t '.arb_shader_storage_buffer_object.' results/shader [9/9] pass: 9 / ./piglit run -o shader -t '.arb_compute_shader.' results/shader [20/20] skip: 4, pass: 16 \| Signed-off-by: Hans de Goede <[email protected]> Reviewed-by: Samuel Pitoiset <[email protected]>
*	nouveau: codegen: Use FILE_MEMORY_BUFFER for buffers	Hans de Goede	2016-04-20	6	-5/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Some of the lowering steps we currently do for FILE_MEMORY_GLOBAL only apply to buffers, making it impossible to use FILE_MEMORY_GLOBAL for OpenCL global buffers. This commits changes the buffer code to use FILE_MEMORY_BUFFER at the ir_from_tgsi and lowering steps, freeing use of FILE_MEMORY_GLOBAL for use with OpenCL global buffers. Note that after lowering buffer accesses use the FILE_MEMORY_GLOBAL register file. Tested with piglet on a gf119 and a gk107: ./piglit run -o shader -t '.arb_shader_storage_buffer_object.' results/shader [9/9] pass: 9 / ./piglit run -o shader -t '.arb_compute_shader.' results/shader [20/20] skip: 4, pass: 16 \| Signed-off-by: Hans de Goede <[email protected]> Reviewed-by: Samuel Pitoiset <[email protected]>
*	st/dri: implement the GL interop DRI extension (v2.2)	Marek Olšák	2016-04-20	1	-0/+258
\| \| \| \| \| \| \| \| \| \|	v2: - set interop_version - simplify the offset_after macro v2.1: - use version numbers, remove offset_after - set "out_driver_data_written" v2.2: - set buf_offset & buf_size for GL_ARRAY_BUFFER too - add whandle.offset to buf_offset - disable the minmax cache for GL_TEXTURE_BUFFER
*	st/dri: Fix RGB565 EGLImage creation	Nicolas Dufresne	2016-04-20	1	-20/+24
\| \| \| \| \| \| \| \| \| \|	When creating egl images we do a bytes to pixel conversion by deviding by 4 regardless of the pixel format. This does not work for RGB565. In this patch, we avoid useless conversion and use proper API when the conversion cannot be avoided. Signed-off-by: Nicolas Dufresne <[email protected]> Reviewed-by: Michel Dänzer <[email protected]>
*	st/dri: Factor out DRI2 to PIPE_FORMAT conversion	Nicolas Dufresne	2016-04-20	1	-34/+27
\| \| \| \| \| \| \| \|	This code is already duplicated twice and will be useful again. This will also help when adding formats. Signed-off-by: Nicolas Dufresne <[email protected]> Reviewed-by: Michel Dänzer <[email protected]>
*	freedreno/a4xx: lower srgb in shader for astc textures	Rob Clark	2016-04-19	7	-6/+62
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This seems like a hw bug, and maybe only applies to certain a4xx variants/revisions. But setting the SRGB bit in sampler view state (texconst0) causes invalid alpha for ASTC textures. Work around this by doing the srgb->linear conversion in the shader instead. This fixes 392 dEQP tests: dEQP-GLES3.functional.texture.astcsrgb* (The remaining fails seem to be a bug w/ ASTC + linear filtering, also possibly a420.0 specific.) Signed-off-by: Rob Clark <[email protected]>
*	freedreno: cleanup fd_set_sampler_views	Rob Clark	2016-04-19	1	-37/+24
\| \| \| \| \| \| \|	The separate FS/VS entrypoints are no longer used since a3ed98f. So just inline them. Signed-off-by: Rob Clark <[email protected]>
*	tgsi/lowering: improved lowering for LRP	Russell King	2016-04-19	1	-35/+20
\| \| \| \| \| \| \| \| \| \| \|	Provide an improved lowering for LRP, which can be implemented in two MAD instructions with a bit of rearranging of the equation, rather than the literal implementation of two multiplies, an add and a subtract. Signed-off-by: Russell King <[email protected]> Reviewed-by: Rob Clark <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	tgsi/lowering: improved lowering for XPD	Russell King	2016-04-19	1	-22/+13
\| \| \| \| \| \| \| \| \|	Improve XPD lowering to consume less instructions by using the MAD instruction to perform the multiply and subtraction together. Signed-off-by: Russell King <[email protected]> Reviewed-by: Rob Clark <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	tgsi/lowering: add support for lowering TRUNC	Russell King	2016-04-19	2	-0/+85
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for lowering TRUNC using the following sequence: FRC tmpA, \|src\| SUB tmpA, \|src\|, tmpA CMP dst, -tmpA, tmpA Note that this is incompatible with FRC lowering. Signed-off-by: Russell King <[email protected]> Reviewed-by: Rob Clark <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	tgsi/lowering: add support for lowering FLR and CEIL	Russell King	2016-04-19	2	-20/+149
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for lowering FLR and CEIL to FRC/SUB and FRC/ADD instructions for GPUs that support FRC but not FLR or CEIL. Since these uses FRC, it is invalid to ask for FLR or CEIL to be lowered along with FRC, so add an assert to catch this invalid configuration. We also need to deal with FLR instructions emitted by the lowering code. Fix these up with the FRC+SUB equivalent when FLR lowering is enabled. Signed-off-by: Russell King <[email protected]> Reviewed-by: Rob Clark <[email protected]> Reviewed-by: Christian Gmeiner <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	radeonsi: enable TGSI support cap for compute shaders	Bas Nieuwenhuizen	2016-04-19	2	-7/+30
\| \| \| \| \| \| \| \| \| \| \| \|	v2: Use chip_class instead of family. v3: Check kernel version for SI. v4: Preemptively allow amdgpu winsys for SI. Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: Consider input SGPR count for compute shader SGPR count.	Bas Nieuwenhuizen	2016-04-19	2	-6/+13
\| \| \| \| \| \| \| \|	si_shader_create corrects the SGPR count with si_fix_num_sgprs. We then recompute the rsrc1 register to use the new SGPR count. Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: Add CE synchronization for compute dispatches.	Bas Nieuwenhuizen	2016-04-19	3	-2/+8
\| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: clean up compute flush	Bas Nieuwenhuizen	2016-04-19	2	-18/+8
\| \| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: do not do two full flushes on every compute dispatch	Bas Nieuwenhuizen	2016-04-19	5	-22/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	v2: Add more CS_PARTIAL_FLUSH events. Essentially every place with waits on finishing for pixel shaders also has a write after read hazard with compute shaders. Invalidating L2 waits implicitly on pixel and compute shaders, so, we don't need a CS_PARTIAL_FLUSH for switching FBO. v3: Add CS_PARTIAL_FLUSH events even if we already have INV_GLOBAL_L2. According to Marek the INV_GLOBAL_L2 events don't wait for compute shaders to finish, so wait for them explicitly. Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]> Reviewed-by: Edward O'Callaghan <[email protected]>
*	radeonsi: split setting graphics and compute descriptors	Bas Nieuwenhuizen	2016-04-19	4	-14/+59
\| \| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: split texture decompression for compute shaders	Bas Nieuwenhuizen	2016-04-19	4	-4/+16
\| \| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: update predicate condition for compute dispatches	Bas Nieuwenhuizen	2016-04-19	2	-0/+15
\| \| \| \| \| \| \|	Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]> Reviewed-by: Edward O'Callaghan <[email protected]>