mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	vc4: Enable LIT lowering in TGSI instead of our own code.	Eric Anholt	2014-10-08	1	-35/+1
\| \| \| \|	This brings us the -128/128 clamping on the w component.
*	vc4: Fix scalar math opcodes to replicate their result from the X channel.	Eric Anholt	2014-10-08	1	-4/+16
\| \| \| \| \|	Thanks to robclark for pointing out that I was probably failing to do this when I reported a "bug" in his lowering code.
*	ilo: fix rectlist on GEN7+	Chia-I Wu	2014-10-09	1	-0/+3
\| \| \| \| \| \|	It was broken by 343b014b57ecc5431477e090100e6a26edbda540. Signed-off-by: Chia-I Wu <[email protected]>
*	vc4: Add support for two-sided color.	Eric Anholt	2014-10-08	2	-18/+51
\| \| \| \| \| \| \| \| \| \|	It's fairly easy, thanks to Rob Clark's lowering code. Fixes two-sided-lighting and 4 vertex-program-two-side testcases, while regressing 8 testcases that involve enabling two-sided color while only initializing one of the two colors in the VS. If you're enabling two sided color, it's of course expected that you really do set up both colors, so this is still an improvement (and when we set up a linker for TGSI, we'll hopefully fix those 8 fails).
*	vc4: Enable POW lowering in TGSI instead of our own code.	Eric Anholt	2014-10-08	1	-11/+1
\|
*	vc4: Enable DP lowering in TGSI instead of our own code.	Eric Anholt	2014-10-08	1	-41/+3
\|
*	vc4: Start using tgsi_lowering for opcodes we haven't supported before.	Eric Anholt	2014-10-08	1	-1/+15
\|
*	vc4: Set unused raddr fields to QPU_R_NOP.	Eric Anholt	2014-10-08	1	-16/+27
\| \| \| \| \| \| \|	The simulator assertion fails if you have a write to a reg and then a read (for example, in the NOP side of an instruction), even if the read isn't used for anything. By setting unused raddrs to NOP, we avoid the problem (since only the phsyical registers are tracked).
*	vc4: Abstract out the field-merging logic for instructions.	Eric Anholt	2014-10-08	1	-11/+17
\| \| \| \|	I'm going to be doing the same logic for some more fields next.
*	r600: Use DMA transfers in r600_copy_global_buffer	Niels Ole Salscheider	2014-10-07	2	-17/+43
\| \| \| \| \| \|	v2: Do not demote items that are already in the pool Signed-off-by: Niels Ole Salscheider <[email protected]>
*	radeonsi: Use dummy pixel shader if compilation of the real shader failed	Michel Dänzer	2014-10-07	3	-7/+22
\| \| \| \| \| \| \|	Instead of crashing. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=79155#c5 Reviewed-by: Marek Olšák <[email protected]>
*	ilo: let shaders determine surface counts	Chia-I Wu	2014-10-06	9	-202/+267
\| \| \| \| \| \| \| \|	When a shader needs N surfaces, we should upload N surfaces and not depend on how many are bound. This commit is larger than it should be because we did not export how many surfaces a surface uses before. Signed-off-by: Chia-I Wu <[email protected]>
*	ilo: let shaders determine sampler counts	Chia-I Wu	2014-10-04	13	-87/+98
\| \| \| \| \| \| \|	When a shader needs N samplers, we should upload N samplers and not depend on how many are bound. Signed-off-by: Chia-I Wu <[email protected]>
*	tgsi: change tgsi_shader_info::properties to a one-dimensional array	Marek Olšák	2014-10-04	8	-15/+15
\| \| \| \| \| \|	Reviewed-by: Roland Scheidegger <[email protected]> v2: fix svga too
*	radeonsi: set number of userdata SGPRs of GS copy shader to 4	Marek Olšák	2014-10-04	3	-10/+23
\| \| \| \| \| \| \|	It only needs the constant buffer with clip planes and read-write resources for the GS->VS ring and streamout. That's 2 pointers. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: pass the GS shader directly to si_generate_gs_copy_shader	Marek Olšák	2014-10-04	1	-3/+3
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: set LLVMByValAttribute for all descriptor arrays	Marek Olšák	2014-10-04	1	-10/+7
\| \| \| \| \| \|	I hope this is correct. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: make the vertex shader key smaller	Marek Olšák	2014-10-04	1	-1/+2
\| \| \| \| \| \|	We only support 16 vertex attribs, not 32. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: don't flush shader caches when building PM4 shader states	Marek Olšák	2014-10-04	1	-8/+0
\| \| \| \| \| \| \| \| \|	This is a wrong place to flush caches to say the least. I don't think we need to flush the instruction caches if we don't patch shaders with DMA. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: remove interp_at_sample from the key, use TGSI_INTERPOLATE_LOC_SAMPLE	Marek Olšák	2014-10-04	3	-5/+2
\| \| \| \| \| \| \| \| \|	st/mesa has the same flag in its shader key, we don't need to do it in the driver anymore. Instead, use TGSI_INTERPOLATE_LOC_SAMPLE, which is what st/mesa sets. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: move geometry shader properties from si_shader to si_shader_selector	Marek Olšák	2014-10-04	4	-29/+38
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: always compile shaders on demand	Marek Olšák	2014-10-04	1	-13/+3
\| \| \| \| \| \| \|	The first compiled shader is sometimes useless, because the key doesn't match the key for the draw call where it's used. Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: remove unused variable si_shader::gs_input_prim	Marek Olšák	2014-10-04	2	-3/+0
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	tgsi: remove some not so useful variables from tgsi_shader_info	Marek Olšák	2014-10-04	4	-9/+14
\|
*	radeonsi: get fs_write_all from tgsi_shader_info directly	Marek Olšák	2014-10-04	3	-16/+3
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	tgsi: simplify shader properties in tgsi_shader_info	Marek Olšák	2014-10-04	4	-68/+27
\| \| \| \|	Use an array of properties indexed by TGSI_PROPERTY_* definitions.
*	radeonsi: get tgsi_shader_info only once before compilation	Marek Olšák	2014-10-04	3	-21/+16
\| \| \| \|	Reviewed-by: Michel Dänzer <[email protected]>
*	radeonsi: fix CS tracing and remove excessive CS dumping	Marek Olšák	2014-10-04	3	-35/+25
\|
*	gk110/ir: add dnz flag emission for fmul/fmad	Ilia Mirkin	2014-10-03	1	-0/+4
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Cc: "10.2 10.3" <[email protected]>
*	gm107/ir: add dnz emission for fmul	Ilia Mirkin	2014-10-03	1	-1/+1
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Cc: "10.3" <[email protected]>
*	freedreno: query fixes	Rob Clark	2014-10-03	3	-8/+13
\| \| \| \| \| \| \|	Fixes a few issues, including a potential empty-IB (which triggers gpu hangs in piglit occlusion_query_meta_no_fragments) Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: handle VS only outputting BCOLOR	Rob Clark	2014-10-03	1	-2/+10
\| \| \| \| \| \| \|	Possibly we should map the front color to black (zeroes). But not sure there is a way to do that without generating a shader variant. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: fix lockups with lame FRAG shaders	Rob Clark	2014-10-03	4	-6/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Shaders like: FRAG PROPERTY FS_COLOR0_WRITES_ALL_CBUFS 1 DCL IN[0], GENERIC[0], PERSPECTIVE DCL OUT[0], COLOR DCL SAMP[0] DCL TEMP[0], LOCAL IMM[0] FLT32 { 0.0000, 1.0000, 0.0000, 0.0000} 0: TEX TEMP[0], IN[0].xyyy, SAMP[0], 2D 1: MOV OUT[0], IMM[0].xyxx 2: END cause unhappyness. They have an IN[], but once this is compiled the useless TEX instruction goes away. Leaving a varying that is never fetched, which makes the hw unhappy. In the process fix a signed vs unsigned compare. If the vertex shader has max_reg=-1, MAX2() vs an unsigned would not give the desired result. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add TXF support	Ilia Mirkin	2014-10-02	1	-1/+39
\| \| \| \| \| \| \|	Still failing a bunch of the fairly picky texelFetch tests, but the 1D(Array) ones are full passes. Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: add TXD support and expose ARB_shader_texture_lod	Ilia Mirkin	2014-10-02	3	-9/+56
\| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: add texture offset support	Ilia Mirkin	2014-10-02	1	-4/+45
\| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: shadow comes before array	Ilia Mirkin	2014-10-02	1	-2/+2
\| \| \| \| \| \| \|	Experimentally, this makes *ArrayShadow tex-miplevel-selection tests pass. Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: make TXQ return integers, not floats	Ilia Mirkin	2014-10-02	1	-1/+1
\| \| \| \| \| \|	We're still doing something wrong for array textures. Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: add UMAD support	Ilia Mirkin	2014-10-02	1	-4/+15
\| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: add ISSG support	Ilia Mirkin	2014-10-02	1	-0/+39
\| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: add MOD support	Ilia Mirkin	2014-10-02	1	-8/+12
\| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: add UMOD support, based on UDIV	Ilia Mirkin	2014-10-02	1	-6/+31
\| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]>
*	freedreno/ir3: add IDIV/UDIV support	Ilia Mirkin	2014-10-02	1	-3/+197
\| \| \| \| \| \|	Logic shamelessly copied from nv50 lowering pass. Signed-off-by: Ilia Mirkin <[email protected]>
*	radeonsi: Clear sampler view flags when binding a buffer	Michel Dänzer	2014-10-03	1	-0/+5
\| \| \| \| \| \| \| \| \|	Fixes assertion failure while running the Unreal Engine 4 Elemental demo: .../si_blit.c:322:si_decompress_color_textures: Assertion `tex->cmask.size \|\| tex->fmask.size' failed. Cc: "10.2 10.3" <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	vc4: Add support for framebuffer sRGB encoding.	Eric Anholt	2014-10-02	1	-2/+31
\|
*	vc4: Add support for sampling from sRGB.	Eric Anholt	2014-10-02	2	-9/+51
\| \| \| \| \| \| \| \|	This isn't perfect -- the filtering is happening on the srgb values, and we're decoding afterwards, which is not what you want. I think that's the cause of some additional texwrap(GL_CLAMP, LINEAR) failures, though many other texwrap tests on srgb start to pass since unfiltered values come out correct.
*	freedreno/ir3: avoid fan-in sources referring to same instruction	Ilia Mirkin	2014-10-02	1	-2/+10
\| \| \| \| \| \| \| \| \| \|	Since the RA has to be done s.t. each one gets its own (adjacent) register, it would complicate matters if instructions were allowed to be repeated. This enables copy-propagation use in situations where previously that might have happened. Signed-off-by: Ilia Mirkin <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: emit all immediates in one shot	Rob Clark	2014-10-02	1	-8/+16
\| \| \| \| \| \| \|	Makes the command stream a bit tighter when there are lots of immediates. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: instanced drawing/compute not yet supported	Ilia Mirkin	2014-10-02	1	-3/+3
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: handle large shader program sizes	Rob Clark	2014-10-02	1	-11/+63
\| \| \| \| \| \| \|	Above a certain limit use CACHE mode instead of BUFFER mode. This should solve gpu hangs with large shader programs. Signed-off-by: Rob Clark <[email protected]>