mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	svga: remove unused variable	Brian Paul	2016-07-08	1	-1/+0
\| \| \| \|	Reviewed-by: Charmaine Lee <[email protected]>
*	svga: add dumping for more device commands	Brian Paul	2016-07-08	1	-155/+724
\| \| \| \|	Signed-off-by: Brian Paul <[email protected]>
*	svga: silence a couple unused variable warnings	Brian Paul	2016-07-08	2	-1/+3
\| \| \| \|	Signed-off-by: Brian Paul <[email protected]>
*	svga: rebind using render target surfaces in hw draw state	Charmaine Lee	2016-07-08	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently when we rebind framebuffer resources at the beginning of the command buffer, we use the color buffer surfaces saved in the context hw clear state. But the surfaces could be different from the actual emitted render target surfaces if any of the color buffer surfaces is also used for shader resource, in that case, we create a backed surface for the collided render target surface. So to rebind the framebuffer resources correctly, use the render target surfaces saved in the context hw draw state. Tested with Heaven, Lightsmark2008, MTT piglit, glretrace, conform. Reviewed-by: Brian Paul <[email protected]>
*	svga: invalidate gb surface before it is reused	Charmaine Lee	2016-07-08	4	-9/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With this patch, a guest-backed surface will be invalidated using the SVGA_3D_CMD_INVALIDATE_GB_SURFACE command before the surface is reused. This fixes the updating dirty image error from the device when a surface is reused. v2: Instead of invalidating the surface when it is reused, send the invalidate command before the surface is put into the recycle pool. v3: (1) surface invalidate is a noop operation in Linux winsys, since surface invalidation is not needed for DMA path. (2) Instead of invalidating the surface content in svga_screen_surface_destroy() when a surface is to be destroyed, it is done in svga_screen_cache_flush() when the surface is no longer referenced in a command buffer and is ready to be moved to the unused list. At this point, the surface will be moved to the invalidate list. When the surface invalidation is submitted, the surface will be moved to the unused list. Tested with piglit, glretrace. Reviewed-by: Brian Paul <[email protected]> Reviewed-by: Sinclair Yeh <[email protected]>
*	svga: fix use of provoking vertex control	Brian Paul	2016-07-08	1	-1/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	If the SVGA3D_DEVCAP_DX_PROVOKING_VERTEX query returns false, never define rasterizer state objects with provokingVertexLast set. Despite what the device reports, it may interpret the provokingVertexLast flag anyway. This fixes an issue when using capability clamping. Tested with piglit provoking-vertex and glsl-fs-flat-color tests. VMware bug 1550143. Reviewed-by: <[email protected]>
*	vl: add half pixel to v_tex before adding offsets	Nayan Deshmukh	2016-07-08	1	-0/+2
\| \| \| \| \| \| \| \|	Since pixel center lies at 0.5, add half_pixel to vtex before adding offsets to it. Signed-off-by: Nayan Deshmukh <[email protected]> Reviewed-by: Christian König <[email protected]>
*	nvc0/ir: remove unused resource info loading helpers	Samuel Pitoiset	2016-07-08	2	-28/+0
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0/ir: refactor the surfaces info loading logic	Samuel Pitoiset	2016-07-08	2	-82/+44
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0/ir: move the shift left op inside loadTexHandle()	Samuel Pitoiset	2016-07-08	1	-8/+6
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	radeonsi: disable multi-threading when shader dumps are enabled	Nicolai Hähnle	2016-07-08	1	-0/+1
\| \| \| \| \| \| \|	Otherwise, shader dumps can become interleaved and unusable. Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: use multi-threaded compilation in debug contexts	Nicolai Hähnle	2016-07-08	1	-4/+4
\| \| \| \| \| \| \| \|	We only have to stay single-threaded when debug output must be synchronous. This yields better parallelism in shader-db runs for me. Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	gallium: add async flag to pipe_debug_callback	Nicolai Hähnle	2016-07-08	2	-1/+10
\| \| \| \| \| \| \|	v2: fix typo db -> cb Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: catch a potential state tracker error with non-MSAA FBs	Nicolai Hähnle	2016-07-08	1	-0/+6
\| \| \| \| \| \|	At least st/mesa ensures this, so I'd rather not handle deviations in radeonsi. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: explicitly choose center locations for 1xAA on Polaris	Nicolai Hähnle	2016-07-08	5	-18/+48
\| \| \| \| \| \| \| \| \| \| \| \| \|	Unlike SC, the small primitive filter does not automatically use center locations in 1xAA mode, so this is needed to avoid artifacts caused by the small primitive filter discarding triangles that it shouldn't. As a side effect of how the effective number of samples is now calculated, this patch also avoids submitting the sample locations for line/poly smoothing when they're not really needed. Cc: 12.0 <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	r600g: call cayman_emit_msaa_sample_locs only when needed	Nicolai Hähnle	2016-07-08	1	-1/+2
\| \| \| \| \| \|	In the case of nr_samples <= 1, that function is (currently) a no-op anyway. Reviewed-by: Marek Olšák <[email protected]>
*	osmesa: Export OSMesaCreateContextAttribs.	Mathias Fröhlich	2016-07-07	3	-0/+3
\| \| \| \| \| \| \| \| \| \| \|	Since the function is exported like any other public api function and put in the header as if you could link against it, export it also from shared objects. Signed-off-by: Mathias Fröhlich <[email protected]> Reviewed-by: Brian Paul <[email protected]> Cc: "11.2 12.0" <[email protected]>
*	radeon/llvm: Use alloca instructions for larger arrays	Tom Stellard	2016-07-06	2	-25/+151
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We were storing arrays in vectors, which was leading to some really bad spill code for large arrays. allocas instructions are a better fit for arrays and LLVM optimizations are more geared toward dealing with allocas instead of vectors. For arrays that have 16 or less 32-bit elements, we will continue to use vectors, because this will force LLVM to store them in registers and use indirect registers, which is usually faster for small arrays. In the future we should use allocas for all arrays and teach LLVM how to store allocas in registers. This fixes the piglit test: spec/glsl-1.50/execution/geometry/max-input-component Reviewed-by: Marek Olšák <[email protected]>
*	radeon/llvm: Add helpers for loading and storing data from arrays.	Tom Stellard	2016-07-06	1	-10/+41
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeon/llvm: Remove uses_temp_indirect_addressing() function	Tom Stellard	2016-07-06	1	-23/+1
\| \| \| \| \| \|	bld->indirect_files is never set, so this function always returns false. Reviewed-by: Marek Olšák <[email protected]>
*	gallium: un-inline pipe_surface_desc	Rob Clark	2016-07-06	1	-11/+12
\| \| \| \| \| \| \|	Want to re-use this struct, so un-inline it. Signed-off-by: Rob Clark <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	gallium/util: make util_copy_framebuffer_state(src=NULL) work	Rob Clark	2016-07-06	1	-11/+26
\| \| \| \| \| \| \| \|	Be more consistent with the other u_inlines util_copy_xyz_state() helpers and support NULL src. Signed-off-by: Rob Clark <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	winsys/amdgpu: avoid flushed depth when possible	Nicolai Hähnle	2016-07-06	1	-3/+8
\| \| \| \| \| \| \|	If a depth/stencil texture has no mipmaps, we can always get a layout that is compatible with DB and TC. Reviewed-by: Marek Olšák <[email protected]>
*	gallium/radeon: add depth/stencil_adjusted output to surface computation	Nicolai Hähnle	2016-07-06	3	-2/+14
\| \| \| \| \| \| \| \| \| \| \|	This fixes a rare bug with stencil texturing -- seen on Polaris and Tonga, though it's basically a function of the memory configuration so could affect other parts as well. Fixes piglit "unaligned-blit * stencil downsample" and various "fbo-depth-array stencil" tests. Reviewed-by: Marek Olšák <[email protected]>
*	gallium/radeon: allocate only the required plane for flushed depth	Nicolai Hähnle	2016-07-06	1	-3/+34
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: decompress to flushed depth texture when required	Nicolai Hähnle	2016-07-06	1	-29/+103
\| \| \| \| \| \|	v2: s/dirty_level_mask/stencil_dirty_level_mask/ in stencil case Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: extract DB->CB copy logic into its own function	Nicolai Hähnle	2016-07-06	1	-36/+61
\| \| \| \| \| \|	Also clean up some of the looping. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: sample from flushed depth texture when required	Nicolai Hähnle	2016-07-06	2	-8/+46
\| \| \| \| \| \| \|	Note that this has no effect yet. A case where can_sample_z/s can be false in radeonsi will be added in a later patch. Reviewed-by: Marek Olšák <[email protected]>
*	gallium/radeon: replace is_flushing_texture with db_compatible	Nicolai Hähnle	2016-07-06	9	-19/+24
\| \| \| \| \| \| \| \| \| \| \|	This is a left-over of when I considered generalizing the separate stencil support. I do prefer the new name since it emphasizes what flushing vs. non-flushing means from a functional point-of-view, namely special handling of the texture format. v2: adjust r600_init_color_surface as well Reviewed-by: Marek Olšák <[email protected]>
*	gallium/radeon: add can_sample_z/s flags for textures	Nicolai Hähnle	2016-07-06	5	-24/+34
\| \| \| \| \| \|	v2: adjust r600_init_color_surface as well Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: correctly mark levels of 3D textures as fully decompressed	Nicolai Hähnle	2016-07-06	1	-2/+2
\| \| \| \| \| \|	Account for the fact that max_layer is minified for higher levels. Reviewed-by: Marek Olšák <[email protected]>
*	gallium/radeon/winsyses: remove unused stencil_offset	Nicolai Hähnle	2016-07-06	3	-5/+0
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	gallium/radeon: remove redundant null-pointer check	Nicolai Hähnle	2016-07-06	1	-2/+1
\| \| \| \| \| \|	v2: keep using r600_texture_reference Reviewed-by: Marek Olšák <[email protected]>
*	gallium/radeon: print StencilLayout only once	Nicolai Hähnle	2016-07-06	1	-2/+2
\| \| \| \| \| \|	It is the same for all levels. Reviewed-by: Marek Olšák <[email protected]>
*	gallium/radeon: flush stdout after printing texture information	Nicolai Hähnle	2016-07-06	1	-0/+1
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	svga: avoid emitting redundant DXSetRenderTargets command	Charmaine Lee	2016-07-05	2	-18/+32
\| \| \| \| \| \| \|	Tested with Lightsmark2008, MTT piglit, glretrace, conform. Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	radeon/vce: update encRefPic addr and array mode to tiled	Leo Liu	2016-07-05	1	-0/+1
\| \| \| \| \|	Signed-off-by: Leo Liu <[email protected]> Reviewed-by: Christian König <[email protected]>
*	radeon/vce: increase cpb height alignment	Leo Liu	2016-07-05	1	-1/+1
\| \| \| \| \| \| \| \|	Height should be aligned with 2 macroblocks, thus making safer for tiled mode Signed-off-by: Leo Liu <[email protected]> Reviewed-by: Christian König <[email protected]>
*	swr: automake: don't ship LLVM version specific generated sources	Emil Velikov	2016-07-05	1	-2/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Otherwise things will fail to build, if the builder is using another version of LLVM. v2: annotate all the dependencies of builder_gen.h v3: clean the generated files as needed v4: comment cleanups (Tim) Cc: "12.0" <[email protected]> Tested-by: Tim Rowley <[email protected]> Tested-by: Chuck Atkins <[email protected]> (v2) Reported-by: Chuck Atkins <[email protected]> Signed-off-by: Emil Velikov <[email protected]>
*	clover: conditionally use MESA_GIT_SHA1	Emil Velikov	2016-07-05	2	-2/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Considering how hard/annoying it was for many peoples' workflow to properly generate the macro, it will be demoted to conditionally available with follow-up commits. v2: Kill off gracious blank line (Vedran). Cc: [email protected] Cc: Vedran Miletić <[email protected]> Cc: Francisco Jerez <[email protected]> Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Eric Engestrom <[email protected]> (v1) Reviewed-by: Vedran Miletić <[email protected]>
*	nvc0/ir: rename NVE4_SU_INFO_XXX to NVC0_SU_INFO_XXX	Samuel Pitoiset	2016-07-05	1	-49/+49
\| \| \| \| \| \| \| \|	While we are at it, fix a typo inside the comment which describes what those constants are for. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nvc0/ir: reset the base offset for indirect images accesses	Samuel Pitoiset	2016-07-05	1	-2/+4
\| \| \| \| \| \| \| \| \| \|	In presence of an indirect image access, the base offset should be zeroed because the stride will be computed twice. This is a pretty rare situation but it can happen when tex.r > 0. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: "11.2 12.0" <[email protected]>
*	gm107/ir: fix sign bit emission for FADD32I	Samuel Pitoiset	2016-07-05	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \|	When emitting OP_SUB, the sign bit for FADD and FADD32I is not at the same position. It's at position 45 for FADD but 51 for FADD32I. This fixes the following piglit test: tests/spec/arb_fragment_program/fdo30337b.shader_test Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: <[email protected]>
*	vc4: Regularize instruction emit macros	Eric Anholt	2016-07-04	2	-39/+50
\| \| \| \| \| \|	ALU0 didn't have the _dest variant, and ALU2 didn't unset the def the way ALU1 did. This should make the ALU[012] macros much clearer, by moving most of their contents to vc4_qir.c
*	vc4: Enable dead CF elimination.	Eric Anholt	2016-07-04	1	-0/+1
\| \| \| \| \| \|	Now that we're about to start generating control flow in our NIR, we want this in place. It optimizes things frequently in the CS, when the GL VS has control flow that doesn't affect the vertex position.
*	vc4: Optimize out redundant SF updates.	Eric Anholt	2016-07-04	2	-6/+78
\| \| \| \| \| \| \| \| \| \| \|	Tiny change on shader-db currently, but it will be important when we start emitting a lot of SFs from the same variable as part of control flow support. total instructions in shared programs: 89463 -> 89430 (-0.04%) instructions in affected programs: 1522 -> 1489 (-2.17%) total estimated cycles in shared programs: 250060 -> 250015 (-0.02%) estimated cycles in affected programs: 8568 -> 8523 (-0.53%)
*	vc4: Move SF removal to a separate peephole pass.	Eric Anholt	2016-07-04	5	-17/+85
\| \| \| \| \| \| \| \| \|	The DCE pass is going to change significantly to handle control flow, while we don't really need to change it for the SF handling. We also need to add some more SF peephole optimization for SF updates generated by control flow support. No change on shader-db.
*	vc4: DCE instructions with a NULL destination.	Eric Anholt	2016-07-04	1	-2/+3
\| \| \| \| \| \| \| \|	I'm going to add an optimization for redundant SF update removal, which will just remove the SF and leave us (in many cases) with an instruction with a NULL destination and no side effects. Rather than teaching that pass whether the whole instruction can be removed, leave that responsibility to this pass.
*	vc4: Mark texturing setup instructions as having side effects.	Eric Anholt	2016-07-04	1	-5/+5
\| \| \| \| \| \| \|	We need to not DCE them even though they don't have a destination in QIR. We also shouldn't relocate them in vc4_opt_vpm. Neither of these things happen, but I'm about to make DCE consider instructions with a NULL destination.
*	vc4: Fix a pasteo in scheduling condition flag usage.	Eric Anholt	2016-07-04	1	-1/+1
\| \| \| \| \| \| \|	Noticed by code inspection. This hasn't been too big of a deal, because our cond usages all start out as adder ops, either MOVs or the FTOI for Z writes. MOVs can get converted to mul ops during scheduling, but apparently we hadn't hit this.