mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	gallium/radeon: add a HUD query for PS draw ratio stats from separate DCC	Marek Olšák	2016-06-29	4	-0/+8
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/radeon: add a heuristic enabling DCC for scanout surfaces (v2)	Marek Olšák	2016-06-29	6	-4/+338
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	DCC for displayable surfaces is allocated in a separate buffer and is enabled or disabled based on PS invocations from 2 frames ago (to let queries go idle) and the number of slow clears from the current frame. At least an equivalent of 5 fullscreen draws or slow clears must be done to enable DCC. (PS invocations / (width * height) + num_slow_clears >= 5) Pipeline statistic queries are always active if a color buffer that can have separate DCC is bound, even if separate DCC is disabled. That means the window color buffer is always monitored and DCC is enabled only when the situation is right. The tracking of per-texture queries in r600_common_context is quite ugly, but I don't see a better way. The first fast clear always enables DCC. DCC decompression can disable it. A later fast clear can enable it again. Enable/disable typically happens only once per frame. The impact is expected to be negligible because games usually don't have a high level of overdraw. DCC usually activates when too much blending is happening (smoke rendering) or when testing glClear performance and CMASK isn't supported (Stoney). v2: rename stuff, add assertions Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/radeon: add state setup for a separate DCC buffer	Marek Olšák	2016-06-29	4	-5/+41
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: always calculate DCC info even if it's not used immediately	Marek Olšák	2016-06-29	2	-2/+2
\| \| \| \| \| \|	for a later use Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: unreference framebuffer state with set_framebuffer_state	Marek Olšák	2016-06-29	3	-4/+6
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/radeon: add flag R600_QUERY_HW_FLAG_BEGIN_RESUMES	Marek Olšák	2016-06-29	2	-1/+4
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	i965: Use intel_get_param() more often	Chad Versace	2016-06-29	1	-11/+5
\| \| \| \| \| \| \| \|	Replace some open-coded ioctls with intel_get_param(). This is just a cleanup. No change in behavior. Reviewed-by: Ian Romanick <[email protected]>
*	i965: Refactor intel_get_param()	Chad Versace	2016-06-29	1	-7/+8
\| \| \| \| \| \| \|	Replace the function's __DRIscreen parameter with struct intel_screen. The callsites feel more natural that way. Reviewed-by: Ian Romanick <[email protected]>
*	radeonsi: don't advertise multisample shader images	Marek Olšák	2016-06-29	1	-0/+3
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: enable distributed tess on multi-SE parts only	Marek Olšák	2016-06-29	4	-2/+7
\| \| \| \| \| \| \|	ported from Vulkan Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: set optimal VGT_HS_OFFCHIP_PARAM	Marek Olšák	2016-06-29	5	-14/+49
\| \| \| \| \| \| \|	ported from Vulkan Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: enable CU0 in each SE for LS-HS execution	Marek Olšák	2016-06-29	1	-2/+1
\| \| \| \| \| \| \|	Offchip-only tessellation allows this. Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: use conformant line rasterization	Marek Olšák	2016-06-29	4	-5/+29
\| \| \| \| \| \| \| \| \| \|	AA lines are not completely correct (see TODO), but everything else should be. + 3 linestipple piglits Reviewed-by: Edward O'Callaghan <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	Android: add missing u_math.h include path for libmesa_isl	Rob Herring	2016-06-28	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Commit 87d062a94080 ("i965: Fix shared local memory size for Gen9+.") added u_math.h include which broke the Android build: In file included from external/mesa3d/src/intel/isl/isl_storage_image.c:25: In file included from external/mesa3d/src/mesa/drivers/dri/i965/brw_compiler.h:29: external/mesa3d/src/mesa/main/macros.h:35:10: fatal error: 'util/u_math.h' file not found ^ Add the missing include paths for libmesa_isl. Signed-off-by: Rob Herring <[email protected]> Reviewed-by: Kenneth Garunke <[email protected]>
*	svga: force direct map for transfering multiple slices	Charmaine Lee	2016-06-28	1	-15/+24
\| \| \| \| \| \| \| \| \| \| \| \| \|	With commit fb9fe35, we start using transfer_inline_write for memcpy of TexSubImage. But SurfaceDMA command does not work well with texture array. This patch forces direct map when transfering multiple slices of a texture array. Fixes piglit regression "texelFetch fs sampler1DArray" Tested with MTT piglit, glretrace, conform. Reviewed-by: Sinclair Yeh <[email protected]>
*	svga: whitespace, line wrapping fixes in svga_surface.c	Brian Paul	2016-06-28	1	-11/+16
\|
*	gm107/ir: make sure that flagsDef is set when emitting setcond	Samuel Pitoiset	2016-06-28	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	Rely on the existence of a second destination when emitting a setcond flag is dangerous, because this doesn't mean that the flag has been correctly set. Instead rely on flagsDef like what emitX() does for flagsSrc. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: <[email protected]>
*	doc: improve INTEL_DEBUG documentation	Grazvydas Ignotas	2016-06-28	1	-2/+10
\| \| \| \| \| \| \| \|	Remove 'reg' option that does not actually exist, elaborate more about 'sync' and add the missing options. Signed-off-by: Grazvydas Ignotas <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	radeonsi: set PA_SU_SMALL_PRIM_FILTER_CNTL register on Polaris	Marek Olšák	2016-06-28	2	-0/+11
\| \| \| \| \| \| \|	This was missing. Cc: 12.0 <[email protected]> Reviewed-by: Alex Deucher <[email protected]>
*	radeon/vce: use vce structure for vce 52 firmware	Boyuan Zhang	2016-06-28	5	-98/+517
\| \| \| \| \| \|	Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Christian König <[email protected]> Reviewed-by: Leo Liu <[email protected]>
*	radeon/vce: add vce structures	Boyuan Zhang	2016-06-28	1	-0/+297
\| \| \| \| \| \|	Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Christian König <[email protected]> Reviewed-by: Leo Liu <[email protected]>
*	st/omx: fix decoder fillout for the OMX result buffer	Leo Liu	2016-06-28	1	-4/+5
\| \| \| \| \| \| \| \| \| \| \| \| \|	The call for vl_video_buffer_adjust_size is with wrong order of arguments, apparently it will have problem when interlaced false; The size of OMX result buffer depends on real size of clips, vl buffer dimension is aligned with 16, so 1080p(1920*1080) video will overflow the OMX buffer Signed-off-by: Leo Liu <[email protected]> Reviewed-by: Christian König <[email protected]> Tested-by: Julien Isorce <[email protected]>
*	pipe_loader_sw: Fix fd leak when instantiated via pipe_loader_sw_probe_kms	Hans de Goede	2016-06-28	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Make pipe_loader_sw_probe_kms take ownership of the passed in fd, like pipe_loader_drm_probe_fd does. The only caller is dri_kms_init_screen which passes in a dupped fd, just like dri2_init_screen passes in a dupped fd to pipe_loader_drm_probe_fd. Signed-off-by: Hans de Goede <[email protected]> Reviewed-by: Emil Velikov <[email protected]>
*	clover: Fix kernel metadata retrieval after clang r273425	Jan Vesely	2016-06-27	1	-4/+31
\| \| \| \| \|	Signed-off-by: Jan Vesely <[email protected]> Acked-by: Francisco Jerez <[email protected]>
*	clover/llvm: Fix copyright attribution of invocation.cpp.	Francisco Jerez	2016-06-27	1	-1/+5
\| \| \| \| \| \| \| \| \| \|	This file still only has my name on the copyright notice even though most of the code (likely more than 90% of it) was authored by various contributors -- It doesn't seem right to have the whole file attributed to myself. Acked-by: Michel Dänzer <[email protected]> Acked-by: Serge Martin <[email protected]>
*	i965: Print EOT in fs_visitor::dump_instruction().	Kenneth Graunke	2016-06-27	1	-0/+4
\| \| \| \| \| \| \|	This was useful when debugging the previous commit's issue. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	i965: Make emit_urb_writes() not produce an EOT message for GS.	Kenneth Graunke	2016-06-27	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	emit_urb_writes() contains code to emit an EOT write with no actual data when there are no output varyings. This makes sense for the VS and TES stages, where it's called once at the end of the program. However, in the geometry shader stage, emit_urb_writes() is called once for every EmitVertex(). We explicitly emit a URB write with EOT set at the end of the shader, separately from this path. So we'd better not terminate the thread. This could get us into trouble for shaders which do EmitVertex() with no varyings followed by SSBO/image/atomic writes. It also caused us to emit multiple sends with EOT set, which apparently confuses the register allocator into not using g112-g127 for all but the first one. This caused EU validation failures in OglGSCloth shaders in shader-db. (The actual application was fine, but shader-db thinks there are no outputs because it doesn't understand transform feedback.) Cc: [email protected] Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	glsl: Ignore ir_texture in lower_const_arrays_to_uniforms.	Kenneth Graunke	2016-06-27	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	The only part of an ir_texture which can be an array is the offsets array in textureGatherOffsets() calls. We don't want to lower those, because they're required to remain constants. Fixes textureGatherOffsets with Gallium drivers such as llvmpipe, which commit ef78df8d3b0cf540e5f08c8c2f6caa338b64a6c7 regressed. Cc: [email protected] Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	gm107/ir: add missing setcond flags for LOP variants	Samuel Pitoiset	2016-06-28	1	-0/+2
\| \| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: <[email protected]>
*	gm107/ir: make use of LOP32I for all immediates	Samuel Pitoiset	2016-06-28	1	-1/+1
\| \| \| \| \| \| \| \|	LOP only allows to emit 19-bits immediates. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: <[email protected]>
*	virgl: reduce some limits for now	Dave Airlie	2016-06-28	1	-3/+4
\| \| \| \| \| \| \| \| \|	These need to be passed from the host in caps structure if they are larger, this fixes a bunch of tests on Intel hw, that I'd put the limits too high for. Cc: "11.2 12.0" <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	st/omx: count number of slices	Julien Isorce	2016-06-27	1	-0/+3
\| \| \| \| \| \| \| \| \|	Used by nouveau driver. Similar patch was done for st/va: 851e7e12aa628d6781b5a3af2f2fc16ee73f435f Signed-off-by: Julien Isorce <[email protected]> Reviewed-by: Christian König <[email protected]>
*	st/omx: add support for nouveau / interlaced	Julien Isorce	2016-06-27	1	-25/+26
\| \| \| \| \|	Signed-off-by: Julien Isorce <[email protected]> Reviewed-by: Christian König <[email protected]>
*	st/omx: retrieve preferred interlaced and buffer_formats	Julien Isorce	2016-06-27	1	-2/+24
\| \| \| \| \| \| \|	Interlaced can be true for nouveau driver. Signed-off-by: Julien Isorce <[email protected]> Reviewed-by: Christian König <[email protected]>
*	radeonsi: use optimal WD settings for primitive restart on Polaris	Marek Olšák	2016-06-27	1	-2/+10
\| \| \| \| \| \|	ported from Vulkan Reviewed-by: Nicolai Hähnle <[email protected]>
*	st/va: Check NULL pointer	Gurkirpal Singh	2016-06-27	1	-0/+4
\| \| \| \| \| \| \| \| \| \|	Call to handle_table_get in vlVaDestroySurfaces can return NULL on failure. CID: 1243522 Signed-off-by: Gurkirpal Singh <[email protected]> Reviewed-by: Julien Isorce <[email protected]>
*	nir: Fix copy_prop_src when src is an indirect access on a reg.	Eric Anholt	2016-06-26	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	The intent was to continue down the indirect chain, not to call ourselves with unchanged input arguments. Found by code inspection, and comparison to copy_prop_alu_src(). We haven't hit this because callers of NIR's copy prop are doing so in SSA, before indirect variable dereferences have been lowered to registers. Reviewed-by: Rob Clark <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	gm107/ir: make use of MOV32I for all immediates	Samuel Pitoiset	2016-06-27	1	-2/+1
\| \| \| \| \| \| \| \| \|	MOV only allows to emit 19-bits immediates. This is similar to the previous fix I did for IMUL. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: <[email protected]>
*	i965: Use miptree to decide format on multi-plane images for gen < 7	Jordan Justen	2016-06-26	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \|	This wasn't handled correctly for multi-plane images on gen < 7 in 727a9b24933d384f5440ed4318fb720ed11d6dd1. Reported-by: Mark Janes <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96674 Signed-off-by: Jordan Justen <[email protected]> Cc: "12.0" <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	nvc0: update "derived" state function names	Ilia Mirkin	2016-06-26	1	-8/+8
\| \| \| \| \| \| \|	derived_1/2/etc aren't too informative. Instead name them based on the state they're derived from. Signed-off-by: Ilia Mirkin <[email protected]>
*	nvc0: provide support for unscaled poly offset units	Ilia Mirkin	2016-06-26	3	-3/+26
\| \| \| \| \| \| \| \|	On at least Kepler hardware, the units differ based on RT format. Emit a properly scaled value for Z16 depth buffers vs other formats, to help out st/nine. Signed-off-by: Ilia Mirkin <[email protected]>
*	gm107/ir: make use of IMUL32I for all immediates	Samuel Pitoiset	2016-06-26	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	IMUL only allows to emit 19-bits immediates. This is similar to d30768025a2283d4cc57930b784798bf278969da which fixed the same thing for the GK110 emitter. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Cc: <[email protected]>
*	radeonsi: make si_is_format_supported static	Marek Olšák	2016-06-25	3	-11/+6
\| \| \| \| \| \|	Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Vedran Miletić <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: boolean -> bool, TRUE -> true, FALSE -> false	Marek Olšák	2016-06-25	4	-15/+15
\| \| \| \| \| \|	Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Vedran Miletić <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/radeon: boolean -> bool, TRUE -> true, FALSE -> false	Marek Olšák	2016-06-25	9	-75/+75
\| \| \| \| \| \|	Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Vedran Miletić <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/radeon/winsyses: boolean -> bool, TRUE -> true, FALSE -> false	Marek Olšák	2016-06-25	9	-132/+134
\| \| \| \| \| \|	Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Vedran Miletić <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/radeon: use r600_resource_reference	Marek Olšák	2016-06-25	12	-40/+32
\| \| \| \| \| \|	Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Vedran Miletić <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	nir: Add a NIR_VALIDATE environment variable	Jason Ekstrand	2016-06-25	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \|	It defaults to true so default behavior doesn't change but it allows you to do NIR_VALIDATE=false if you don't want validation. Disabling validation can substantially speed up shader compiles so you frequently want to turn it off if compiler invariants aren't in question. Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Matt Turner <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	st/nine: Use offset_units_unscaled	Axel Davy	2016-06-25	5	-11/+13
\| \| \| \| \| \| \| \| \| \| \| \|	offset_units_unscaled enables proper support for depth bias for gallium nine. Use it if available. Solves issues with some games using depth bias. For example: https://github.com/iXit/Mesa-3D/issues/220 Signed-off-by: Axel Davy <[email protected]>
*	r600g: Implement POLYGON_OFFSET_UNITS_UNSCALED	Axel Davy	2016-06-25	5	-36/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Empirical tests show that the polygon offset behaviour is entirely determined by the content of the PA_SU_POLY_OFFSET states, and not by the depth buffer format bound. PA_SU_POLY_OFFSET seems to directly set the parameters of the polygon offset formula, and setting 0 for PA_SU_POLY_OFFSET_DB_FMT_CNTL (ie setting the unorm depth bias behaviour with a scale of 2^0 = 1.0f) gives the unscaled behaviour. Signed-off-by: Axel Davy <[email protected]> Reviewed-by: Marek Olšák <[email protected]>