mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	freedreno: make gmem tile size alignment configurable	Rob Clark	2016-11-30	3	-8/+17
\| \| \| \| \| \| \|	a5xx seems to prefer 64 pixel alignment, in at least some cases. Make this configurable per generation. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: don't offset inloc by 8	Rob Clark	2016-11-30	4	-27/+15
\| \| \| \| \| \| \| \| \|	On a3xx/a4xx, the SP_VS_VPC_DST_REG.OUTLOCn is offset by 8, so we used to add this offset into fs->inputs[n].inloc. But a5xx drops this extra offset-by-8. So instead make inloc zero based and add the offset when we emit OUTLOCn values (for the gen's that need the offset). Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: use new shader linkage helper	Rob Clark	2016-11-30	1	-27/+16
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a4xx: use new shader linkage helper	Rob Clark	2016-11-30	1	-27/+16
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add new helper for shader linkage	Rob Clark	2016-11-30	1	-0/+47
\| \| \| \| \| \| \|	Helps simplify things on a5xx, where pos/psize get added to the vs-out map. And anyways, simplifies a3xx and a4xx. Signed-off-by: Rob Clark <[email protected]>
*	st/mesa: skip lower_output_reads when possible	Nicolai Hähnle	2016-11-30	1	-1/+2
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	st/glsl_to_tgsi: swizzle PROGRAM_OUTPUTs correctly in src_register translation	Nicolai Hähnle	2016-11-30	1	-1/+11
\| \| \| \| \| \| \|	This is required for reading directly from fragment shader stencil and depth outputs. Reviewed-by: Marek Olšák <[email protected]>
*	gallium: add PIPE_CAP_TGSI_CAN_READ_OUTPUTS	Nicolai Hähnle	2016-11-30	17	-0/+18
\| \| \| \| \| \| \| \| \| \| \|	Drivers that support this benefit by saving one lowering pass in the GLSL-to-TGSI conversion. radeonsi already supports this because all outputs are stored in temporary variables before the export (except for TCS outputs, which have always been readable in TGSI anyway due to their special semantics). Reviewed-by: Marek Olšák <[email protected]>
*	ac/nir: Fix out of bounds array access.	Bas Nieuwenhuizen	2016-11-30	1	-1/+1
\| \| \| \| \| \| \|	With nir_intrinsic_ssbo_atomic_comp_swap we run out of params. Signed-off-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Dave Airlie <[email protected]>
*	aubinator: Add support for enum types	Kristian H. Kristensen	2016-11-29	2	-40/+93
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Fix ksp for INTERFACE_DESCRIPTOR_DATA	Kristian H. Kristensen	2016-11-29	2	-4/+2
\| \| \| \| \| \| \| \| \| \|	This one was split across two dwords as "Kernel Start Pointer" and "Kernel Start Pointer High", which looks like it works when the driver only accesses "Kernel Start Pointer". This breaks, of course, with BO offsets > 4G. Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Use enum 3D_Logic_Op_Function where applicable	Kristian H. Kristensen	2016-11-29	5	-56/+62
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Use blend function and factor enums where applicable	Kristian H. Kristensen	2016-11-29	5	-130/+124
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Use enum 3D_Vertex_Component_Control where applicable	Kristian H. Kristensen	2016-11-29	5	-20/+20
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Use enum 3D_Stencil_Operation where applicable	Kristian H. Kristensen	2016-11-29	5	-84/+63
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Use enum SURFACE_FORMAT where applicable	Kristian H. Kristensen	2016-11-29	5	-10/+10
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Use enum 3D_Prim_Topo_Type where applicable	Kristian H. Kristensen	2016-11-29	5	-15/+15
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Use 3D_Compare_Function for gen8+ test functions	Kristian H. Kristensen	2016-11-29	2	-8/+8
\| \| \| \| \| \| \| \| \|	When the state fields where shuffled around for gen8, the compare function enums were downgraded to just uints. Change them to enum 3D_Compare_Function. Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Emit genxml enums as C enums	Kristian H. Kristensen	2016-11-29	1	-4/+4
\| \| \| \| \| \| \| \| \|	The previous commits got rid of any clashes between #defines and enum values and we can now emit the genxml enums as debugger friendly C enums. Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Remove duplicate COMPAREFUNCTION values	Kristian H. Kristensen	2016-11-29	3	-120/+12
\| \| \| \| \| \| \| \|	These values were defined both as an enum and as inline values. Remove the inline values and reference the 3D_Compare_Function enum instead. Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Allow referencing enums in type attributes	Kristian H. Kristensen	2016-11-29	1	-0/+7
\| \| \| \| \| \| \|	This lets us reference enums in the type attribute of a field. Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	anv: Emit cherryview SF state without including gen9_pack.h	Kristian H. Kristensen	2016-11-29	1	-13/+23
\| \| \| \| \| \| \| \| \|	Cleaner this way and we avoid including gen9_pack.h when we compile with gen8_pack.h. We also avoid the if (cherryview) condition for non-gen8 gens that don't need it. Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	anv: Don't include two different pack headers	Kristian H. Kristensen	2016-11-29	1	-3/+5
\| \| \| \| \| \| \| \| \| \|	The batch chain logic only needs the pre-gen8 size of MI_BATCH_BUFFER_START, which seems like something we can make a special case for. The other two gen7 references, MI_BATCH_BUFFER_END and MI_NOOP, are the same on all gens. Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	intel/genxml: Move enums above structs	Kristian H. Kristensen	2016-11-29	5	-1726/+1726
\| \| \| \| \| \| \| \| \|	We'll need to define them before we can reference them in structs and instructions. Enums have no dependencies, so move them first in the file. Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	genxml: Add values for Barycentric Interpolation Mode	Kristian H. Kristensen	2016-11-29	5	-5/+40
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	anv: remove per-sample shading from TODO	Ilia Mirkin	2016-11-30	1	-1/+0
\| \| \| \| \| \| \|	This was done some time ago. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	anv: clean up VkPhysicalDeviceFeatures list	Ilia Mirkin	2016-11-30	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	Remove duplicate .alphaToOne, add missing .shaderResourceMinLod, and reorder a few entries to match their vulkan.h order. All the sparse features are still left out entirely. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Lionel Landwerlin <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	vulkan/wsi/x11: Destroy Present event context when destroying swapchain	Michel Dänzer	2016-11-30	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Without this, the X server may accumulate stale Present event contexts if a client creates and destroys multiple swapchains using the same window. v2: Based on Chris Wilson's review: * Use xcb_present_select_input_checked so that protocol errors generated by old X servers can be handled gracefully * Use xcb_discard_reply() instead of free(xcb_request_check()) v3: Rebased on top of this code having been refactored out of anv Reviewed-by: Dave Airlie <[email protected]>
*	glsl: use linked_shaders bitmask to iterate stages for subroutine fields	Timothy Arceri	2016-11-30	2	-31/+26
\| \| \| \| \| \| \| \| \|	This should be faster than looping over every stage and null checking, but will also make the code a bit cleaner when we switch to getting more fields from gl_program rather than from gl_linked_shader as we can just copy the pointer and not need to worry about null checking then copying. Reviewed-by: Ian Romanick <[email protected]>
*	mesa: optimise interleaved sso validation	Timothy Arceri	2016-11-30	1	-11/+14
\| \| \| \| \| \| \| \| \| \|	Now that we have a linked_stages bitfield we can use this to check if the program is used at a later stage. This change is also required to be able to use gl_program rather than gl_shader_program in the CurrentProgram array. Reviewed-by: Ian Romanick <[email protected]>
*	mesa/glsl: add bitmask to track stages a program was linked against	Timothy Arceri	2016-11-30	2	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This will be used to enable us to store the current gl_program rather than gl_shader_program in the gl_pipline_object allowing us to simplify handing of validation. Also we should not be depending on _LinkedShader for this information as it may contain shaders from a failed linking attempt rather than the current program still in use. We could also use this mask to iterate over the stages during linking with _mesa_bit_scan() rather then the current method of NULL checking each stage. Reviewed-by: Ian Romanick <[email protected]>
*	swr: [rasterizer jit] use signed integer representation for logic op	Ilia Mirkin	2016-11-29	1	-5/+12
\| \| \| \| \| \| \| \| \| \|	Instead of (incorrectly) biasing the snorm value to make it look like a unorm, just use signed integer math. This fixes arb_color_buffer_float-render GL_RGBA8_SNORM Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Tim Rowley <[email protected]>
*	swr: add missing rgbx8_srgb variant	Ilia Mirkin	2016-11-29	1	-0/+1
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Tim Rowley <[email protected]>
*	swr: reorder renderable formats, add grouping comments	Ilia Mirkin	2016-11-29	1	-65/+87
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Tim Rowley <[email protected]>
*	swr: use util_copy_framebuffer_state helper	Ilia Mirkin	2016-11-29	1	-12/+1
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Tim Rowley <[email protected]>
*	swr: enable cubemap arrays	Ilia Mirkin	2016-11-29	1	-1/+1
\| \| \| \| \| \| \|	Everything is in place for these. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Tim Rowley <[email protected]>
*	swr: rearrange caps into limits/supported/unsupported groups	Ilia Mirkin	2016-11-29	1	-129/+84
\| \| \| \| \| \| \| \| \| \|	I find this a lot more readable and compact - much easier to scan through the list and see what's on and what's off. No functional change intended. Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Tim Rowley <[email protected]>
*	swr: only store up to the LOD size	Ilia Mirkin	2016-11-29	1	-1/+3
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Tim Rowley <[email protected]>
*	swr: [rasterizer common] add SwrTrace() and macros	Tim Rowley	2016-11-29	2	-15/+95
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	radeonsi: don't fetch 8 dwords for samplerBuffer and imageBuffer	Marek Olšák	2016-11-29	1	-51/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The compiler doesn't shrink s_load_dwordx8, so we always wasted 4 SGPRs. Also, the extraction of the descriptor created some really ugly asm code with lots of VALU bitwise ops and v_readfirstlane. Totals from affected shaders: SGPRS: 13880 -> 13253 (-4.52 %) VGPRS: 15200 -> 15088 (-0.74 %) Code Size: 499864 -> 459816 (-8.01 %) bytes Max Waves: 1554 -> 1564 (0.64 %) Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: disable XNACK to free 2 SGPRs on APUs	Marek Olšák	2016-11-29	1	-1/+1
\| \| \| \| \| \|	My LLVM commit disables it for dGPUs, but not APUs. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: count and report temp arrays in scratch separately	Marek Olšák	2016-11-29	2	-4/+40
\| \| \| \| \| \|	v2: only do this if debug output of shader dumping is enabled Reviewed-by: Nicolai Hähnle <[email protected]> (v1)
*	radeonsi: don't try to eliminate trivial VS outputs for PS and CS	Marek Olšák	2016-11-29	1	-1/+4
\| \| \| \| \| \|	PS and CS don't have any param exports, so it's a no-op. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: disable RB+ blend optimizations for dual source blending	Marek Olšák	2016-11-29	1	-0/+11
\| \| \| \| \| \| \| \|	This fixes dual source blending on Stoney. The fix was copied from Vulkan. The problem was discovered during internal testing. Cc: 13.0 <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: set CB_BLEND1_CONTROL.ENABLE for dual source blending	Marek Olšák	2016-11-29	1	-0/+4
\| \| \| \| \| \| \|	copied from Vulkan Cc: 13.0 <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: always set all blend registers	Marek Olšák	2016-11-29	1	-5/+5
\| \| \| \| \| \| \|	better safe than sorry Cc: 13.0 <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: set the smallest possible CB_TARGET_MASK	Marek Olšák	2016-11-29	1	-5/+5
\| \| \| \| \| \|	better safe than sorry; set_framebuffer_state always makes this dirty Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: don't print bodies of header-only packets	Marek Olšák	2016-11-29	1	-0/+4
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: print unknown registers with correct formatting	Marek Olšák	2016-11-29	1	-1/+2
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	ddebug: fix hang detection with deferred flushes	Marek Olšák	2016-11-29	1	-1/+1
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>