mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	gallium/hud: split hud_draw into 3 separate functions	Marek Olšák	2017-11-25	6	-79/+99
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	st/dri: remove dead code and incorrect comment around make_current	Marek Olšák	2017-11-25	1	-13/+0
\| \| \| \| \| \| \| \| \| \|	Core Mesa already handles flushing based on ContextReleaseBehavior, so the comment is wrong. Also, old_st is always NULL, because unbind_context always precedes make_current. Reviewed-by: Nicolai Hähnle <[email protected]>
*	st/dri: clean up dri_unbind_context	Marek Olšák	2017-11-25	1	-3/+4
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: expose all CB performance counters on Stoney	Marek Olšák	2017-11-25	1	-1/+1
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: handle imported textures with DCC robustly	Marek Olšák	2017-11-25	1	-1/+1
\| \| \| \| \| \| \|	now you can hack the driver to enable DCC for displayable textures and Glamor that doesn't enable that by default won't crash anymore. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: fix a typo in creating monolithic ES-GS	Marek Olšák	2017-11-25	1	-1/+1
\| \| \| \| \| \|	This has no effect because both occupy the same memory in a union. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: don't write undefined output channels to LDS in LS	Marek Olšák	2017-11-25	1	-0/+3
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: use ac.lds for shared memory	Marek Olšák	2017-11-25	3	-5/+3
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: do 64-bit LDS loads recursively	Marek Olšák	2017-11-25	1	-7/+9
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	mapi: Teach es{1,2}api/ABI-check shared library names on Cygwin	Jon Turney	2017-11-24	2	-8/+18
\| \| \| \| \| \| \| \| \| \| \|	Ideally we'd be able to get the library filename from libtool, but that doesn't seem to be a feature... Use of ${uname} is presumably ok here as we won't be running 'make check' if we are cross-compiling Signed-off-by: Jon Turney <[email protected]> Reviewed-by: Eric Engestrom <[email protected]>
*	Revert "radv: remove unnecessary memset() in radv_AllocateCommandBuffers()"	Samuel Pitoiset	2017-11-24	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes two CTS regressions: - dEQP-VK.api.object_management.alloc_callback_fail_multiple.command_buffer_primary - dEQP-VK.api.object_management.alloc_callback_fail_multiple.command_buffer_secondary These two tests are part the mustpass lists, so presumably they are correct and my change was wrong. This reverts commit 0f68208f1d1d3b7b2963dab40e84c60212518692. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv/winsys: improve error messages when the buffer list creation failed	Samuel Pitoiset	2017-11-24	1	-3/+6
\| \| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv/winsys: do not try to create a BO list with 0 buffers	Samuel Pitoiset	2017-11-24	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This happens when all BOs have the RADEON_FLAG_NO_INTERPROCESS_SHARING (DRM version >= 3.23) flag set. This flag is mainly used for reducing overhead on the userspace side because we don't have to put those BOs inside the list. Though, if the driver tries to create a list with 0 buffers inside it, libdrm returns -EINVAL and the app just crashes. This fixes a bunch of CTS dEQP-VK.sparse_resources.* fails (~100). Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	i965/vec4: fix splitting of interleaved attributes	Iago Toral Quiroga	2017-11-24	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When we split an instruction that reads an uniform value (vstride 0) we need to respect the vstride on the second half of the instruction (that is, the second half should read the same region as the first). We were doing this already, but we didn't account for stages that have interleaved input attributes which also have a vstride of 0 and need the same treatment. Fixes the following on Haswell: KHR-GL45.enhanced_layouts.varying_locations KHR-GL45.enhanced_layouts.varying_array_locations KHR-GL45.enhanced_layouts.varying_structure_locations Reviewed-by: Matt Turner <[email protected]> Acked-by: Andres Gomez <[email protected]>
*	etnaviv: Emit vertex buffers consecutively	Wladimir J. van der Laan	2017-11-23	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Vertex buffer legacy state is no longer picked up with new drawing commands. Change to use different cases depending on the number of vertex streams in the GPU specs. This results in slightly more compact state emission as well, on all vivantes. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Lucas Stach <[email protected]> Reviewed-by: Christian Gmeiner <[email protected]>
*	genxml: fix assert guards	Eric Engestrom	2017-11-23	1	-5/+5
\| \| \| \| \| \| \|	This removes a few hundred warnings on debug builds with asserts off. Signed-off-by: Eric Engestrom <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	meson: add variable for mapi_abi.py instead of going back up the tree	Eric Engestrom	2017-11-23	5	-4/+6
\| \| \| \| \|	Signed-off-by: Eric Engestrom <[email protected]> Reviewed-by: Dylan Baker <[email protected]>
*	meson: reorder subdirs to avoid directly including more than one level	Eric Engestrom	2017-11-23	3	-2/+3
\| \| \| \| \|	Signed-off-by: Eric Engestrom <[email protected]> Reviewed-by: Dylan Baker <[email protected]>
*	r600: set DX10_CLAMP for compute shader too	Roland Scheidegger	2017-11-23	1	-2/+3
\| \| \| \| \| \| \| \|	I really intended to set this for all shader stages by 3835009796166968750ff46cf209f6d4208cda86 but missed it for compute shaders (because it's in a different source file...). Reviewed-by: Dave Airlie <[email protected]>
*	anv: flag batch & instruction BOs for capture	Lionel Landwerlin	2017-11-22	2	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	When the kernel support flagging our BO, let's mark batch & instruction BOs for capture so then can be included in the error state. v2: Only add EXEC_CAPTURE if supported (Kristian) v3: Fix operator precedence issue (Lionel) Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	anv: setup BO flags at state_pool/block_pool creation	Lionel Landwerlin	2017-11-22	7	-22/+41
\| \| \| \| \| \| \| \|	This will allow to set the flags on any anv_bo created/filled from a state pool or block pool later. Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	r600/shader: Fix all warnings issed with "-Wall -Wextra"	Gert Wollny	2017-11-22	1	-31/+36
\| \| \| \| \| \| \| \| \| \| \| \|	- fix a number of -Wsign-compare warnings - fix two warnings for -Woverride-init because TGSI_OPCODE_CEIL == 83, and the according field was defined two times. [airlied: don't use -1 with unsigned type, fix whitespace] Signed-off-by: Gert Wollny <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	r600: Emit EOP for more CF instruction types	Gert Wollny	2017-11-22	4	-7/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	So far on pre-cayman chipsets the CF instructions CF_OP_LOOP_END, CF_OP_CALL_FS, CF_OP_POP, and CF_OP_GDS an extra CF_NOP instruction was added to add the EOP flag, even though this is not actually needed, because all these instrutions support the EOP flag. This patch removes the fixup code, adds setting the EOP flag for the according instructions as well as others like CF_OP_TEX and CF_OP_VTX, and adds writing out EOP for this type of instruction in the disassembler. This also fixes a bug where shaders were created that didn't actually have the EOP flag set in the last CF instruction, which might have resulted in GPU lockups. [airlied: cleaned up a little] Signed-off-by: Gert Wollny <[email protected]> Cc: <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	meson: replace with_*dri with with_dri_platform	Dylan Baker	2017-11-22	2	-3/+3
\| \| \| \| \| \| \| \|	This fixes the windows and macos stubs to be consistent with the *nix path. Signed-off-by: Dylan Baker <[email protected]> Reviewed-by: Eric Engestrom <[email protected]>
*	meson: Enable SSE4.1 optimizations	Dylan Baker	2017-11-22	1	-3/+11
\| \| \| \| \| \| \| \| \| \| \| \|	This patch checks for an and then enables sse4.1 optimizations if the host machine will be x86/x86_64. v2: - Don't compile code, it's unnecessary since we require a compiler which always has SSE4.1 (Matt) v3: - x64 -> x86_64 (Matt) Signed-off-by: Dylan Baker <[email protected]> Reviewed-by: Eric Engestrom <[email protected]>
*	broadcom/vc5: Fix BASE_LEVEL handling with txl.	Eric Anholt	2017-11-22	2	-2/+8
\| \| \| \| \| \| \|	The HW doesn't add the base level anywhere (the min/max lod clamping is what does base level), so we need to add it manually in this case. Fixes piglit tex-miplevel-selection *Lod 2D.
*	broadcom/vc5: Fix array texture layer count setup.	Eric Anholt	2017-11-22	1	-1/+6
\| \| \| \|	Fixes piglit array-texture.
*	broadcom/vc5: Don't increment primitive queries while they're paused.	Eric Anholt	2017-11-22	1	-1/+3
\| \| \| \|	Fixes ext_transform_feedback-generatemipmap prims_generated
*	broadcom/vc5: Fix incorrect padding of TF outputs.	Eric Anholt	2017-11-22	1	-0/+2
\| \| \| \| \| \|	After the first output, we were padding by an extra size of the previous output. Fixes piglit ext_transform_feedback-output-type mat4x3[2] and friends.
*	broadcom/vc5: Fix UIF surface size setup for ARB_fbo's mismatched sizes.	Eric Anholt	2017-11-22	1	-2/+23
\| \| \| \| \| \| \| \| \| \|	The HW was computing an implicit height for the surface based on the image size, but that may be smaller than the surface with ARB_fbo mismatched sizes. In that case, we need to tell it about the pad, either with the little 4-bit field in the RT config, or the extended field in CLEAR_COLORS_PART3. Fixes piglit arb_framebuffer_object-mixed-buffer-sizes.
*	etnaviv: Put HALTI level in specs	Wladimir J. van der Laan	2017-11-22	2	-0/+23
\| \| \| \| \| \| \| \| \| \|	The HALTI level is an indication of the gross architecture of the GPU. It determines for significant part what feature level the GPU has, what state (especially frontend state) is there, and where it is located. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Christian Gmeiner <[email protected]> Signed-off-by: Lucas Stach <[email protected]>
*	etnaviv: Const-correctness etnaviv_emit.h	Wladimir J. van der Laan	2017-11-22	1	-1/+1
\| \| \| \| \| \| \| \| \|	The relocation structure is never changed by submitting it. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Philipp Zabel <[email protected]> Reviewed-by: Christian Gmeiner <[email protected]> Signed-off-by: Lucas Stach <[email protected]>
*	meson: add si_driinfo.h in libgallium_dri	Juan A. Suarez Romero	2017-11-22	1	-0/+1
\| \| \| \| \| \|	v2: generate target conditionally (Dylan) Reviewed-by: Dylan Baker <[email protected]>
*	nir/gather_info: recognize load_patch_vertices_in as a system value	Iago Toral Quiroga	2017-11-22	1	-0/+1
\| \| \| \| \| \| \| \|	This intrinsic is produced to load SYSTEM_VALUE_VERTICES_IN, which is generated to load gl_PatchVerticesIn in the SPIR-V path for both Vulkan and OpenGL. Reviewed-by: Marek Olšák <[email protected]>
*	i965: Support decoding INTERFACE_DESCRIPTOR_DATA with INTEL_DEBUG=bat	Jordan Justen	2017-11-21	1	-0/+24
\| \| \| \| \| \| \| \|	This will dump the INTERFACE_DESCRIPTOR_DATA along with the associated samplers & surfaces. Signed-off-by: Jordan Justen <[email protected]> Reviewed-by: Scott D Phillips <[email protected]>
*	intel/genxml: Add helpers for determining field type	Kristian H. Kristensen	2017-11-21	1	-6/+17
\| \| \| \|	Reviewed-by: Lionel Landwerlin <[email protected]>
*	i965/fs: Check ADD/MAD with immediates in satprop unit test	Matt Turner	2017-11-21	1	-1/+125
\| \| \| \| \| \| \| \| \|	The gen had to be changed from 4 to 6 so that we could test MAD, which is new on Gen6. mad_imm_float_neg_mov_sat tests the case fixed by the previous commit. Reviewed-by: Ian Romanick <[email protected]>
*	i965/fs: Handle negating immediates on MADs when propagating saturates	Matt Turner	2017-11-21	1	-2/+8
\| \| \| \| \| \| \| \| \| \| \|	MADs don't take immediate sources, but we allow them in the IR since it simplifies a lot of things. I neglected to consider that case. Fixes: 4009a9ead490 ("i965/fs: Allow saturate propagation to propagate negations into MADs.") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103616 Reported-and-Tested-by: Ruslan Kabatsayev <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	mesa/teximage: add TEXTURE_CUBE_MAP_ARRAY target for CompressedTexImage3D	Juan A. Suarez Romero	2017-11-21	1	-1/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	From section 8.7, page 179 of OpenGL ES 3.2 spec: An INVALID_OPERATION error is generated by CompressedTexImage3D if internalformat is one of the the formats in table 8.17 and target is not TEXTURE_2D_ARRAY, TEXTURE_CUBE_MAP_ARRAY or TEXTURE_3D. An INVALID_OPERATION error is generated by CompressedTexImage3D if internalformat is TEXTURE_CUBE_MAP_ARRAY and the “Cube Map Array” column of table 8.17 is not checked, or if internalformat is TEXTURE_3D and the “3D Tex.” column of table 8.17 is not checked. So far it was only considering TEXTURE_2D_ARRAY as valid target. But as "Cube Map Array" column is checked for all the cases, in practice we can consider also TEXTURE_CUBE_MAP_ARRAY. This fixes KHR-GLES32.core.texture_cube_map_array.etc2_texture Reviewed-by: Nanley Chery <[email protected]>
*	intel: fix disasm_info memory leaks	Tapani Pälli	2017-11-21	2	-2/+2
\| \| \| \| \| \| \| \|	Fixes: 4f82b1728719 ("i965: Rewrite disassembly annotation code") Cc: Matt Turner <[email protected]> Signed-off-by: Tapani Pälli <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	st/glsl_to_nir: don't generate nir twice for gs	Timothy Arceri	2017-11-21	1	-8/+2
\| \| \| \| \| \|	This was left out of c980a3aa3133 Reviewed-by: Marek Olšák <[email protected]>
*	llvmpipe: fix snorm blending	Roland Scheidegger	2017-11-21	4	-53/+191
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The blend math gets a bit funky due to inverse blend factors being in range [0,2] rather than [-1,1], our normalized math can't really cover this. src_alpha_saturate blend factor has a similar problem too. (Note that piglit fbo-blending-formats test is mostly useless for anything but unorm formats, since not just all src/dst values are between [0,1], but the tests are crafted in a way that the results are between [0,1] too.) v2: some formatting fixes, and fix a fairly obscure (to debug) issue with alpha-only formats (not related to snorm at all), where blend optimization would think it could simplify the blend equation if the blend factors were complementary, however was using the completely unrelated rgb blend factors instead of the alpha ones... Reviewed-by: Jose Fonseca <[email protected]>
*	r600: add cull distance support	Dave Airlie	2017-11-21	6	-6/+24
\| \| \| \| \| \|	This passes all the tests in piglit. Signed-off-by: Dave Airlie <[email protected]>
*	i965: Optimize bucket index calculation	Aravindan Muthukumar	2017-11-20	1	-8/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reducing Bucket index calculation to O(1). This algorithm calculates the index using matrix method. Assuming PAGE_SIZE is 4096, matrix arrangement is as below: 14096 24096 34096 44096 54096 64096 74096 84096 104096 124096 144096 164096 204096 244096 284096 324096 ... ... ... ... ... ... ... ... ... ... ... max_cache_size From this matrix its clearly seen that every row follows the below way: ... ... ... n n+(1/4)n n+(1/2)n n+(3/4)n 2n Row is calculated as log2(size/PAGE_SIZE) Column is calculated as converting the difference between the elements to fit into power size of two and indexing it. Final Index is (row*4)+(col-1) Tested with Intel Mesa CI. Improves performance of 3DMark on BXT by 0.705966% +/- 0.229767% (n=20) v4: Review comments on style and code comments implemented (Ian). v3: Review comments implemented (Ian). v2: Review comments implemented (Jason). Signed-off-by: Aravindan Muthukumar <[email protected]> Signed-off-by: Kedar Karanje <[email protected]> Reviewed-by: Yogesh Marathe <[email protected]> Signed-off-by: Ian Romanick <[email protected]>
*	meson: Guard the gallium dri componenet	Dylan Baker	2017-11-20	1	-2/+4
\| \| \| \| \| \| \| \|	Currently the target has a redundant guard, and the state tracker isn't properly guarded. Signed-off-by: Dylan Baker <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	meson: don't build gallium subdir unless we're building gallium	Dylan Baker	2017-11-20	1	-1/+3
\| \| \| \| \| \| \|	This will allow us to simplify some guards within the gallium directory. Signed-off-by: Dylan Baker <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	broadcom/vc5: Align 1D texture miplevels to 64b.	Eric Anholt	2017-11-20	1	-0/+2
\| \| \| \|	Fixes tex-miplevel-selection GL2:texture() 1D
*	broadcom/vc5: Clamp min lod to the last level.	Eric Anholt	2017-11-20	1	-2/+3
\| \| \| \| \| \|	Otherwise, the simulator would complain in tex-miplevel-selection that the min/max clamp was out of order. The actual HW seems to have clamped to the max anyway.
*	broadcom/vc5: Increase simulator memory for tex-miplevel-selection.	Eric Anholt	2017-11-20	1	-1/+1
\| \| \| \| \|	We were overflowing, because of all the little 4k allocations for CLs that were getting expanded to 128kb in the simulator due to the GMP alignment.
*	swr/rast: Repair simd8 frontend code rot	Tim Rowley	2017-11-20	1	-1/+1
\| \| \| \| \| \|	Keep non-default simd8 frontend code running for comparison purposes. Reviewed-by: Bruce Cherniak <[email protected]>