mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	glsl: always call do_lower_jumps() after loop unrolling	Timothy Arceri	2018-04-04	1	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes a bug in radeonsi where LLVM cannot handle the case where a break exists but its not the last instruction in the block. LLVM would fail with: Terminator found in the middle of a basic block! LLVM ERROR: Broken function found, compilation aborted! Fixes: 96fe8834f539 "glsl_to_tgsi: do fewer optimizations with GLSLOptimizeConservatively" Reviewed-by: Matt Turner <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105317
*	vulkan/wsi/wayland: fix leaks	James Legg	2018-04-03	1	-0/+4
\| \| \| \| \| \|	Fixes: bfa22266cd ("vulkan/wsi/wayland: Add support for zwp_dmabuf") Reviewed-by: Daniel Stone <[email protected]> CC: Jason Ekstrand <[email protected]>
*	docs: update calendar, add news and link release notes to 17.3.8	Juan A. Suarez Romero	2018-04-03	3	-7/+8
\| \| \| \|	Signed-off-by: Juan A. Suarez Romero <[email protected]>
*	docs: add sha256 checksums for 17.3.8	Juan A. Suarez Romero	2018-04-03	1	-1/+2
\| \| \| \| \|	Signed-off-by: Juan A. Suarez Romero <[email protected]> (cherry picked from commit ba371c7262a484391cace9d5e17635ed14c58692)
*	docs: add release notes for 17.3.8	Juan A. Suarez Romero	2018-04-03	1	-0/+146
\| \| \| \| \|	Signed-off-by: Juan A. Suarez Romero <[email protected]> (cherry picked from commit 3bf5c10c5c0e9fac6eb0b2c201bcf44755ecfaec)
*	st/mesa: Also use PIPE_FORMAT_R8G8B8A8_SRGB for framebuffer_sRGB.	Jakob Bornecrantz	2018-04-03	1	-1/+2
\| \| \| \| \| \| \| \| \|	When running virgl on a GLES host the only sRGB formats that support rendering is RGBA and RGBX. That pipe format is in the sRGB default lists that the state tracker uses when mapping mesa formats. Reviewed-by: Brian Paul <[email protected]> Signed-off-by: Jakob Bornecrantz <[email protected]>
*	intel: gen-decoder: print all dword a field belongs to	Lionel Landwerlin	2018-04-03	2	-7/+9
\| \| \| \| \| \| \| \| \| \|	Prior to printing a decoded field, print out all dwords that field belongs to. In particular with address fields spanning multiple dwords, we want to have all the dwords presented before the field is decoded to make it easier to read. Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Scott D Phillips <[email protected]>
*	intel: genxml: decode variable length MI_LRI	Lionel Landwerlin	2018-04-03	10	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	MI_LOAD_REGISTER_IMM can load multiple (register, value) tuples in one command. In our drivers we only use one tuple at a time, but the kernel might load more than one at a time. Instead of making all the tuple part of a group, we leave out the first tuple (the one we use in the generated packing structures). This is particularly useful for looking at error stats generated by the kernel. Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Scott D Phillips <[email protected]>
*	intel: gen-decoder: don't decode fields beyond a dword length	Lionel Landwerlin	2018-04-03	1	-15/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For example, a PIPE_CONTROL with DWordLength = 2 should look like this : 0xffffe374: 0x7a000002: PIPE_CONTROL 0xffffe374: 0x7a000002 : Dword 0 DWord Length: 2 0xffffe378: 0x00800000 : Dword 1 Depth Cache Flush Enable: false Stall At Pixel Scoreboard: false State Cache Invalidation Enable: false Constant Cache Invalidation Enable: false VF Cache Invalidation Enable: false DC Flush Enable: false Pipe Control Flush Enable: false Notify Enable: false Indirect State Pointers Disable: false Texture Cache Invalidation Enable: false Instruction Cache Invalidate Enable: false Render Target Cache Flush Enable: false Depth Stall Enable: false Post Sync Operation: 0 (No Write) Generic Media State Clear: false TLB Invalidate: false Global Snapshot Count Reset: false Command Streamer Stall Enable: false Store Data Index: 0 LRI Post Sync Operation: 1 (MMIO Write Immediate Data) Destination Address Type: 0 (PPGTT) Flush LLC: false 0xffffe37c: 0x00000000 : Dword 2 Address: 0x00000000 0xffffe384: 0x05000000: MI_BATCH_BUFFER_END Prior to this change, fields beyond the length of the command would be decoded (notice the MI_BATCH_BUFFER_END decoded as part of the previous PIPE_CONTROL) : 0xffffe374: 0x7a000002: PIPE_CONTROL 0xffffe374: 0x7a000002 : Dword 0 DWord Length: 2 0xffffe378: 0x00800000 : Dword 1 Depth Cache Flush Enable: false Stall At Pixel Scoreboard: false State Cache Invalidation Enable: false Constant Cache Invalidation Enable: false VF Cache Invalidation Enable: false DC Flush Enable: false Pipe Control Flush Enable: false Notify Enable: false Indirect State Pointers Disable: false Texture Cache Invalidation Enable: false Instruction Cache Invalidate Enable: false Render Target Cache Flush Enable: false Depth Stall Enable: false Post Sync Operation: 0 (No Write) Generic Media State Clear: false TLB Invalidate: false Global Snapshot Count Reset: false Command Streamer Stall Enable: false Store Data Index: 0 LRI Post Sync Operation: 1 (MMIO Write Immediate Data) Destination Address Type: 0 (PPGTT) Flush LLC: false 0xffffe37c: 0x00000000 : Dword 2 Address: 0x00000000 0xffffe380: 0x00000000 : Dword 3 0xffffe384: 0x05000000 : Dword 4 Immediate Data: 83886080 0xffffe384: 0x05000000: MI_BATCH_BUFFER_END Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Scott D Phillips <[email protected]>
*	intel: error_decode: add an option to decode all buffers	Lionel Landwerlin	2018-04-03	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \|	The kernel reports workaround batch buffers, but we're not presenting them currently. Also they might not be useful for debugging purely userspace driver issues, when problems arise because of interactions between kernel & userspace drivers, it's nice to be able to decode them. Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Scott D Phillips <[email protected]>
*	intel: genxml: add preemption control instructions	Lionel Landwerlin	2018-04-03	4	-0/+26
\| \| \| \| \| \| \|	Helpful to debug kernel workaround batchbuffers. Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Scott D Phillips <[email protected]>
*	mesa: ensure that variable is initialized	Dylan Baker	2018-04-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This variable controls whether we link using the glsl code path or the spirv path. It's set when we validate that all shaders are glsl or spirv, but if there are no shaders attached to the program it will remain unset, resulting in undefined behavior. We want to go down the glsl path in that case, so initialize to false. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=105820 Fixes: 16f6634e7fb5ada308e55b852cd49251e7f3f8b1 ("mesa/program: Link SPIR-V shaders using the SPIR-V code-path") Signed-off-by: Dylan Baker <[email protected]> Tested-by: Mark Janes <[email protected]> Reviewed-by: Alejandro Piñeiro <[email protected]>
*	radeonsi/gfx9: fix bad LLVM params in monolithic LS+HS	Marek Olšák	2018-04-03	1	-1/+5
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radv: enable VK_EXT_shader_viewport_index_layer	Samuel Pitoiset	2018-04-03	2	-0/+2
\| \| \| \| \| \| \| \|	The driver already supports exporting the Layer and ViewportIndex built-ins from vertex or tessellation shaders. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	nir+drivers: add helpers to get # of src/dest components	Rob Clark	2018-04-03	6	-25/+32
\| \| \| \| \| \| \| \| \|	Add helpers to get the number of src/dest components for an intrinsic, and update spots that were open-coding this logic to use the helpers instead. Signed-off-by: Rob Clark <[email protected]> Reviewed-by: Eric Anholt <[email protected]>
*	freedreno/ir3: fix fallout of unused false-depth elimination	Rob Clark	2018-04-03	2	-17/+19
\| \| \| \| \| \| \| \| \| \|	Since we were MARK flag for both preventing loops, and tracking whether instructions were used, we could end up in an infinite loop due to bd2ca2bcdd. Instead invert the logic.. mark all instructions UNUSED up front and clear the flag as we visit them. Fixes: bd2ca2bcdd freedreno/ir3: eliminate unused false-deps Signed-off-by: Rob Clark <[email protected]>
*	gallium/pipebuffer: fix parenthesis location	Timothy Arceri	2018-04-03	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Without this the return value will never get set to -1. This was first added in 49866c8f3457 and copied in 2b396eeed983. Fixes: 2b396eeed983 "gallium/pb_cache: add a copy of cache bufmgr independent of pb_manager" Reviewed-by: Marek Olšák <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102342
*	Revert "mesa: add GL_HALF_FLOAT as supported type to readpixels"	Tapani Pälli	2018-04-03	1	-2/+0
\| \| \| \| \| \| \| \| \|	This reverts commit 41cf30b8bc55fdf36adac3311002dc32b6715949. Commit caused regressions with KHR-GLES3.packed_pixels.* tests. Signed-off-by: Tapani Pälli <[email protected]> Suggested-by: Eric Anholt <[email protected]>
*	gallivm: Fix include for LLVMAddPromoteMemoryToRegisterPass	Mike Lothian	2018-04-02	1	-0/+3
\| \| \| \| \| \| \| \| \|	Include llvm-c/Transforms/Utils.h with the newest LLVM 7 Signed-of-by: Mike Lothian <[email protected]> Reviewed-by: Samuel Pitoiset <[email protected]> Tested-by: Dieter Nützel <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	radeonsi: Fix include for LLVMAddPromoteMemoryToRegisterPass	Mike Lothian	2018-04-02	1	-0/+3
\| \| \| \| \| \| \| \| \|	Include llvm-c/Transforms/Utils.h with the newest LLVM 7 Signed-of-by: Mike Lothian <[email protected]> Reviewed-by: Samuel Pitoiset <[email protected]> Tested-by: Dieter Nützel <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	ac/nir: Fix include for LLVMAddPromoteMemoryToRegisterPass	Mike Lothian	2018-04-02	1	-0/+3
\| \| \| \| \| \| \| \| \|	Include llvm-c/Transforms/Utils.h with the newest LLVM 7 Signed-of-by: Mike Lothian <[email protected]> Reviewed-by: Samuel Pitoiset <[email protected]> Tested-by: Dieter Nützel <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	st/dri: Initialise modifier to INVALID for DRI2	Daniel Stone	2018-04-02	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When allocating a buffer for DRI2, set the modifier to INVALID to inform the backend that we have no supplied modifiers and it should do its own thing. The missed initialisation forced linear, even if the implementation had made other decisions. This resulted in VC4 DRI2 clients failing with: Modifier 0x0 vs. tiling (0x700000000000001) mismatch Signed-off-by: Daniel Stone <[email protected]> Reported-by: Andreas Müller <[email protected]> Reviewed-by: Eric Anholt <[email protected]> Fixes: 3f8513172ff6 ("gallium/winsys/drm: introduce modifier field to winsys_handle")
*	radeonsi: implement GL_KHR_blend_equation_advanced	Marek Olšák	2018-04-02	14	-18/+205
\| \| \| \| \| \| \|	MSAA is supported using sample shading. Layered rendering and all texture targets are also supported. Tested-by: Dieter Nützel <[email protected]>
*	radeonsi: rename unpack_param -> si_unpack_param	Marek Olšák	2018-04-02	2	-27/+31
\| \| \| \|	Tested-by: Dieter Nützel <[email protected]>
*	radeonsi: move FMASK shader logic to shared code	Marek Olšák	2018-04-02	3	-72/+61
\| \| \| \| \| \|	We'll need it for FBFETCH in both TGSI and NIR paths. Tested-by: Dieter Nützel <[email protected]>
*	radeonsi: add R600_DEBUG=nofmask to disable MSAA compression	Marek Olšák	2018-04-02	5	-14/+17
\| \| \| \| \| \|	For testing. Tested-by: Dieter Nützel <[email protected]>
*	gallium/u_tests: test FBFETCH and shader-based blending with MSAA	Marek Olšák	2018-04-02	1	-40/+128
\| \| \| \|	Tested-by: Dieter Nützel <[email protected]>
*	ac/gpu_info: print GB_ADDR_CONFIG	Marek Olšák	2018-04-02	2	-0/+51
\|
*	ac/gpu_info: reorder the fields and print them nicely	Marek Olšák	2018-04-02	2	-55/+76
\|
*	ac/gpu_info: rename has_virtual_memory -> r600_has_virtual_memory	Marek Olšák	2018-04-02	8	-25/+25
\|
*	ac/gpu_info: don't print irrelevant fields	Marek Olšák	2018-04-02	1	-5/+0
\|
*	st/mesa: don't draw if the bound element array buffer is not allocated	Marek Olšák	2018-04-02	1	-0/+7
\| \| \| \|	Tested-by: Dieter Nützel <[email protected]>
*	anv/cmd_buffer: honor pending clear views for depth/stencil attachments	Iago Toral Quiroga	2018-04-02	1	-1/+21
\| \| \| \| \| \| \| \| \| \| \| \|	v2: rebased on top of subpass rework. v3: rebased v4: - rebased - reset pending clear views in one go rather one bit at a time (Caio) Reviewed-by: Jason Ekstrand <[email protected]>
*	anv/cmd_buffer: consider multiview masks for tracking pending clear aspects	Iago Toral Quiroga	2018-04-02	2	-3/+96
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When multiview is active a subpass clear may only clear a subset of the attachment layers. Other subpasses in the same render pass may also clear too and we want to honor those clears as well, however, we need to ensure that we only clear a layer once, on the first subpass that uses a particular layer (view) of a given attachment. This means that when we check if a subpass attachment needs to be cleared we need to check if all the layers used by that subpass (as indicated by its view_mask) have already been cleared in previous subpasses or not, in which case, we must clear any pending layers used by the subpass, and only those pending. v2: - track pending clear views in the attachment state (Jason) - rebased on top of fast-clear rework. v3: - rebased on top of subpass rework. v4: rebased. v5 (Caio): - Rebased. - Initialize pending clear views to only have bits set for layers that exist. - Reset pending clear views in one go rather one bit at a time. - Put "last subpass for this attachment" condition in a separate function to simplify the conditional that resets pending_clear_aspects. Fixes: dEQP-VK.multiview.readback_implicit_clear.* Reviewed-by: Jason Ekstrand <[email protected]>
*	radeonsi/nir: fix explicit component packing for geom/tess doubles	Timothy Arceri	2018-04-02	1	-8/+11
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi/nir: gather buffers declared more accurately and use const fast path	Timothy Arceri	2018-04-02	2	-6/+90
\| \| \| \| \| \| \| \|	For now we skip SI && HAVE_LLVM < 0x0600 for simplicity. We also skip setting the more accurate masks for builtin uniforms for now as it causes some piglit regressions. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: create load_const_buffer_desc_fast_path() helper	Timothy Arceri	2018-04-02	1	-39/+49
\| \| \| \| \| \| \| \|	This will be shared by the TGSI and NIR backends. For simplicity we leave the SI LLVM 5.0 and lower work around only in the TGSI backend. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi/nir: set TGSI_PROPERTY_NEXT_SHADER	Timothy Arceri	2018-04-02	1	-0/+3
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	st/glsl_to_nir: gather next_stage in shader_info	Timothy Arceri	2018-04-02	2	-0/+20
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	freedreno/a5xx: don't align height for PIPE_BUFFER	Rob Clark	2018-04-01	1	-1/+1
\| \| \| \| \| \| \| \| \|	Buffers can be large, so we probably don't want to make them all 32x bigger. But they can't be rendered to (at least in GL) so we don't need this workaround to prevent page faults on mem<->gmem. Cc: "18.0" <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a5xx: fix page faults on last level	Rob Clark	2018-04-01	1	-0/+10
\| \| \| \| \| \| \| \| \| \|	We could alternatively fall back to using "old style" draw's for mem<->gmem (ie. what <= a4xx do) when height is not aligned to 32, but that is somewhat more work (and not really something that could be applied to stable) Cc: "18.0" <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: fix issue w/ glamor composite shaders	Rob Clark	2018-03-31	2	-2/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixes an issue that became possible when we started lowering phi webs to regs (a7ea2b4e) (although was not really seen until we also switched to using peephole select pass (ec8bc54a) instead of lowering all if/else to select). If texture coord (or anything else that uses create_collect() to collect scalar values in a sequence of scalar registers) was consuming a value produced on either side of an if/else (ie. a phi lowered to nir reg, which in ir3 is an "array" of length 1) then register allocation would happen incorrectly and we'd end up sampling from garbage coordinates. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: more half-precision fixes	Rob Clark	2018-03-31	2	-8/+37
\| \| \| \| \| \| \| \|	Some instructions require src/dst to be in full or half precision register depending on src/dst type. So do a better job of propagating register type. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add helper to create immed of specified size	Rob Clark	2018-03-31	1	-4/+11
\| \| \| \| \| \| \|	We'll also need to be able to create a half-precision immediate. So re-work create_immed(). Prep work for following patch. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: pass ctx instead of block to create_collect()	Rob Clark	2018-03-31	1	-18/+19
\| \| \| \| \| \|	Prep work for following patch. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: eliminate unused false-deps	Rob Clark	2018-03-31	2	-11/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously false-dependencies would get flagged as used, even if the only "use" was a false dep to (for example) prevent a load from being scheduled after a store. In addition to being pointless instructions, in some cases they can cause problems. For example, ldg (and similar instructions) depend on an immed arg getting CP'd into the instruction, but this doesn't happen if an instruction is otherwise unused. Which can result in undefined results (overwriting unintended registers). Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add local_group_size	Rob Clark	2018-03-31	3	-2/+12
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: clear SSA flag when assigning "ARRAY" regs too	Rob Clark	2018-03-31	1	-0/+1
\| \| \| \| \| \|	Avoids a misleading "INVALID FLAGS" warning in debug builds. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: print array live ranges	Rob Clark	2018-03-31	1	-4/+10
\| \| \| \| \| \|	This is also useful to see if optmsgs are enabled. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: a2xx: Implement DP2 instruction	Wladimir J. van der Laan	2018-03-31	1	-0/+21
\| \| \| \| \| \| \| \|	Use DOT2ADDv instruction with 0.0f constant add. Signed-off-by: Wladimir J. van der Laan <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Rob Clark <[email protected]>