mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	freedreno/ir3: output ir3 and nir asm for frameretrace	Rob Clark	2018-07-18	4	-0/+69
\| \| \| \| \| \|	See: https://github.com/janesma/apitrace/commit/298dc8195bf082fe1f47aa474e28411f85dd5393 Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: redirectable ir3 disasm output	Rob Clark	2018-07-18	3	-50/+48
\| \| \| \| \| \| \| \| \| \| \|	For now it still goes to stdout, this will make it easier to support output on stderr like what frameretrace expects. (If we eventually have a proper GL extension for this, implementation probably looks like dumping shader disasm to a tmp file and then dumping that out over whatever mechanism is used.) Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: resync ir3 disassembler	Rob Clark	2018-07-18	4	-191/+209
\| \| \| \| \| \| \|	Pull in latest updates from cffdump in envytools tree, so we can output to other than just stdout. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: register usage queries	Rob Clark	2018-07-18	8	-22/+91
\| \| \| \| \| \| \|	Avg number of (half) regs per draw, so we can corrolate fps dips to shader register usage. Signed-off-by: Rob Clark <[email protected]>
*	nir: add lowering for gl_HelperInvocation	Rob Clark	2018-07-18	5	-0/+36
\| \| \| \| \| \| \| \| \|	v2: reword comment about lower_helper_invocations to be more clear that it might not work on all hardware v3: add special variant of load_sample_id which does not imply per- sample shading Signed-off-by: Rob Clark <[email protected]>
*	mesa: don't double incr/decr ActiveCounters	Rob Clark	2018-07-18	1	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \|	Frameretrace ends up w/ excess calls to SelectPerfMonitorCountersAMD() which ends up re-enabling already enabled counters. Which causes ActiveCounters[group] to be double incremented for the same counter. This causes BeginPerfMonitorAMD() to fail. The AMD_performance_monitor spec doesn't say that an error should be generated in this case. So I think the safe thing to do is just safe- guard against excess increments/decrements. Signed-off-by: Rob Clark <[email protected]>
*	mesa: fix error msg typo	Rob Clark	2018-07-18	1	-1/+1
\| \| \| \| \|	Signed-off-by: Rob Clark <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	nir: fixup intrinsic comment	Rob Clark	2018-07-18	1	-1/+1
\| \| \| \| \| \| \|	Now the deref is the first src. Signed-off-by: Rob Clark <[email protected]> Reviewed-by: Samuel Iglesias Gonsálvez <[email protected]>
*	mesa: handle a bunch of formats in IMPLEMENTATION_COLOR_READ_*	Tomeu Vizoso	2018-07-18	1	-32/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Virgl could save a lot of work converting buffers in the host side between formats if Mesa supported a bunch of other formats when reading pixels. This commit adds cases to handle specific formats so that the values reported by the two calls match more closely the underlying native formats. In GLES is important that IMPLEMENTATION_COLOR_READ_* return the native format and data type because the spec only allows reading with those, besides GL_RGBA or GL_RGBA_INTEGER. Additionally, because virgl currently doesn't implement such conversions, this commit fixes several tests in dEQP-GLES3.functional.fbo.color.clear., when using virgl in the guest side. The logic is based on knowledge that is shared with _mesa_format_matches_format_and_type() but we cannot assert that the results match as we don't have all the starting information at both points. So leave the assert out and hope CI comes soon to save us all. v2: Let R10G10B10A2_UINT fall back to GL_RGBA_INTEGER (Eric Anholt) * Assert with _mesa_format_matches_format_and_type (Eric Anholt) v3: * Remove the assert, as it won't be reliable (Eric Anholt) v4: * Use _mesa_is_format_integer in the fallback (Eric Anholt) v5: * Remove superfluous call to _mesa_uncompressed_format_to_type_and_comps (Eric Anholt) Reviewed-by: Gurchetan Singh <[email protected]> Reviewed-by: Eric Anholt <[email protected]> Signed-off-by: Tomeu Vizoso <[email protected]> Signed-off-by: Jakob Bornecrantz <[email protected]>
*	radv: add support for VK_EXT_conditional_rendering	Samuel Pitoiset	2018-07-18	8	-1/+111
\| \| \| \| \| \| \|	Inherited commands buffers are not supported. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv: add support for non-inverted conditional rendering	Samuel Pitoiset	2018-07-18	3	-5/+17
\| \| \| \| \| \| \| \| \| \| \|	By default, our internal rendering commands are discarded only if the predicate is non-zero (ie. DRAW_VISIBLE). But VK_EXT_conditional_rendering also allows to discard commands when the predicate is zero, which means we have to use a different flag. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv: set the predicate for indirect/indexed draw commands	Samuel Pitoiset	2018-07-18	1	-3/+4
\| \| \| \| \| \| \| \|	VK_EXT_conditional_rendering allows to discard draw commands (not only normal draws). Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv: set the predicate for dispatch commands	Samuel Pitoiset	2018-07-18	1	-3/+4
\| \| \| \| \| \| \|	VK_EXT_conditional_rendering allows to discard dispatch commands. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	i965: batchbuffer: write correct canonical offset with softpin	Lionel Landwerlin	2018-07-18	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Addresses in the command streams should be in canonical form (i.e bit[63:48] == bit[47]). If the [bo->gtt_offset, bo->gtt_offset + target_offset] range contains the address 0x800000000000, the current code will fail that criteria. v2: Fix missing include (Lionel) Fixes: 1c9053d0765dc6 ("i965: Prepare batchbuffer module for softpin support.") Signed-off-by: Lionel Landwerlin <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	radv: remove unused variable in radv_CreateRenderPass2KHR()	Samuel Pitoiset	2018-07-18	1	-1/+0
\| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]>
*	radv: optimize radv_stage_flush() for pre fragment shader stages	Samuel Pitoiset	2018-07-18	1	-5/+5
\| \| \| \| \| \| \| \| \| \|	We don't need to emit PS_PARTIAL_FLUSH for the pre fragment shader stages (ie. geometry/tessellation). Emitting VS_PARTIAL_FLUSH is enough for these stages. Note that PS_PARTIAL_FLUSH also synchronizes all vertex stages. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	anv: fix assert in anv_CmdBindDescriptorSets()	Samuel Iglesias Gonsálvez	2018-07-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	The assert is checking that we are not binding more descriptor sets than the supported by the driver. When binding the descriptor set number MAX_SETS-1, it was breaking the assert because descriptorSetCount = 1. Signed-off-by: Samuel Iglesias Gonsálvez <[email protected]> Cc: <[email protected]> Reviewed-by: Lionel Landwerlin <[email protected]>
*	clover: Report error when pipe driver fails to create compute state	Jan Vesely	2018-07-17	1	-0/+4
\| \| \| \| \| \|	CC: <[email protected]> Signed-off-by: Jan Vesely <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	clover: Catch errors from executing event action	Jan Vesely	2018-07-17	1	-1/+3
\| \| \| \| \| \| \| \| \|	Abort all dependent events. v2: Abort the current event as well. CC: <[email protected]> Signed-off-by: Jan Vesely <[email protected]> Reviewed-by: Francisco Jerez <[email protected]>
*	nir: add a couple of ior opts to nir_opt_algebraic	Timothy Arceri	2018-07-18	1	-0/+3
\| \| \| \| \| \|	One of these was seen in a Deus Ex shader. Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: allow opt_peephole_select to handle nir_instr_type_deref	Timothy Arceri	2018-07-18	1	-0/+1
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	r600: fix warnings when unref'ing pool->bo	Marek Olšák	2018-07-17	1	-3/+3
\|
*	r600g: some -Wsign-compare fixes	Konstantin Kharlamov	2018-07-17	6	-14/+13
\| \| \| \| \|	Signed-off-by: Konstantin Kharlamov <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	st/glx: constify some variables	Konstantin Kharlamov	2018-07-17	1	-1/+1
\| \| \| \| \| \| \|	Just a nice hint for both peoples and compilers. Signed-off-by: Konstantin Kharlamov <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	st/nine: constify some variables	Konstantin Kharlamov	2018-07-17	2	-6/+6
\| \| \| \| \| \| \|	Just a nice hint for both peoples and compilers. Signed-off-by: Konstantin Kharlamov <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	r600g: constify some variables	Konstantin Kharlamov	2018-07-17	5	-10/+10
\| \| \| \| \| \| \|	Just a nice hint for both peoples and compilers. Signed-off-by: Konstantin Kharlamov <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	r600g: do not use "fast-clear" for small textures (v3)	Konstantin Kharlamov	2018-07-17	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Ported from radeonsi. Improves windowed glxgears ran as vblank_mode=0 glxgears -info -geometry 0+0+512+512 from ≈2270 FPS to ≈2360 FPS. Tested with AMD TURKS. v2: turned out glxgears ignores the option above, the correct way would be "512x512+0+0". Now it can be seen 512x512 actually loses 30 FPS. 300×300 however wins around a hundred FPS, and to leave some room in case results may differ for other cards I want not to nitpick in search of an optimum but to simply leave 300×300 in the code. v3: remove redundant braces, and try harder for the mail to stick to the rest of the series. Signed-off-by: Konstantin Kharlamov <[email protected]> Reviewed-by: Gert Wollny <[email protected]> Signed-off-by: Marek Olšák <[email protected]>
*	freedreno: re-work fd_batch_reference() locking	Rob Clark	2018-07-17	2	-23/+26
\| \| \| \| \| \| \| \|	Annoyingly we still have to briefly drop the lock to unref resources.. but push the lock down into __fd_batch_destroy() so we can invalidate the batch and reset resources before dropping the lock. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: make fd_batch a one-shot thing	Rob Clark	2018-07-17	2	-11/+36
\| \| \| \| \| \| \| \| \| \| \| \| \|	Re-allocate rather than re-use. Originally we had an unnecessarily complex design to avoid re-allocating cmdstream buffers. But now that support for "growable" cmdstream buffers has been in place for a couple years, I guess we can care a bit less about the extra overhead on older kernels. But making the batches one-shot removes a class of potential race conditions vs the flush_queue. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: flush immediately when reading a pending batch	Rob Clark	2018-07-17	2	-30/+32
\| \| \| \| \| \| \| \|	Instead of the reading batch setting a dependency on the writing batch, simply flush the writing batch immediately. This avoids situations where we have to flush the context's current batch later. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: get rid of noop render	Rob Clark	2018-07-17	4	-21/+6
\| \| \| \| \| \| \|	This was basically to avoid a zero-dword IB (indirect-branch), but instead just don't emit the IB packet in that case. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: fix samples=0 vs samples=1 confusion	Rob Clark	2018-07-17	1	-1/+1
\| \| \| \| \| \| \| \| \|	pipe_framebuffer_state can have samples=0 in various cases, which is actually the same thing as samples=1. So use the _get_num_samples() helper to populate the key, to avoid this looking like two distinct fb states to the cache. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: comment for _invalidate_batch()	Rob Clark	2018-07-17	1	-3/+13
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: hold batch references when flushing	Rob Clark	2018-07-17	1	-32/+38
\| \| \| \| \| \| \|	It is possible for a batch to be freed under our feet when flushing, so it is best to hold a reference to all of them up-front. Signed-off-by: Rob Clark <[email protected]>
*	nir/spirv: print id for unsupported alu opcode	Karol Herbst	2018-07-17	1	-1/+1
\| \| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]> Signed-off-by: Karol Herbst <[email protected]>
*	nir: prepare for bumping up max components to 16	Karol Herbst	2018-07-17	13	-53/+57
\| \| \| \| \| \| \| \| \| \| \|	OpenCL knows vector of size 8 and 16. v2: rebased on master (nir_swizzle rework) rework more declarations with nir_component_mask_t adjust print_var_decl Reviewed-by: Jason Ekstrand <[email protected]> Signed-off-by: Karol Herbst <[email protected]>
*	radv/winsys: use alloca() for semaphore dependencies	Samuel Pitoiset	2018-07-17	1	-6/+2
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radv: reduce number of CB/DB meta flushes for VK_ACCESS_TRANSFER_WRITE_BIT	Samuel Pitoiset	2018-07-17	1	-6/+14
\| \| \| \| \| \| \| \|	If we know that the given image doesn't have any metadata, we don't need to flush. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Dave Airlie <[email protected]>
*	radv: fix implementation of VK_KHR_create_renderpass2 for multiviews	Samuel Pitoiset	2018-07-17	1	-12/+1
\| \| \| \| \| \| \| \| \| \| \|	The Vulkan 1.1.80 spec says: "viewMask has the same effect for the described subpass as VkRenderPassMultiviewCreateInfo::pViewMasks has on each corresponding subpass." Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	virgl: respect max_vertex_attrib_stride cap	Erik Faye-Lund	2018-07-17	3	-2/+5
\| \| \| \| \| \|	This is required for OpenGL 4.4 and OpenGL ES 3.1 support. Reviewed-by: Dave Airlie <[email protected]>
*	virgl: Fix flush in virgl_encoder_inline_write.	Lepton Wu	2018-07-17	1	-1/+1
\| \| \| \| \| \| \| \| \|	The current code is buggy: if there are only 12 dwords left in cbuf, we emit a zero data length command which will be rejected by virglrenderer. Fix it by calling flush in this case. Cc: [email protected] Reviewed-by: Dave Airlie <[email protected]>
*	virgl: implement set_min_samples	Erik Faye-Lund	2018-07-17	5	-0/+28
\| \| \| \| \| \| \| \|	This allows us to implement glMinSampleShading correctly, which up until now just got ignored. Signed-off-by: Erik Faye-Lund <[email protected]> Reviewed-by: Dave Airlie <[email protected]>
*	glsl: do second pass of const propagation in loops	Caio Marcelo de Oliveira Filho	2018-07-16	1	-6/+23
\| \| \| \| \| \| \| \| \| \| \| \| \|	When handling loops in constant propagation, implement the "FINISHME" comment like copy propagation: perform a first pass to find values that can't be propagated, then perform a second pass with the ACP containing still valid values. Certain values are killed because the loop may run more than one iteration, so we can't copy propagate them as they would be invalid in the later iterations. Reviewed-by: Eric Anholt <[email protected]>
*	glsl: don't let an 'if' then-branch kill const propagation for else-branch	Caio Marcelo de Oliveira Filho	2018-07-16	1	-16/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When handling 'if' in constant propagation, if a certain variable was killed when processing the first branch of the 'if', then the second would get any propagation from previous nodes. This is similar to the change done for copy propagation code. x = 1; if (...) { z = x; // This would turn into z = 1. x = 22; // x gets killed. } else { w = x; // This would NOT turn into w = 1. } With the change, we let constant propagation happen independently in the two branches and only then apply the killed values for the subsequent code. The new code use a single hash table for keeping the kills of both branches (the branches only write to it), and it gets deleted after we use -- instead of waiting for mem_ctx to collect it. NIR deals well with constant propagation, so it already covered for the missing ones that this patch fixes. Reviewed-by: Eric Anholt <[email protected]>
*	v3d: Disable shader-db cycle estimates until we sort out TMU estimates.	Eric Anholt	2018-07-16	1	-1/+4
\| \| \| \| \|	I keep having to ignore these shader-db changes since I don't trust them, so just disable the reports entirely.
*	v3d: Emit the lowered uniform just before its first use in a block.	Eric Anholt	2018-07-16	1	-20/+18
\| \| \| \| \| \| \| \|	total instructions in shared programs: 98578 -> 98119 (-0.47%) instructions in affected programs: 27571 -> 27112 (-1.66%) and it also eliminates most spills/fills on the CTS's randomized uniform usage testcases.
*	v3d: Add an assert that we don't provide an invalid texture return words.	Eric Anholt	2018-07-16	1	-0/+8
\| \| \| \|	The docs had an update noting this restriction, so reflect it in the code.
*	v3d: Apply GFXH-1625 restriction on TMUWT in the end of the shader.	Eric Anholt	2018-07-16	1	-0/+4
\| \| \| \| \|	This doesn't affect us yet since we're not doing TMUWTs, but I think we will for GLES 3.1.
*	intel/batch_decoder: decoding of 3DSTATE_CONSTANT_BODY.	Sergii Romantsov	2018-07-16	2	-30/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	SNB doesn't have a definition of 3DSTATE_CONSTANT_BODY, thats why we got segmentation fault when used INTEL_DEBUG=bat. Fixed by adding of 3DSTATE_CONSTANT_BODY into 3DSTATE_CONSTANT of VS, GS and PS structures. v2: added definition of 3DSTATE_CONSTANT_BODY to the gen6.xml Fixes: 169d8e011ae (intel: Fix 3DSTATE_CONSTANT buffer decoding.) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107190 Signed-off-by: Sergii Romantsov <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	r600: fix build after the removal of RADEON_PRIO_* flags	Marek Olšák	2018-07-16	7	-21/+12
\|