mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	ddebug: print the command line to all logs (v2)	Marek Olšák	2016-08-03	1	-0/+4
\| \| \| \| \| \| \| \|	for piglit with the pipelined hang detection mode v2: rebase on top of Brian's commit Reviewed-by: Nicolai Hähnle <[email protected]>
*	ddebug: don't use fmemopen on non-Linux OS	Marek Olšák	2016-08-03	1	-0/+5
\| \| \| \| \| \|	Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97140 Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: don't set the last parameter component of llvm.AMDGPU.cube	Marek Olšák	2016-08-03	1	-2/+8
\| \| \| \| \| \|	LLVM doesn't use it. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: use llvm.amdgcn.cube* if available	Marek Olšák	2016-08-03	1	-4/+28
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: use llvm.amdgcn.rsq.f64 if available	Marek Olšák	2016-08-03	1	-1/+2
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: use v_mad_f32 for fma	Marek Olšák	2016-08-03	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	v_fma_f32 runs at FP64 rate (= slow). Alien Isolation and F1 2015 seem to use fma for all d3d multiply-add instructions, which is silly. This tries to restore performance for those games. The main difference between v_mad_f32 and v_fma_f32 is that v_mad doesn't support denormals, which we don't enable anyway, because they are slow too. Also, there is code size reduction: Totals from affected shaders: VGPRS: 109796 -> 109808 (0.01 %) Spilled SGPRs: 29995 -> 30022 (0.09 %) Spilled VGPRs: 12 -> 13 (8.33 %) <-- it's just one shader going from 12 to 13 Code Size: 6667596 -> 6476356 (-2.87 %) bytes Max Waves: 26931 -> 26899 (-0.12 %) I've not actually tested real performance. Reviewed-by: Nicolai Hähnle <[email protected]>
*	swr: build swr with -fno-strict-aliasing	Tim Rowley	2016-08-02	1	-0/+1
\| \| \| \| \| \| \|	swr rasterizer contains numerous data transfers between vectors and ordinary C types. Fixing for strict aliasing will take time. Reviewed-by: Matt Turner <[email protected]>
*	gallium/util: fix align64	Marek Olšák	2016-08-01	1	-1/+1
\| \| \| \| \| \| \| \|	it cut off the upper 32 bits Cc: [email protected] Reviewed-by: Ilia Mirkin <[email protected]> Reviewed-by: Edward O'Callaghan <[email protected]>
*	draw: Avoid aliasing violations.	Matt Turner	2016-08-01	2	-3/+6
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	r600g: Avoid aliasing violations.	Matt Turner	2016-08-01	2	-13/+9
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	r300g: Avoid aliasing violation.	Matt Turner	2016-08-01	1	-1/+2
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	gallium/auxiliary: Add u_bitcast.h header.	Matt Turner	2016-08-01	2	-0/+58
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	auxiliary/os: add new os_get_command_line() function	Brian Paul	2016-08-01	2	-0/+52
\| \| \| \| \| \| \| \| \| \| \|	This can be used by the driver to get the command line which started the process. Will be used by the VMware driver for extra logging. For now, this is only implemented for Linux via /proc/self/cmdline and Windows via GetCommandLine(). Reviewed-by: Charmaine Lee <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	svga: avoid redundant SetVertexBuffer/SetIndexBuffer commands at rebind	Charmaine Lee	2016-08-01	1	-16/+19
\| \| \| \| \| \| \| \| \| \| \| \|	This patch eliminates the redundant SetVertexBuffers and SetIndexBuffer commands that are emitted for rebind purpose. With this patch, the set commands will be skipped, but we will still reference the associated resources to allow the kernel to bring in the resources. Tested with Lightsmark2008, Valley, MTT glretrace, piglit, conform. Reviewed-by: Brian Paul <[email protected]>
*	u_vbuf: fix potentially bogus assert	Rob Clark	2016-08-01	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are cases where we hit u_vbuf path due to alignment or pitch- alignment restrictions, but for an output-format that u_vbuf does not support translating (yet the driver does support natively). In which case we hit the memcpy() path and don't care that u_vbuf doesn't understand it. Fixes crash with debug build of mesa in: dEQP-GLES3.functional.vertex_arrays.single_attribute.strides.fixed.user_ptr_stride17_components2_quads1 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95000 Signed-off-by: Rob Clark <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	vc4: Zero-initialize the hardware sampler view structure.	Eric Anholt	2016-07-31	1	-1/+1
\| \| \| \| \|	Fixes failure to initialize the force_first_level flag, causing failures in piglit levelclamp.
*	Revert "gallium/util: fix resource leak"	Roland Scheidegger	2016-07-30	1	-2/+0
\| \| \| \| \| \|	This reverts commit d1fe26a62862f4e47a799222dca1bc1dc14ca4af. Replacing a resource leak with a segfault isn't the solution.
*	gallium/util: fix resource leak	Eric Engestrom	2016-07-30	1	-0/+2
\| \| \| \| \| \| \|	CovID: 401540 Signed-off-by: Eric Engestrom <[email protected]> Signed-off-by: Marek Olšák <[email protected]> Reviewed-by: Edward O'Callaghan <[email protected]>
*	freedreno/a4xx: fix comparison out of range warnings	[email protected]	2016-07-30	1	-7/+7
\| \| \| \| \|	Signed-off-by: Francesco Ansanelli <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: fix comparison out of range warnings	[email protected]	2016-07-30	1	-7/+7
\| \| \| \| \|	Signed-off-by: Francesco Ansanelli <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a2xx: fix comparison out of range warnings	[email protected]	2016-07-30	1	-4/+4
\| \| \| \| \|	Signed-off-by: Francesco Ansanelli <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: init ir3_shader_key with memset()	[email protected]	2016-07-30	1	-1/+2
\| \| \| \| \| \| \|	To silence missing initializers warning Signed-off-by: Francesco Ansanelli <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	gallium/freedreno: move cast to avoid integer overflow	Eric Engestrom	2016-07-30	1	-2/+2
\| \| \| \| \| \| \| \| \|	Previously, the bitshift would be performed on a simple int (32 bits on most systems), overflow, and then be cast to 64 bits. CovID: 1362461 Signed-off-by: Eric Engestrom <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a2xx: remove duplicate assignment	Eric Engestrom	2016-07-30	1	-2/+2
\| \| \| \| \| \|	CovID: 1362445, 1362446 Signed-off-by: Eric Engestrom <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno: defer flush_queue allocation	Rob Clark	2016-07-30	2	-2/+4
\| \| \| \| \| \| \|	Some apps, like warsow, create a bazillion contexts but don't render on most of them. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: add some hw query traces	Rob Clark	2016-07-30	1	-0/+16
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: some locking	Rob Clark	2016-07-30	9	-23/+157
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	os: add pipe_mutex_assert_locked()	Rob Clark	2016-07-30	1	-0/+16
\| \| \| \| \| \| \|	Would be nice if we could also have lockdep, like in the linux kernel. But this is better than nothing. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: drop needs_rb_fbd	Rob Clark	2016-07-30	6	-31/+12
\| \| \| \| \| \| \| \| \| \| \|	We need to emit RB_FRAME_BUFFER_DIMENSION once per batch.. tracking this in fd_context is wrong when the gmem code executes asynchronously from the flush_queue worker. But in fact we don't really need to track it at all. We cannot assume previous value at the beginning of the batch (because of other processes potentially using the GPU), so just drop the tracking and emit it in _tile_init(). Signed-off-by: Rob Clark <[email protected]>
*	freedreno: move needs_wfi into batch	Rob Clark	2016-07-30	19	-94/+93
\| \| \| \| \| \| \|	This is also used in gmem code, which executes from the "bottom half" (ie. from the flush_queue worker thread), so it cannot be in fd_context. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: a bit of micro-optimization	Rob Clark	2016-07-30	2	-10/+10
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: drop mem2gmem/gmem2mem query stages	Rob Clark	2016-07-30	2	-17/+1
\| \| \| \| \| \| \| \| \|	They weren't really used, and it gets somewhat more complicated to deal with if batches are flushed asynchronously (on another thread). So just drop them, and move _query_set_state(NULL) call into batch (so it is not happening on background thread). Signed-off-by: Rob Clark <[email protected]>
*	freedreno: threaded batch flush	Rob Clark	2016-07-30	9	-26/+99
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With the state accessed from GMEM+submit factored out of fd_context and into fd_batch, now it is possible to punt this off to a helper thread. And more importantly, since there are cases where one context might force the batch-cache to flush another context's batches (ie. when there are too many in-flight batches), using a per-context helper thread keeps various different flushes for a given context serialized. TODO as with batch-cache, there are a few places where we'll need a mutex to protect critical sections, which is completely missing at the moment. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: track batch/blit types	Rob Clark	2016-07-30	5	-24/+52
\| \| \| \| \| \| \| \| \| \| \| \|	Add a bit of extra book-keeping about blits and back-blits (from resource shadowing). If the app uploads all mipmap levels, as opposed to uploading the first level and then glGenerateMipmap(), we can discard the back-blit (as opposed to being naive and shadowing the resource for each mipmap level). Also, after a normal blit, we might as well flush the batch immediately, since there is not likely to be further rendering to the surface. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: re-order support for hw queries	Rob Clark	2016-07-30	19	-264/+288
\| \| \| \| \| \| \| \| \| \| \|	Push query state down to batch, and use the resource tracking to figure out which batch(es) need to be flushed to get the query result. This means we actually need to allocate the prsc up front, before we know the size. So we have to add a special way to allocate an un- backed resource, and then later allocate the backing storage. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: use prsc for hw queries	Rob Clark	2016-07-30	3	-35/+45
\| \| \| \| \| \| \| \| \|	Switch to using a pipe_resource (rather than an fd_bo directly) for hw query result buffers. This is first step towards making queries work properly with reordered batches, since we'll need the additional dependency tracking to know which batches to flush. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: support discarding previous rendering in special cases	Rob Clark	2016-07-30	3	-5/+32
\| \| \| \| \| \| \| \| \| \|	Basically, to "DCE" blits triggered by resource shadowing, in cases where the levels are immediately completely overwritten. For example, mid-frame texture upload to level zero triggers shadowing and back-blits to the remaining levels, which are immediately overwritten by glGenerateMipmap(). Signed-off-by: Rob Clark <[email protected]>
*	freedreno: shadow textures if possible to avoid stall/flush	Rob Clark	2016-07-30	3	-11/+211
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	To make batch re-ordering useful, we need to be able to create shadow resources to avoid a flush/stall in transfer_map(). For example, uploading new texture contents or updating a UBO mid-batch. In these cases, we want to clone the buffer, and update the new buffer, leaving the old buffer (whose reference is held by cmdstream) as a shadow. This is done by blitting the remaining other levels (and whatever part of current level that is not discarded) from the old/shadow buffer to the new one. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: spiff up some debug traces	Rob Clark	2016-07-30	6	-6/+18
\| \| \| \| \| \| \|	Make it easier to track batches, to ensure things happen properly when they are reordered. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: add batch-cache and batch reordering	Rob Clark	2016-07-30	15	-111/+760
\| \| \| \| \| \| \| \| \| \| \| \|	Note that I originally also had a entry-point that would construct a key and do lookup from a pipe_surface. I ended up not needing that (yet?) but it is easy-enough to re-introduce later if we need it for the blit path. For now, not enabled by default, but can be enabled (on a3xx/a4xx) with FD_MESA_DEBUG=reorder. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: move more batch related tracking to fd_batch	Rob Clark	2016-07-30	23	-398/+420
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	To flush batches out of order, the gmem code needs to not depend on state from fd_context (since that may apply to a more recent batch). So this all moves into batch. The one exception is the gmem/pipe/tile state itself. But this is only used from gmem code (and batches are flushed serially). The alternative would be having to re-calculate GMEM layout on every batch, even if the dimensions of the render targets are the same. Note: This opens up the possibility of pushing gmem/submit into a helper thread. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: dynamically sized/growable cmd buffers	Rob Clark	2016-07-30	2	-23/+33
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: push resource tracking down into batch	Rob Clark	2016-07-30	7	-42/+51
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: introduce fd_batch	Rob Clark	2016-07-30	20	-177/+252
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Introduce the batch object, to track a batch/submit's worth of ringbuffers and other bookkeeping. In this first step, just move the ringbuffers into batch, since that is mostly uninteresting churn. For now there is just a single batch at a time. Note that one outcome of this change is that rb's are allocated/freed on each use. But the expectation is that the bo pool in libdrm_freedreno will save us the GEM bo alloc/free which was the initial reason to implement a rb pool in gallium. The purpose of the batch is to eventually facilitate out-of-order rendering, with batches associated to framebuffer state, and tracking the dependencies on other batches. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: limit non-user constant buffers to a4xx	Rob Clark	2016-07-29	1	-1/+1
\| \| \| \| \| \| \|	Seems to mostly work on a3xx. Except when it doesn't and kills gpu quite badly. Signed-off-by: Rob Clark <[email protected]>
*	virgl: add exported dmabuf to BO hash table	Rob Herring	2016-07-29	1	-0/+3
\| \| \| \| \| \| \| \| \|	Exported dmabufs can get imported by the same process, but the handle was not getting added to the hash table on export. Add the handle to the hash table on export. Signed-off-by: Rob Herring <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	vc4: automake: remove vc4_drm.h from the sources lists	Emil Velikov	2016-07-28	1	-1/+0
\| \| \| \| \| \| \| \| \|	The file was removed with earlier commit breaking 'make dist'. Drop it from Makefile.sources since it's no longer around. Fixes: 16985eb308e ("vc4: Switch to using the libdrm-provided vc4_drm.h.") Signed-off-by: Emil Velikov <[email protected]>
*	ddebug: use pclose to close a popen()'d FILE	Nicolai Hähnle	2016-07-28	1	-1/+1
\| \| \| \| \| \|	Found by Coverity. Reviewed-by: Marek Olšák <[email protected]>
*	clover: make GCC 4.8 happy	Dieter Nützel	2016-07-27	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Without this GCC 4.8.x throws below error: error: invalid initialization of non-const reference of type 'clover::llvm::compat::raw_ostream_to_emit_file {aka llvm::raw_svector_ostream&}' from an rvalue of type '<brace-enclosed initializer list>' v2: change commit title and add error message like Eric Engestrom requested Signed-off-by: Dieter Nützel <[email protected]> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97019 [ Francisco Jerez: Trivial formatting fix. ] Reviewed-by: Francisco Jerez <[email protected]>
*	nvc0: enable ARB_tessellation_shader on GM107+	Samuel Pitoiset	2016-07-27	1	-3/+0
\| \| \| \| \| \| \|	This exposes OpenGL 4.1 on Maxwell (tested on GM107 and GM206). Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>