mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	virgl: store layer_stride in metadata	Gurchetan Singh	2018-12-19	2	-6/+6
\| \| \| \|	Reviewed-by: Elie Tournier <[email protected]>
*	virgl: move vrend_get_tex_image_offset to common code	Gurchetan Singh	2018-12-19	3	-26/+28
\| \| \| \| \| \|	Will be reused. Reviewed-by: Elie Tournier <[email protected]>
*	virgl: move virgl_resource_layout to common code	Gurchetan Singh	2018-12-19	3	-42/+51
\| \| \| \| \| \|	Will be reused. Reviewed-by: Elie Tournier <[email protected]>
*	virgl: move texture metadata to common code	Gurchetan Singh	2018-12-19	2	-12/+18
\| \| \| \| \| \|	Will be reused. Reviewed-by: Elie Tournier <[email protected]>
*	virgl: remove unnessecary code	Gurchetan Singh	2018-12-19	1	-3/+0
\| \| \| \| \| \| \| \| \|	With commit 89b479, we moved to tracking buffer cleanliness when binding. TEST=dEQP-GLES31.functional.image_load_store.buffer.load_store.r32ui Reviewed-by: Elie Tournier <[email protected]>
*	virgl: texture_transfer_pool --> transfer_pool	Gurchetan Singh	2018-12-19	6	-11/+11
\| \| \| \| \| \|	It's used for all types of resources. Reviewed-by: Elie Tournier <[email protected]>
*	radeonsi: const-ify the si_query_ops	Nicolai Hähnle	2018-12-19	3	-5/+5
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: split perfcounter queries from si_query_hw	Nicolai Hähnle	2018-12-19	1	-50/+93
\| \| \| \| \| \| \|	Remove a level of indirection to make the code more explicit -- should make it easier to follow what's going on. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: factor si_query_buffer logic out of si_query_hw	Nicolai Hähnle	2018-12-19	4	-110/+99
\| \| \| \| \| \| \| \| \| \| \|	This is a move towards using composition instead of inheritance for different query types. This change weakens out-of-memory error reporting somewhat, though this should be acceptable since we didn't consistently report such errors in the first place. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: move query suspend logic into the top-level si_query struct	Nicolai Hähnle	2018-12-19	3	-44/+62
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: move remaining perfcounter code into si_perfcounter.c	Nicolai Hähnle	2018-12-19	7	-766/+643
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: track constant buffer bind history in si_pipe_set_constant_buffer	Nicolai Hähnle	2018-12-19	1	-2/+3
\| \| \| \| \| \|	Other callers of si_set_constant_buffer don't need it. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: use si_set_rw_shader_buffer for setting streamout buffers	Nicolai Hähnle	2018-12-19	1	-50/+11
\| \| \| \| \| \|	Reduce the number of places that encode buffer descriptors. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: add an si_set_rw_shader_buffer convenience function	Nicolai Hähnle	2018-12-19	2	-45/+64
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: avoid using hard-coded SI_NUM_RW_BUFFERS	Nicolai Hähnle	2018-12-19	1	-1/+2
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: show the fixed function TCS in debug dumps	Nicolai Hähnle	2018-12-19	1	-2/+8
\| \| \| \| \| \|	This is rather important for merged VS/TCS as LSHS shaders... Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: const-ify si_set_tesseval_regs	Nicolai Hähnle	2018-12-19	1	-2/+2
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: rename SI_RESOURCE_FLAG_FORCE_TILING to clarify its purpose	Nicolai Hähnle	2018-12-19	3	-4/+4
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: don't set RAW_WAIT for CP DMA clears	Nicolai Hähnle	2018-12-19	1	-1/+2
\| \| \| \| \| \|	There is never a read-after-write hazard because the command doesn't read. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi/gfx9: use SET_UCONFIG_REG_INDEX packets when available	Nicolai Hähnle	2018-12-19	2	-5/+15
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: add si_init_draw_functions and make some functions static	Nicolai Hähnle	2018-12-19	4	-22/+22
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: extract declare_vs_blit_inputs	Nicolai Hähnle	2018-12-19	1	-18/+25
\| \| \| \| \| \|	Prepare for some later refactoring. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: move SI_FORCE_FAMILY functionality to winsys	Nicolai Hähnle	2018-12-19	2	-34/+36
\| \| \| \| \| \| \|	This helps some debugging cases by initializing addrlib with slightly more appropriate settings. Reviewed-by: Marek Olšák <[email protected]>
*	r600: remove redundant semicolon	Nicolai Hähnle	2018-12-19	1	-1/+1
\| \| \| \| \|	Reviewed-By: Gert Wollny <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	ddebug: always flush when requested, even when hang detection is disabled	Nicolai Hähnle	2018-12-19	1	-0/+2
\| \| \| \|	Reviewed-by: Marek Olšák <[email protected]>
*	ddebug: simplify watchdog loop and fix crash in the no-timeout case	Nicolai Hähnle	2018-12-19	3	-73/+52
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The following race condition could occur in the no-timeout case: API thread Gallium thread Watchdog ---------- -------------- -------- dd_before_draw u_threaded_context draw dd_after_draw add to dctx->records signal watchdog dump & destroy record execute draw dd_after_draw_async use-after-free! Alternatively, the same scenario would assert in a debug build when destroying the record because record->driver_finished has not signaled. Fix this and simplify the logic at the same time by - handing the record pointers off to the watchdog thread before each draw call and - waiting on the driver_finished fence in the watchdog thread Reviewed-by: Marek Olšák <[email protected]>
*	meson: add etnaviv to the tools option	Christian Gmeiner	2018-12-18	1	-1/+2
\| \| \| \| \|	Signed-off-by: Christian Gmeiner <[email protected]> Reviewed-by: Dylan Baker <[email protected]>
*	st/dri: replace format conversion functions with single mapping table	Lucas Stach	2018-12-18	2	-352/+149
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Each time I have to touch the buffer import/export functions in the dri state tracker I get lost in the maze of functions converting between DRI_IMAGE_FOURCC, DRI_IMAGE_FORMAT, DRI_IMAGE_COMPONENTS and pipe format. Rip it out and replace by a single table, which defines the correspondence between the different representations. Also this now stores all the known representations in the __DRIimageRec, to avoid the loss of information we currently have when importing a buffer with a fourcc, which doesn't have a corresponding dri format. Signed-off-by: Lucas Stach <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	st/dri: allow both render and sampler compatible dma-buf formats	Lucas Stach	2018-12-18	1	-12/+18
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently all the EGL APIs are missing a way to specify how an imported dma-buf is intended to be used. Demanding the format to be both usable for sampling and rendering artificially restricts the list of formats a driver is able to import. Looking at how the Intel driver implements those DRI2 image APIs it doesn't distinguish between render or sampler compatible formats. So this patch aligns behavior between Intel and Gallium based drivers. Signed-off-by: Lucas Stach <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	etnaviv: use surface format directly	Lucas Stach	2018-12-18	2	-9/+4
\| \| \| \| \| \| \| \|	There is no need to do the detour over the resource behind the surface to get the format. Use the surface format directly. Signed-off-by: Lucas Stach <[email protected]> Reviewed-by: Philipp Zabel <[email protected]>
*	nir/opt_peephole_select: Don't peephole_select expensive math instructions	Ian Romanick	2018-12-17	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	On some GPUs, especially older Intel GPUs, some math instructions are very expensive. On those architectures, don't reduce flow control to a csel if one of the branches contains one of these expensive math instructions. This prevents a bunch of cycle count regressions on pre-Gen6 platforms with a later patch (intel/compiler: More peephole select for pre-Gen6). v2: Remove stray #if block. Noticed by Thomas. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Thomas Helland <[email protected]> Reviewed-by: Lionel Landwerlin <[email protected]>
*	nir/opt_peephole_select: Don't try to remove flow control around indirect loads	Ian Romanick	2018-12-17	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	That flow control may be trying to avoid invalid loads. On at least some platforms, those loads can also be expensive. No shader-db changes on any Intel platform (even with the later patch "intel/compiler: More peephole select"). v2: Add a 'indirect_load_ok' flag to nir_opt_peephole_select. Suggested by Rob. See also the big comment in src/intel/compiler/brw_nir.c. v3: Use nir_deref_instr_has_indirect instead of deref_has_indirect (from nir_lower_io_arrays_to_elements.c). v4: Fix inverted condition in brw_nir.c. Noticed by Lionel. Signed-off-by: Ian Romanick <[email protected]> Reviewed-by: Lionel Landwerlin <[email protected]>
*	vc4: Reuse nir_format_convert.h in our blend lowering.	Eric Anholt	2018-12-17	1	-33/+3
\| \| \| \| \|	These helpers came along after and have effectively the same implementation.
*	nir: Add a bool to int32 lowering pass	Jason Ekstrand	2018-12-16	2	-0/+4
\| \| \| \| \| \| \| \|	We also enable it in all of the NIR drivers. Reviewed-by: Eric Anholt <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]> Tested-by: Bas Nieuwenhuizen <[email protected]>
*	nir: Rename Boolean-related opcodes to include 32 in the name	Jason Ekstrand	2018-12-16	2	-30/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a squash of a bunch of individual changes: nir/builder: Generate 32-bit bool opcodes transparently nir/algebraic: Remap Boolean opcodes to the 32-bit variant Use 32-bit opcodes in the NIR producers and optimizations Generated with a little hand-editing and the following sed commands: sed -i 's/nir_op_ball_fequal/nir_op_b32all_fequal/g' */.c sed -i 's/nir_op_bany_fnequal/nir_op_b32any_fnequal/g' */.c sed -i 's/nir_op_ball_iequal/nir_op_b32all_iequal/g' */.c sed -i 's/nir_op_bany_inequal/nir_op_b32any_inequal/g' */.c sed -i 's/nir_op_\([fiu]lt\)/nir_op_\132/g' */.c sed -i 's/nir_op_\([fiu]ge\)/nir_op_\132/g' */.c sed -i 's/nir_op_\([fiu]ne\)/nir_op_\132/g' */.c sed -i 's/nir_op_\([fiu]eq\)/nir_op_\132/g' */.c sed -i 's/nir_op_\([fi]\)ne32g/nir_op_\1neg/g' */.c sed -i 's/nir_op_bcsel/nir_op_b32csel/g' */.c Use 32-bit opcodes in the NIR back-ends Generated with a little hand-editing and the following sed commands: sed -i 's/nir_op_ball_fequal/nir_op_b32all_fequal/g' */.c sed -i 's/nir_op_bany_fnequal/nir_op_b32any_fnequal/g' */.c sed -i 's/nir_op_ball_iequal/nir_op_b32all_iequal/g' */.c sed -i 's/nir_op_bany_inequal/nir_op_b32any_inequal/g' */.c sed -i 's/nir_op_\([fiu]lt\)/nir_op_\132/g' */.c sed -i 's/nir_op_\([fiu]ge\)/nir_op_\132/g' */.c sed -i 's/nir_op_\([fiu]ne\)/nir_op_\132/g' */.c sed -i 's/nir_op_\([fiu]eq\)/nir_op_\132/g' */.c sed -i 's/nir_op_\([fi]\)ne32g/nir_op_\1neg/g' */.c sed -i 's/nir_op_bcsel/nir_op_b32csel/g' */.c Reviewed-by: Eric Anholt <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]> Tested-by: Bas Nieuwenhuizen <[email protected]>
*	nir/tgsi: Use nir_bany in ttn_kill_if	Jason Ekstrand	2018-12-16	1	-3/+1
\| \| \| \|	Reviewed-by: Eric Anholt <[email protected]>
*	vc4: Use the original bit size when scalarizing uniform loads.	Eric Anholt	2018-12-16	1	-1/+2
\| \| \| \| \| \|	Prevents a regression in jekstrand's 1-bit series. Reviewed-by: Jason Ekstrand <[email protected]>
*	clover: Fix build after clang r348827	Jan Vesely	2018-12-16	1	-1/+6
\| \| \| \| \| \| \| \| \| \|	CodeGenOptions were moved to Basic. Signed-off-by: Jan Vesely <[email protected]> Reviewed-by: Aaron Watry <[email protected]> Tested-by: Aaron Watry <[email protected]> Reviewed-by: Kai Wasserbäch <[email protected]> CC: [email protected]
*	v3d: Drop in a bunch of notes about performance improvement opportunities.	Eric Anholt	2018-12-14	2	-1/+13
\| \| \| \| \| \|	These have all been floating in my head, and while I've thought about encoding them in issues on gitlab once they're enabled, they also make sense to just have in the area of the code you'll need to work in.
*	v3d: Use the uniform pretty-printer in v3d_write_uniforms()'s debug code.	Eric Anholt	2018-12-14	1	-1/+3
\| \| \| \| \|	This will be a lot easier than my usual "38400.000000? that looks like a viewport scale" decoding strategy.
*	v3d: Move uinfo->data[] dereference to the top of v3d_write_uniforms().	Eric Anholt	2018-12-14	1	-15/+13
\| \| \| \| \| \|	Follows 3954331aff23 ("vc4: Pull uinfo->data[i] dereference out to the top of the loop.") which showed a large performance win for vc4, but also cleans up the code a decent bit.
*	v3d: Add support for draw indirect for GLES3.1.	Eric Anholt	2018-12-14	2	-2/+31
\| \| \| \| \| \|	In trying to enable compute shaders, I found that a bunch of deqp-gles31's compute stuff wanted to interact with indirect dispatch. This was easy to do on its own.
*	v3d: Add safety checks for resource_create().	Eric Anholt	2018-12-14	1	-0/+6
\| \| \| \|	This should ease my debugging next time I screw it up.
*	v3d: Add support for texturing from linear.	Eric Anholt	2018-12-14	6	-3/+110
\| \| \| \| \| \| \|	Just like vc4, we have to support linear shared BOs for X11 on arbitrary displays. When we're faced with a request to texture from one of those, make a shadow image that we copy using the TFU at the start of the draw call.
*	v3d: Add support for using the TFU to do some blits.	Eric Anholt	2018-12-14	1	-42/+129
\| \| \| \|	This will be useful in particular for blits from raster to UIF for X11.
*	v3d: Don't forget to bump the number of writes when doing TFU ops.	Eric Anholt	2018-12-14	1	-0/+2
\| \| \| \| \| \|	generatemipmap is just filling out the rest of the mipmap that's already been written (by a mapping or a draw call), so it didn't matter. As I reuse the TFU code for linear-to-UIF conversions, it'll start mattering.
*	v3d: Set up the right stride for raster TFU.	Eric Anholt	2018-12-14	1	-1/+1
\| \| \| \| \|	I didn't have any raster images in the generatemipmap path, so the pixels-vs-bytes mixup didn't matter here.
*	v3d: Don't forget to wait for our TFU job before rendering from it.	Eric Anholt	2018-12-14	1	-0/+8
\| \| \| \| \| \| \| \|	Otherwise we may race to read old contents. This didn't show up in the CTS and piglit for me, but it did once I started using the TFU to do linear->UIF blits for X11. Fixes: 2ebca177dc18 ("v3d: Use the TFU to do generatemipmap.")
*	nvc0: always keep TSC slot 0 bound to fix TXF	Ilia Mirkin	2018-12-14	2	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \|	Same as on nv50, the TXF op always uses the TSC bound to slot 0, returning blank values if nothing is bound. An earlier change arranges for the TSC entries list to always have valid data at entry 0, so here we just make use of it. Fixes arb_texture_buffer_object-subdata-sync among others. Signed-off-by: Ilia Mirkin <[email protected]>
*	nvc0: replace use of explicit default_tsc with entry 0	Ilia Mirkin	2018-12-14	6	-22/+25
\| \| \| \| \| \| \| \| \| \| \|	This was used for implementing FBFETCH. However that uses TXF, which doesn't do much with a TSC. The only important bit is that sRGB-decoding works as expected, which we can achieve since all samplers we ever generate enable sRGB-decoding. Always point to entry 0 in the TSC table, and ensure that even before it ever gets initialized, the sRGB-decoding enable bit is set. Signed-off-by: Ilia Mirkin <[email protected]>