mesa.git

	Commit message	Author	Age	Files	Lines
*	freedreno/ir3: handle VTXID_BASE for indirect draws	Rob Clark	2017-12-19	1	-2/+41
\| \| \| \| \| \| \| \| \|	Need to do some gymnastics to copy the parameter from the indirect parameters buffer to uniform so shader sees the correct base-vertex-id. Fixes ./bin/arb_draw_indirect-vertexid on a5xx and probably a4xx too. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add ctx->mem_to_mem()	Rob Clark	2017-12-19	4	-14/+49
\| \| \| \| \| \| \| \|	For dealing with indirect-draw + gl_VertexID, we'll introduce another case where we need to use CP_MEM_TO_MEM. Rather than adding more if(a5xx)/else make this a ctx vfunc. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a5xx: use vertex_id_zero_base	Rob Clark	2017-12-19	2	-20/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Cmdstream traces from blob make it clear that the blob driver devs think a5xx has a real (non-zero-based) vtxid. But reality claims differently. Fixes ./bin/gl-3.2-basevertex-vertexid and probably others. This means draw-indirect is going to need some gymnastics to copy base-vertex into uniform. (a4xx probably needs that too.) Signed-off-by: Rob Clark <[email protected]>
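A rough illustration of the zero-based vertex ID situation described in this message: if the hardware counter always starts at zero, the value the shader should see as gl_VertexID can be rebuilt by adding the draw's base vertex, which has to be supplied separately (for example through a uniform). The helper and parameter names below are hypothetical and are not taken from the freedreno source.

```c
#include <stdint.h>

/*
 * Hypothetical helper, not freedreno code.
 * zero_based_id: per-draw vertex counter that starts at 0 (assumed hardware behavior).
 * base_vertex:   base vertex of the draw, delivered separately (e.g. via a uniform).
 * Returns the vertex ID as the shader is expected to observe it.
 */
static int64_t shader_vertex_id(uint32_t zero_based_id, int32_t base_vertex)
{
    /* Widen first so the addition cannot wrap in 32 bits. */
    return (int64_t)zero_based_id + base_vertex;
}
```

For an indirect draw the base vertex lives in a GPU buffer rather than in CPU-visible draw parameters, which is why the related commits copy it into a uniform on the GPU side.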
*	r600: clear compressed flags in image state on unbind.	Dave Airlie	2017-12-19	1	-0/+2
\| \| \| \| \| \| \| \| \|	If we aren't binding an image, clear the compressed flags. This fixes a segfault seen with an apitrace. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104331 Signed-off-by: Dave Airlie <[email protected]>
*	swr: Account for index_bias in offsets	George Kyriazis	2017-12-18	1	-3/+3
\| \| \| \| \| \| \| \| \| \|	When calculating buffer offsets for client buffers account for info.index_bias. Fixes the following piglit tests: arb_draw_elements_base_vertex-drawelements-user_varrays arb_draw_elements_base_vertex-negative-index-user_varrays Reviewed-By: Bruce Cherniak <[email protected]>
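A minimal sketch of the idea, assuming the offset into a client (user-pointer) vertex buffer is derived from the smallest index used by the draw: the index bias has to be added before multiplying by the stride, and it may be negative. The function and parameter names are hypothetical and do not reproduce the swr code.

```c
#include <stdint.h>

/*
 * Hypothetical helper, not the swr implementation.
 * min_index:  smallest index referenced by the draw.
 * index_bias: signed bias added to every index (may be negative).
 * stride:     size of one vertex in bytes.
 * Returns the byte offset of the first vertex that will be fetched.
 */
static int64_t client_buffer_offset(uint32_t min_index, int32_t index_bias,
                                    uint32_t stride)
{
    /* Use 64-bit signed math so a negative bias is handled correctly. */
    return ((int64_t)min_index + index_bias) * (int64_t)stride;
}
```

Ignoring the bias would yield an offset of `min_index * stride`, which points at the wrong vertices whenever the bias is non-zero.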
*	r600: only reported tgsi ir compute support on evergreen+	Dave Airlie	2017-12-18	1	-1/+3
\| \| \| \| \| \|	This fixes a crash on r600/r700. Signed-off-by: Dave Airlie <[email protected]>
*	amd/common: add ac_vgt_gs_mode() helper	Samuel Pitoiset	2017-12-18	1	-29/+3
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	amd/common: add ac_get_cb_shader_mask() helper	Samuel Pitoiset	2017-12-18	1	-33/+1
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	r600: export robust buffer access	Dave Airlie	2017-12-18	1	-1/+1
\| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
*	r600: export GLSL 430	Dave Airlie	2017-12-18	1	-1/+1
\| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
*	r600/cs: add compute support to caps	Dave Airlie	2017-12-18	1	-2/+2
\| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
*	r600: always flush between gfx and compute	Dave Airlie	2017-12-18	5	-0/+21
\| \| \| \| \| \| \| \|	This is in no way optimal, but there seem to be some problems mixing gfx and compute at the moment (lots of hangs). Mixing is possible; we just need to figure out more magic. Signed-off-by: Dave Airlie <[email protected]>
*	r600: fix unused variable warning	Dave Airlie	2017-12-18	1	-1/+0
\| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
*	freedreno/a5xx: add a5xx blitter	Rob Clark	2017-12-17	8	-1/+498
\| \| \| \| \| \|	FD_MESA_DEBUG=noblit to disable Signed-off-by: Rob Clark <[email protected]>
*	freedreno: add generic blitter	Rob Clark	2017-12-17	7	-2/+161
\| \| \| \| \| \| \|	Basically a clone of util_blitter_blit() but with special handling to blit PIPE_BUFFER as a PIPE_TEXTURE_1D. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: add non-draw batches for compute/blit	Rob Clark	2017-12-17	12	-32/+82
\| \| \| \| \| \| \| \| \|	Get rid of "gmem" (ie. tiling) ringbuffer, and just emit setup commands directly to "draw" ringbuffer for compute (and in future for blits not using the 3d pipe). This way we can have a simple flat cmdstream buffer and bypass setup related to 3d pipe. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: track staging and shadow perf ctrs for the HUD	Rob Clark	2017-12-17	5	-0/+16
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: staging upload transfers	Rob Clark	2017-12-17	3	-43/+135
\| \| \| \| \| \| \| \| \| \|	In the busy && !needs_flush case, we can support a DISCARD_RANGE upload using a staging buffer. This is a bit different from the case of mid-batch uploads which require us to shadow the whole resource (because later draws in an earlier tile happen before earlier draws in a later tile). Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update generated headers	Rob Clark	2017-12-17	7	-63/+334
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	radeonsi: don't call force_dcc_off for buffers	Marek Olšák	2017-12-16	1	-1/+1
\| \| \| \| \| \| \| \|	This was undefined yet harmless behavior in LLVM. Not anymore - it causes a hang now. Cc: 17.3 <[email protected]> Tested-by: Michel Dänzer <[email protected]>
*	radeon/uvd: add and manage render picture list	Boyuan Zhang	2017-12-15	1	-4/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Create a list in decoder to store all render picture buffer pointers that are currently being used in reference picture lists. During get message buffer call, check each pointer in render_pic_list[] against the given pic->ref[] list, and remove any pointer that is no longer being used by pic->ref[]. Then add current render surface pointer to the render_pic_list[] and assign the associated index to result.curr_idx. As a result, result.curr_idx will have the correct index to represent the current render picture, instead of the previous incrementing values. Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Christian König <[email protected]>
*	radeon/vcn: add and manage render picture list	Boyuan Zhang	2017-12-15	1	-4/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Create a list in decoder to store all render picture buffer pointers that are currently being used in reference picture lists. During get message buffer call, check each pointer in render_pic_list[] against the given pic->ref[] list, and remove any pointer that is no longer being used by pic->ref[]. Then add current render surface pointer to the render_pic_list[] and assign the associated index to result.curr_idx. As a result, result.curr_idx will have the correct index to represent the current render picture, instead of the previous incrementing values. Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Christian König <[email protected]>
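The list handling described in the two render picture list commits (radeon/uvd and radeon/vcn) can be sketched roughly as below. This is an illustration under stated assumptions, not the decoder code: the slot count `NUM_SLOTS`, the function names, and the choice to reuse an existing slot when the current surface is already listed are all hypothetical.

```c
#include <stddef.h>

#define NUM_SLOTS 16 /* assumption: list capacity chosen for illustration */

/* Return nonzero when ptr appears in refs[0..num_refs). */
static int is_referenced(void *ptr, void **refs, size_t num_refs)
{
    for (size_t i = 0; i < num_refs; i++)
        if (refs[i] == ptr)
            return 1;
    return 0;
}

/*
 * Hypothetical helper: prune the list against the reference pictures,
 * then register the current render surface and return its slot index
 * (the value that would be stored as the current index), or -1 if full.
 */
static int update_render_pic_list(void *list[NUM_SLOTS], void **refs,
                                  size_t num_refs, void *cur)
{
    /* Step 1: drop pointers that the reference list no longer uses. */
    for (int i = 0; i < NUM_SLOTS; i++)
        if (list[i] != NULL && !is_referenced(list[i], refs, num_refs))
            list[i] = NULL;

    /* Step 2 (assumption): reuse the slot if cur is already listed. */
    for (int i = 0; i < NUM_SLOTS; i++)
        if (list[i] == cur)
            return i;

    /* Step 3: store cur in the first free slot. */
    for (int i = 0; i < NUM_SLOTS; i++) {
        if (list[i] == NULL) {
            list[i] = cur;
            return i;
        }
    }
    return -1; /* no free slot */
}
```

With this scheme the returned index identifies the slot that actually holds the current surface, rather than a counter that simply increments on every frame.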
*	radeon/vce: determine idr by pic type	Boyuan Zhang	2017-12-15	1	-1/+1
\| \| \| \| \| \| \| \| \|	The VAAPI encode interface provides idr frame flags, whereas the OMX interface doesn't. Therefore, change to use picture type to determine idr frame, which will work for both interfaces. Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Leo Liu <[email protected]>
*	radeon/vcn: determine idr by pic type	Boyuan Zhang	2017-12-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	The VAAPI encode interface provides idr frame flags, whereas the OMX interface doesn't. Therefore, change to use picture type to determine idr frame, which will work for both interfaces. Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Leo Liu <[email protected]> Reviewed-by: Christian König <[email protected]>
*	swr/rast: Move more RTAI handling out of binner	Tim Rowley	2017-12-15	2	-12/+2
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: EXTRACT2 changed from vextract/vinsert to vshuffle	Tim Rowley	2017-12-15	3	-61/+32
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Fix cache of API thread event manager	Tim Rowley	2017-12-15	1	-1/+1
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Replace VPSRL with LSHR	Tim Rowley	2017-12-15	4	-41/+4
\| \| \| \| \| \| \| \|	Replace use of x86 intrinsic with general llvm IR instruction. Generates the same final assembly. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Rework thread binding parameters for machine partitioning	Tim Rowley	2017-12-15	7	-88/+322
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add BASE_NUMA_NODE, BASE_CORE, BASE_THREAD parameters to SwrCreateContext. Add optional SWR_API_THREADING_INFO parameter to SwrCreateContext to control reservation of API threads. Add SwrBindApiThread() function to allow binding of API threads to reserved HW threads. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Pull of RTAI gather & offset out of clip/bin code	Tim Rowley	2017-12-15	7	-146/+203
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Remove no-op VBROADCAST of vID	Tim Rowley	2017-12-15	1	-2/+2
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: SIMD16 Fetch - Fully widen 32-bit integer vertex components	Tim Rowley	2017-12-15	4	-17/+109
\| \| \| \| \| \|	Also widen the 16-bit and 8-bit integer vertex component gathers to SIMD16. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Replace INSERT2 vextract/vinsert with JOIN2 vshuffle	Tim Rowley	2017-12-15	3	-105/+30
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: SIMD16 Fetch - Fully widen 16-bit float vertex components	Tim Rowley	2017-12-15	1	-7/+48
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: SIMD16 Fetch - Fully widen 32-bit float vertex components	Tim Rowley	2017-12-15	4	-32/+194
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Pass prim to ClipSimd	Tim Rowley	2017-12-15	1	-5/+5
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Pull most of the VPAI manipulation out of the binner/clipper	Tim Rowley	2017-12-15	7	-158/+177
\| \| \| \| \| \|	Move out of binner/clipper; hand them down from the frontend code instead. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Move GatherScissors to header	Tim Rowley	2017-12-15	2	-127/+127
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Rewrite Shuffle8bpcGatherd using shuffle	Tim Rowley	2017-12-15	1	-182/+62
\| \| \| \| \| \|	Ease future code maintenance, prepare for folding simd8 and simd16 versions. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Convert gather masks to Nx1bit	Tim Rowley	2017-12-15	2	-40/+14
\| \| \| \| \| \| \|	Simplifies calling code, gets gather function interface closer to llvm's masked_gather. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: WIP - Widen fetch shader to SIMD16	Tim Rowley	2017-12-15	1	-27/+689
\| \| \| \| \| \|	Widen vertex gather/storage to SIMD16 for all component types. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Corrections to multi-scissor handling	Tim Rowley	2017-12-15	1	-88/+88
\| \| \| \| \| \| \|	binner's GatherScissors() will be turned into a real gather in the not too distant future. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Binner fixes for viewport index offset handling	Tim Rowley	2017-12-15	2	-2/+12
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Remove unneeded copy of gather mask	Tim Rowley	2017-12-15	2	-79/+23
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	freedreno: use u_transfer_helper	Rob Clark	2017-12-15	2	-229/+44
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	amd/common: add ac_build_waitcnt()	Samuel Pitoiset	2017-12-14	3	-15/+4
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radeonsi: make use of ac_build_fdiv()	Samuel Pitoiset	2017-12-14	1	-7/+1
\| \| \| \| \| \| \|	And move the comment to amd/common. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radeonsi: make use of ac_get_spi_shader_z_format()	Samuel Pitoiset	2017-12-14	3	-23/+4
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	swr: Correct texture allocation and limit max size to 2GB	Bruce Cherniak	2017-12-13	2	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch fixes piglit tex3d-maxsize by correcting 4 things: The total_size calculation was using 32-bit math, therefore a >4GB allocation request overflowed and was not returning false (unsupported). Changed AlignedMalloc arguments from "unsigned int" to size_t, to handle >4GB allocations. Added error checking on texture allocations to fail gracefully. Finally, temporarily decreased supported max texture size from 4GB to 2GB. The gallivm texture-sampler needs some additional work to correctly handle larger than 2GB textures (offsets to LLVMBuildGEP are signed). I'm working on a follow-on patch to allow up to 4GB textures, as this is useful in HPC visualization applications. Fixes piglit tex3d-maxsize. v2: Updated patch description to clarify ">4GB". Reviewed-By: George Kyriazis <[email protected]>
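The overflow described in this message can be illustrated with a small sketch. It assumes the total size is the product of width, height, depth and bytes per texel; the function names and the `MAX_TEXTURE_BYTES` constant are hypothetical and are not taken from the swr source.

```c
#include <stdint.h>

/* Assumed limit for illustration: 2GB, matching the cap named above. */
#define MAX_TEXTURE_BYTES (UINT64_C(2) << 30)

/* 32-bit math: the product wraps modulo 2^32 for large textures. */
static uint32_t total_size_32(uint32_t w, uint32_t h, uint32_t d,
                              uint32_t bytes_per_texel)
{
    return w * h * d * bytes_per_texel;
}

/* 64-bit math: widen the first operand so the whole product is 64-bit. */
static uint64_t total_size_64(uint32_t w, uint32_t h, uint32_t d,
                              uint32_t bytes_per_texel)
{
    return (uint64_t)w * h * d * bytes_per_texel;
}

/* Hypothetical support check: reject anything above the assumed limit. */
static int texture_size_supported(uint32_t w, uint32_t h, uint32_t d,
                                  uint32_t bytes_per_texel)
{
    return total_size_64(w, h, d, bytes_per_texel) <= MAX_TEXTURE_BYTES;
}
```

A 2048 x 2048 x 2048 texture with 4 bytes per texel needs 2^35 bytes. The 32-bit product wraps to 0, so a size check based on it would wrongly pass, while the 64-bit product is rejected as intended.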
*	swr: Fix KNOB_MAX_WORKER_THREADS thread creation override.	Bruce Cherniak	2017-12-13	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Environment variable KNOB_MAX_WORKER_THREADS allows the user to override default thread creation and thread binding. A previous commit that adjusted Linux CPU topology handling caused setting this KNOB to bind all threads to a single core. This patch restores correct functionality of the override. Cc: <[email protected]> Reviewed-by: Tim Rowley <[email protected]>