mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	freedreno: track staging and shadow perf ctrs for the HUD	Rob Clark	2017-12-17	5	-0/+16
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: staging upload transfers	Rob Clark	2017-12-17	3	-43/+135
\| \| \| \| \| \| \| \| \| \|	In the busy && !needs_flush case, we can support a DISCARD_RANGE upload using a staging buffer. This is a bit different from the case of mid- batch uploads which require us to shadow the whole resource (because later draws in an earlier tile happen before earlier draws in a later tile). Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update generated headers	Rob Clark	2017-12-17	7	-63/+334
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	radeonsi: don't call force_dcc_off for buffers	Marek Olšák	2017-12-16	1	-1/+1
\| \| \| \| \| \| \| \|	This was undefined yet harmless behavior in LLVM. Not anymore - it causes a hang now. Cc: 17.3 <[email protected]> Tested-by: Michel Dänzer <[email protected]>
*	radeon/uvd: add and manage render picture list	Boyuan Zhang	2017-12-15	1	-4/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Create a list in decoder to store all render picture buffer pointers that currently being used in reference picture lists. During get message buffer call, check each pointer in render_pic_list[] within given pic->ref[] list, remove pointer that no longer being used by pic->ref[]. Then add current render surface pointer to the render_pic_list[] and assign the associated index to result.curr_idx. As a result, result.curr_idx will have the correct index to represent the current render picture, instead of the previous increamenting values. Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Christian König <[email protected]>
*	radeon/vcn: add and manage render picture list	Boyuan Zhang	2017-12-15	1	-4/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Create a list in decoder to store all render picture buffer pointers that currently being used in reference picture lists. During get message buffer call, check each pointer in render_pic_list[] within given pic->ref[] list, remove pointer that no longer being used by pic->ref[]. Then add current render surface pointer to the render_pic_list[] and assign the associated index to result.curr_idx. As a result, result.curr_idx will have the correct index to represent the current render picture, instead of the previous increamenting values. Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Christian König <[email protected]>
*	radeon/vce: determine idr by pic type	Boyuan Zhang	2017-12-15	1	-1/+1
\| \| \| \| \| \| \| \| \|	Vaapi encode interface provides idr frame flags, where omx interface doesn't. Therefore, change to use picture type to determine idr frame, which will work for both interfaces. Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Leo Liu <[email protected]>
*	radeon/vcn: determine idr by pic type	Boyuan Zhang	2017-12-15	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Vaapi encode interface provides idr frame flags, where omx interface doesn't. Therefore, change to use picture type to determine idr frame, which will work for both interfaces. Signed-off-by: Boyuan Zhang <[email protected]> Reviewed-by: Leo Liu <[email protected]> Reviewed-by: Christian König <[email protected]>
*	swr/rast: Move more RTAI handling out of binner	Tim Rowley	2017-12-15	2	-12/+2
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: EXTRACT2 changed from vextract/vinsert to vshuffle	Tim Rowley	2017-12-15	3	-61/+32
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Fix cache of API thread event manager	Tim Rowley	2017-12-15	1	-1/+1
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Replace VPSRL with LSHR	Tim Rowley	2017-12-15	4	-41/+4
\| \| \| \| \| \| \| \|	Replace use of x86 intrinsic with general llvm IR instruction. Generates the same final assembly. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Rework thread binding parameters for machine partitioning	Tim Rowley	2017-12-15	7	-88/+322
\| \| \| \| \| \| \| \| \| \| \| \| \|	Add BASE_NUMA_NODE, BASE_CORE, BASE_THREAD parameters to SwrCreateContext. Add optional SWR_API_THREADING_INFO parameter to SwrCreateContext to control reservation of API threads. Add SwrBindApiThread() function to allow binding of API threads to reserved HW threads. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Pull of RTAI gather & offset out of clip/bin code	Tim Rowley	2017-12-15	7	-146/+203
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Remove no-op VBROADCAST of vID	Tim Rowley	2017-12-15	1	-2/+2
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: SIMD16 Fetch - Fully widen 32-bit integer vertex components	Tim Rowley	2017-12-15	4	-17/+109
\| \| \| \| \| \|	Also widen the 16-bit a 8-bit integer vertex component gathers to SIMD16. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Replace INSERT2 vextract/vinsert with JOIN2 vshuffle	Tim Rowley	2017-12-15	3	-105/+30
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: SIMD16 Fetch - Fully widen 16-bit float vertex components	Tim Rowley	2017-12-15	1	-7/+48
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: SIMD16 Fetch - Fully widen 32-bit float vertex components	Tim Rowley	2017-12-15	4	-32/+194
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Pass prim to ClipSimd	Tim Rowley	2017-12-15	1	-5/+5
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Pull most of the VPAI manipulation out of the binner/clipper	Tim Rowley	2017-12-15	7	-158/+177
\| \| \| \| \| \|	Move out of binner/clipper; hand them down from the frontend code instead. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Move GatherScissors to header	Tim Rowley	2017-12-15	2	-127/+127
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Rewrite Shuffle8bpcGatherd using shuffle	Tim Rowley	2017-12-15	1	-182/+62
\| \| \| \| \| \|	Ease future code maintenance, prepare for folding simd8 and simd16 versions. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Convert gather masks to Nx1bit	Tim Rowley	2017-12-15	2	-40/+14
\| \| \| \| \| \| \|	Simplifies calling code, gets gather function interface closer to llvm's masked_gather. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: WIP - Widen fetch shader to SIMD16	Tim Rowley	2017-12-15	1	-27/+689
\| \| \| \| \| \|	Widen vertex gather/storage to SIMD16 for all component types. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Corrections to multi-scissor handling	Tim Rowley	2017-12-15	1	-88/+88
\| \| \| \| \| \| \|	binner's GatherScissors() will be turned into a real gather in the not too distant future. Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Binner fixes for viewport index offset handling	Tim Rowley	2017-12-15	2	-2/+12
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	swr/rast: Remove unneeded copy of gather mask	Tim Rowley	2017-12-15	2	-79/+23
\| \| \| \|	Reviewed-by: Bruce Cherniak <[email protected]>
*	freedreno: use u_transfer_helper	Rob Clark	2017-12-15	2	-229/+44
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	amd/common: add ac_build_waitcnt()	Samuel Pitoiset	2017-12-14	3	-15/+4
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radeonsi: make use of ac_build_fdiv()	Samuel Pitoiset	2017-12-14	1	-7/+1
\| \| \| \| \| \| \|	And move the comment to amd/common. Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radeonsi: make use of ac_get_spi_shader_z_format()	Samuel Pitoiset	2017-12-14	3	-23/+4
\| \| \| \| \|	Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	swr: Correct texture allocation and limit max size to 2GB	Bruce Cherniak	2017-12-13	2	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch fixes piglit tex3d-maxsize by correcting 4 things: The total_size calculation was using 32-bit math, therefore a >4GB allocation request overflowed and was not returning false (unsupported). Changed AlignedMalloc arguments from "unsigned int" to size_t, to handle >4GB allocations. Added error checking on texture allocations to fail gracefully. Finally, temporarily decreased supported max texture size from 4GB to 2GB. The gallivm texture-sampler needs some additional work to correctly handle larger than 2GB textures (offsets to LLVMBuildGEP are signed). I'm working on a follow-on patch to allow up to 4GB textures, as this is useful in HPC visualization applications. Fixes piglit tex3d-maxsize. v2: Updated patch description to clarify ">4GB". Reviewed-By: George Kyriazis <[email protected]>
*	swr: Fix KNOB_MAX_WORKER_THREADS thread creation override.	Bruce Cherniak	2017-12-13	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \|	Environment variable KNOB_MAX_WORKER_THREADS allows the user to override default thread creation and thread binding. Previous commit to adjust linux cpu topology caused setting this KNOB to bind all threads to a single core. This patch restores correct functionality of override. Cc: <[email protected]> Reviewed-by: Tim Rowley <[email protected]>
*	radeonsi: create get_tcs_tes_buffer_address helper	Timothy Arceri	2017-12-13	1	-12/+32
\| \| \| \| \| \|	This will be shared between the NIR and TGSI backends. Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/util: don't pass a pipe_resource to util_resource_is_array_texture()	Brian Paul	2017-12-12	1	-1/+1
\| \| \| \| \| \| \| \| \|	No need to pass a pipe_resource when we can just pass the target. This makes the function potentially more usable. Rename it too. Reviewed-by: Roland Scheidegger <[email protected]> Reviewed-by: Charmaine Lee <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	svga: trivial whitespace/formatting fixes in svga_pipe_rasterizer.c	Brian Paul	2017-12-12	1	-9/+5
\|
*	radeon/vce: move destroy command before feedback command	Leo Liu	2017-12-08	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	VCE processing IBs starts from session and task info at first level, other commands processed subsequently. The task info for destroy is embedded to destroy command, resulting that feedback command is not properly procoessed. This is causing kernel spin VM fault messages on Polaris and Vega10 card when running ends at encode application. The fix is also verified on VCE physical mode card. Signed-off-by: Leo Liu <[email protected]> Cc: [email protected] Acked-by: Christian König <[email protected]>
*	r600/sb: do not convert if-blocks that contain indirect array access	Gert Wollny	2017-12-07	3	-2/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If an array is accessed within an if block, then currently it is not known whether the value in the address register is involved in the evaluation of the if condition, and converting the if condition may actually result in out-of-bounds array access. Consequently, if blocks that contain indirect array access should not be converted. Fixes piglits on r600/BARTS: spec/glsl-1.10/execution/variable-indexing/ vs-output-array-float-index-wr vs-output-array-vec3-index-wr vs-output-array-vec4-index-wr Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104143 Signed-off-by: Gert Wollny <[email protected]> Cc: <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	r600: add support for compute grid/block sizes. (v2)	Dave Airlie	2017-12-06	4	-3/+100
\| \| \| \| \| \| \| \| \| \|	We just pass these in from outside in a constant buffer. The shader side stores them once they are accessed once. v2: fix to not use a temp_reg. Signed-off-by: Dave Airlie <[email protected]>
*	r600: handle image/buffer sizes correctly.	Dave Airlie	2017-12-06	3	-4/+21
\| \| \| \| \| \|	This adds support to compute for the resq workarounds (buffer/cube sizes) Signed-off-by: Dave Airlie <[email protected]>
*	r600/compute: add support for emitting compute image/buffer atoms	Dave Airlie	2017-12-06	1	-1/+9
\| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
*	r600/compute: handle atomic counters in compute state.	Dave Airlie	2017-12-06	1	-0/+9
\| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
*	r600/compute: add support for TGSI compute shaders. (v1.1)	Dave Airlie	2017-12-06	2	-28/+103
\| \| \| \| \| \| \| \| \| \| \|	This add paths to handle TGSI compute shaders and shader selection. It also avoids emitting certain things on tgsi paths, CBs, vertex buffers, config reg init (not required). v1.1: fix rat mask calc Signed-off-by: Dave Airlie <[email protected]>
*	r600/shader: add compute support to shader assembler	Dave Airlie	2017-12-06	1	-0/+14
\| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
*	r600/texture: drop lowering 1d/2d images to linear.	Dave Airlie	2017-12-06	1	-8/+0
\| \| \| \| \| \| \|	This appears to cause hangs with compute images. Unless we can find more specifics, just don't do this for now. Signed-off-by: Dave Airlie <[email protected]>
*	swr/scons: Fix another intermittent build failure	George Kyriazis	2017-12-06	1	-0/+1
\| \| \| \| \| \| \|	gen_BackendPixelRate*.cpp depends on gen_ar_eventhandler.hpp. Fix missing dependency. Reviewed-by: Bruce Cherniak <[email protected]>
*	radeonsi: make const and stream uploaders allocate read-only memory	Marek Olšák	2017-12-06	1	-2/+5
\| \| \| \| \| \| \|	and anything that clones these uploaders, like u_threaded_context. Tested-by: Dieter Nützel <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: use a separate allocator for fine fences	Marek Olšák	2017-12-06	3	-1/+9
\| \| \| \| \|	Tested-by: Dieter Nützel <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi/gfx9: make shader binaries use read-only memory	Marek Olšák	2017-12-06	5	-3/+13
\| \| \| \| \|	Tested-by: Dieter Nützel <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>