mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	winsys/amdgpu: add support for allocating GDS and OA resources	Marek Olšák	2018-11-28	2	-23/+36
\| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	radeonsi: allow si_cp_dma_clear_buffer to clear GDS from any IB	Marek Olšák	2018-11-28	4	-31/+33
\| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	winsys/amdgpu: use optimal VM alignment for CPU allocations	Marek Olšák	2018-11-28	1	-2/+4
\| \| \| \|	Acked-by: Christian König <[email protected]>
*	winsys/amdgpu: use optimal VM alignment for imported buffers	Marek Olšák	2018-11-28	1	-20/+29
\| \| \| \| \| \|	Window system buffers didn't use the optimal alignment. Acked-by: Christian König <[email protected]>
*	winsys/amdgpu,radeon: pass vm_alignment to buffer_from_handle	Marek Olšák	2018-11-28	7	-5/+15
\| \| \| \|	Acked-by: Christian König <[email protected]>
*	winsys/amdgpu: overallocate buffers for faster address translation on Gfx9	Marek Olšák	2018-11-28	1	-0/+24
\| \| \| \| \| \| \| \|	Sadly, the 3 games I tested (DeusEx:MD, DiRT Rally, DOTA 2) are unaffected by the overallocation, because I guess their buffers don't fall into the small range below a power-of-two size. Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	winsys/amdgpu: increase the VM alignment to the MSB of the size for Gfx9	Marek Olšák	2018-11-28	1	-1/+11
\| \| \| \| \|	Reviewed-by: Christian König <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	winsys/amdgpu: use >= instead of > for VM address alignment	Marek Olšák	2018-11-28	1	-1/+1
\| \| \| \| \|	Reviewed-by: Christian König <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	winsys/amdgpu: clean up code around BO VM alignment	Marek Olšák	2018-11-28	1	-2/+7
\| \| \| \|	Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	winsys/amdgpu: optimize slab allocation for 2 MB amdgpu page tables	Marek Olšák	2018-11-28	3	-2/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- the slab buffer size increased from 128 KB to 2 MB (PTE fragment size) - the max suballocated buffer size increased from 64 KB to 256 KB, this increases memory usage because it wastes memory - the number of suballocators increased from 1 to 3 and they are layered on top of each other to minimize unused space in slabs The final increase in memory usage is: DeusEx:MD: 1.8% DOTA 2: 1.75% DiRT Rally: 0.2% The kernel driver will also receive fewer buffers.
*	radeonsi: generalize the slab allocator code to allow layered slab allocators	Marek Olšák	2018-11-28	3	-24/+86
\| \| \| \| \|	There is no change in behavior. It just makes it easier to change the number of slab allocators.
*	winsys/amdgpu: always reclaim/release slabs if there is not enough memory	Marek Olšák	2018-11-28	1	-7/+13
\|
*	radeonsi: fix is_oneway_access_only for bindless images	Marek Olšák	2018-11-28	1	-6/+23
\|
*	radeonsi/nir: parse more information about bindless usage	Marek Olšák	2018-11-28	1	-4/+32
\| \| \| \|	fill more tgsi_shader_info fields.
*	tgsi/scan: add more information about bindless usage	Marek Olšák	2018-11-28	2	-2/+33
\| \| \| \|	radeonsi will use this.
*	radeonsi: small cleanup for memory opcodes	Marek Olšák	2018-11-28	1	-9/+4
\|
*	radeonsi: fix is_oneway_access_only for image stores	Marek Olšák	2018-11-28	1	-12/+37
\| \| \| \|	We need to look at the Dst for image stores.
*	radeonsi: use structured buffer intrinsics for image views	Marek Olšák	2018-11-28	2	-10/+42
\| \| \| \|	to stop using the workaround in si_make_buffer_descriptor.
*	radeonsi: clean up primitive binning enablement	Marek Olšák	2018-11-28	1	-11/+16
\| \| \| \| \| \|	no change in behavior. Reviewed-by: Bas Nieuwenhuizen <[email protected]>
*	virgl: fix undefined shift to use unsigned.	Dave Airlie	2018-11-29	1	-1/+1
\| \| \| \| \| \|	Ported from virglrenderer. Signed-off-by: Dave Airlie <[email protected]>
*	r600: make suballocator 256-bytes align	Dave Airlie	2018-11-29	1	-1/+1
\| \| \| \| \|	Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=108311 Cc: <[email protected]>
*	gallivm: Use nextafterf(0.5, 0.0) as rounding constant	Matt Turner	2018-11-28	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	The common truncf(x + 0.5) fails for the floating-point value just less than 0.5 (nextafterf(0.5, 0.0)). nextafterf(0.5, 0.0) + 0.5, after rounding is 1.0, thus truncf does not produce the desired value. The solution is to add nextafterf(0.5, 0.0) instead of 0.5 before truncating. This works for all values. Reviewed-by: Roland Scheidegger <[email protected]>
*	winsys/amdgpu: explicitly declare whether buffer_map is permanent or not	Nicolai Hähnle	2018-11-28	17	-62/+140
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Introduce a new driver-private transfer flag RADEON_TRANSFER_TEMPORARY that specifies whether the caller will use buffer_unmap or not. The default behavior is set to permanent maps, because that's what drivers do for Gallium buffer maps. This should eliminate the need for hacks in libdrm. Assertions are added to catch when the buffer_unmap calls don't match the (temporary) buffer_map calls. I did my best to update r600 for consistency (r300 needs no changes because it never calls buffer_unmap), even though the radeon winsys ignores the new flag. As an added bonus, this should actually improve the performance of the normal fast path, because we no longer call into libdrm at all after the first map, and there's one less atomic in the winsys itself (there are now no atomics left in the UNSYNCHRONIZED fast path). Cc: Leo Liu <[email protected]> v2: - remove comment about visible VRAM (Marek) - don't rely on amdgpu_bo_cpu_map doing an atomic write Reviewed-by: Marek Olšák <[email protected]>
*	winsys/amdgpu: add amdgpu_winsys_bo::lock	Nicolai Hähnle	2018-11-28	3	-13/+20
\| \| \| \| \| \| \|	We'll use it in the upcoming mapping change. Sparse buffers have always had one. Reviewed-by: Marek Olšák <[email protected]>
*	virgl: Don't try handling server fences when they are not supported	Gert Wollny	2018-11-28	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vtest doesn't implement the according API and would segfault: Program received signal SIGSEGV, Segmentation fault. #0 0x0000000000000000 in ?? () #1 in virgl_fence_server_sync at src/gallium/drivers/virgl/virgl_context.c:1049 #2 in st_server_wait_sync at src/mesa/state_tracker/st_cb_syncobj.c:155 so just don't do the call when the function pointers are not set. Fixes dEQP: dEQP-GLES3.functional.fence_sync.wait_sync_smalldraw dEQP-GLES3.functional.fence_sync.wait_sync_largedraw Fixes: d1a1c21e7621b5177febf191fcd3d3b8ef69dc96 virgl: native fence fd support Signed-off-by: Gert Wollny <[email protected]> Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Robert Foss <[email protected]>
*	virgl,vtest: Initialize return value	Gert Wollny	2018-11-28	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Avoids: Conditional jump or move depends on uninitialised value(s) at 0x9E2B39F: virgl_vtest_winsys_resource_cache_create (virgl_vtest_winsys.c:379) by 0x9E2725F: virgl_buffer_create (virgl_buffer.c:169) by 0x9E246D5: virgl_resource_create (virgl_resource.c:60) by 0xA0C1B9F: bufferobj_data (st_cb_bufferobjects.c:344) by 0xA0C1B9F: st_bufferobj_data (st_cb_bufferobjects.c:390) by 0x9F4ACE3: vbo_use_buffer_objects (vbo_exec_api.c:1136) by 0xA0C68C3: st_create_context_priv (st_context.c:416) by 0xA0C707A: st_create_context (st_context.c:598) by 0x9F81C6B: st_api_create_context (st_manager.c:918) by 0x9BBE591: dri_create_context (dri_context.c:161) by 0x9BB6931: driCreateContextAttribs (dri_util.c:473) by 0x4E97A44: drisw_create_context_attribs (drisw_glx.c:630) by 0x4E7C591: glXCreateContextAttribsARB (create_context.c:78) Uninitialised value was created by a stack allocation at 0x9E2B249: virgl_vtest_winsys_resource_cache_create (virgl_vtest_winsys.c:342) Signed-off-by: Gert Wollny <[email protected]> Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Robert Foss <[email protected]>
*	v3d: Add renderonly support.	Eric Anholt	2018-11-27	6	-5/+77
\| \| \| \| \| \|	I've been using this with the kmsro series to test v3d on VKMS without my old KMS hack in the v3d kernel driver. KMSRO still needs some cleanup, but v3d RO support seems reasonable.
*	gallium: Remove unused variable in u_tests.	Eric Anholt	2018-11-27	1	-1/+0
\| \| \| \| \|	Fixes: 0d17b685b1ff ("gallium/u_tests: add a compute shader test that clears an image") Reviewed-by: Marek Olšák <[email protected]>
*	freedreno: implements get_sample_position	Hyunjun Ko	2018-11-27	1	-0/+45
\| \| \| \| \| \| \| \| \| \|	Since 1285f71d3e landed, it needs to provide apps with proper sample position for MSAA. Currently no way to query this to hw, these are taken from blob driver. Fixes: dEQP-GLES31.functional.texture.multisample.samples_#.sample_position Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a3xx: also set FSSUPERTHREADENABLE	Rob Clark	2018-11-27	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	We set equiv bit in SP_FS_CTRL_REG0. Somehow the hw doesn't hang with this mismatched config, but does run slower. It is faster with either neither bit set, or both bits set, but both is the fastest of the three configurations. Worth a bit over 10% gain in glmark2. Spotted-by: Jonathan Marek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno: use MSM_BO_SCANOUT with scanout buffers	Jonathan Marek	2018-11-27	1	-1/+3
\| \| \| \|	Signed-off-by: Jonathan Marek <[email protected]>
*	freedreno: use GENERIC instead of TEXCOORD for blit program	Jonathan Marek	2018-11-27	1	-1/+1
\| \| \| \| \| \| \|	blip_fp uses GENERIC as input, so blit_vp should match for linking Signed-off-by: Jonathan Marek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno: a2xx texture update	Jonathan Marek	2018-11-27	9	-20/+212
\| \| \| \| \| \| \| \| \| \| \|	Adds all missing texture related logic. For everything to work it also needs changes to ir2/fd2_program, which are part of the ir2 update patch. Note: it needs rnndb update Signed-off-by: Jonathan Marek <[email protected]> [remove stray patch] Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a2xx: Compute depth base in gmem correctly	Jonathan Marek	2018-11-27	1	-5/+7
\| \| \| \| \| \| \| \|	Note: it needs rnndb update Signed-off-by: Marek Vasut <[email protected]> Signed-off-by: Jonathan Marek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a2xx: set VIZ_QUERY_ID on a20x	Jonathan Marek	2018-11-27	1	-0/+5
\| \| \| \| \|	Signed-off-by: Jonathan Marek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno: add missing a20x ids	Jonathan Marek	2018-11-27	1	-0/+2
\| \| \| \| \| \| \| \|	200: 256KiB GMEM A200 (imx53) 201: 128KiB GMEM A200 (imx51) Signed-off-by: Jonathan Marek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a2xx: fix POINT_MINMAX_MAX overflow	Jonathan Marek	2018-11-27	1	-1/+1
\| \| \| \| \| \| \|	As it stands, it overflows to zero. Signed-off-by: Jonathan Marek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno: a2xx: fd2_draw update	Jonathan Marek	2018-11-27	6	-20/+114
\| \| \| \| \|	Signed-off-by: Jonathan Marek <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno: update generated headers	Rob Clark	2018-11-27	7	-39/+287
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a6xx: set guardband clip	Rob Clark	2018-11-27	4	-7/+57
\| \| \| \| \| \| \| \| \|	On older gens, the CLIP_ADJ bitfields were actually 3.6 fixed point. Which might make more sense. Although this formula comes up with values pretty close to what blob does for various viewport sizes (for at least a5xx and a6xx), and seems to work. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a6xx: disable LRZ for z32	Rob Clark	2018-11-27	1	-1/+13
\| \| \| \| \| \| \| \| \|	f6131d4ec7a had the side effect of enabling LRZ w/ 32b depth buffers. But there are some bugs with this, which aren't fully understood yet, so for now just skip LRZ w/ z32.. Fixes: f6131d4ec7a freedreno/a6xx: Clear z32 and separate stencil with blitter Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a6xx: Clear gmem buffers at flush time	Kristian H. Kristensen	2018-11-27	4	-178/+180
\| \| \| \| \| \| \| \| \|	We generate an IB to clear the gmem at flush time and jump to it before rendering each tile. This lets us get rid of the command stream patching for gmem offsets. Signed-off-by: Kristian H. Kristensen <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a6xx: Move resolve blits to an IB	Kristian H. Kristensen	2018-11-27	3	-8/+29
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a6xx: Move restore blits to IB	Kristian H. Kristensen	2018-11-27	3	-19/+49
\| \| \| \| \|	Signed-off-by: Kristian H. Kristensen <[email protected]> Signed-off-by: Rob Clark <[email protected]>
*	freedreno: move ir3 to common location	Rob Clark	2018-11-27	40	-13732/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Move (most of) the ir3 compiler to src/freedreno/ir3 so that it can be re-used by some future vulkan driver. The parts that are gallium specific have been refactored out and remain in the gallium driver. Getting the move done now so that it can happen before further refactoring to support a6xx specific instructions. NOTE also removes ir3_cmdline compiler tool from autotools build since that was easier than fixing it and I normally use meson build. Waiting patiently for the day that we can remove everything from the autotools build. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: remove u_inlines usage	Rob Clark	2018-11-27	1	-10/+10
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: split up ir3_shader	Rob Clark	2018-11-27	14	-667/+766
\| \| \| \| \| \| \|	Split the parts that are gallium specific into ir3_gallium so the rest can move to a common location outside of gallium. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: remove pipe_stream_output_info dependency	Rob Clark	2018-11-27	8	-17/+68
\| \| \| \| \| \| \| \|	A bit annoying to have to copy into our own struct. But this is something the compiler really needs to know, at least on earlier generations where streamout is implemented in shader. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: some header file cleanup	Rob Clark	2018-11-27	11	-26/+24
\| \| \| \| \| \|	Clean up some of the low-hanging-fruit usages of freedreno_util.h Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: use env_var_as_unsigned()	Rob Clark	2018-11-27	2	-14/+2
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>