mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	radeonsi: rename si_eliminate_const_vs_outputs -> si_optimize_vs_outputs	Marek Olšák	2017-05-08	1	-2/+2
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: split per-patch from per-vertex indices	Nicolai Hähnle	2017-05-08	3	-21/+42
\| \| \| \| \| \| \|	Make it a bit clearer that the index spaces are logically seperate by having them defined in different functions. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: clarify documentation of existing SI workaround	Nicolai Hähnle	2017-05-08	1	-1/+3
\| \| \| \| \| \| \|	Limiting LS-HS to a single wave is required on all SI chips due to an issue with a power management feature. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: fix gl_PrimitiveID in tessellation with instanced draws on SI	Nicolai Hähnle	2017-05-08	1	-0/+14
\| \| \| \| \|	Cc: [email protected] Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: load patch_id for TES-as-ES when exporting for PS	Nicolai Hähnle	2017-05-08	1	-2/+2
\| \| \| \| \| \| \|	For some reason, this change is only necessary on SI. Cc: [email protected] Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: fix primitive ID in fragment shader when using tessellation	Nicolai Hähnle	2017-05-08	1	-10/+17
\| \| \| \| \| \| \| \| \| \| \|	In a VS->TCS->TES->PS pipeline, the primitive ID is read from TES exports, so it is as if TES were using the primitive ID. Specifically, this fixes a bug where the primitive ID is not reset at the start of a new instance. Cc: [email protected] Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: mark fast-cleared textures as compressed when dirtying	Nicolai Hähnle	2017-05-08	1	-2/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are a bunch of piglit fast clear tests that regressed on SI, for example ./bin/ext_framebuffer_multisample-fast-clear single-sample. The problem is that a texture is bound as a framebuffer, cleared, and then rendered from in a loop that loops through different clear colors. The texture is never rebound during all this, so the change to tex->dirty_level_mask during fast clear was not taken into account when checking for compressed textures. I have considered simply reverting the problematic commit. However, I think this solution is better. It does require looping through all bound textures after a fast clear, but the alternative would require visiting more textures needless on every draw. Draws are much more common than clears. Note that the rendering feedback loop rules do not apply here, because the framebuffer binding is changed between the glClear and the draw that samples from the texture that was cleared. Fixes: bdd644976952 ("radeonsi: don't mark non-dirty textures with CMASK as compressed") Cc: 17.1 <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	nv50/ir: Replace NV50_PROGRAM_IR_* by PIPE_SHADER_IR_*	Pierre Moreau	2017-05-07	5	-10/+7
\| \| \| \| \|	Signed-off-by: Pierre Moreau <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nv50/ir: Remove unused translation methods	Pierre Moreau	2017-05-07	2	-10/+3
\| \| \| \| \| \| \|	This code was merged commented out, and has stayed that way ever since. Signed-off-by: Pierre Moreau <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nv50/ir: Free target if we failed to create a program	Pierre Moreau	2017-05-07	1	-1/+3
\| \| \| \| \|	Signed-off-by: Pierre Moreau <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	nv50/ir: Fail if encountering unknown shader type	Pierre Moreau	2017-05-07	1	-2/+2
\| \| \| \| \|	Signed-off-by: Pierre Moreau <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
*	Revert "radeonsi: constify a bunch of the perfcounter structs."	Marek Olšák	2017-05-06	3	-46/+52
\| \| \| \| \| \| \|	This reverts commit 7088b655e8828bb960f528dd33132de27c505b8f. It breaks performance counters. If you use them with this commit, they hang the machine hard. Sysrq and ssh don't work.
*	Revert "radeonsi: fix build with GCC 4.8"	Marek Olšák	2017-05-06	1	-1/+1
\| \| \| \| \| \|	This reverts commit 485ece83aceb3a35792efbc6fe2bca57ba46c04a. It's needed to revert 7088b655e8828bb960f528dd33132de27c505b8f.
*	freedreno/a3xx: fix hang w/ large render targets and small gmem	Rob Clark	2017-05-06	3	-0/+7
\| \| \| \| \| \| \| \| \|	Possibly other gen's have a similar limit. Fixes glmark2 -b shadow with larger resolutions on devices with small gmem (for example, fullscreen 1080p on 8x16/db410c). Cc: [email protected] Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: add macro to declare variable length arrays	Rob Clark	2017-05-06	1	-18/+21
\| \| \| \| \| \|	We have enough of these, that we should stop open coding this. Signed-off-by: Rob Clark <[email protected]>
*	radeonsi: apply the tess+GS hang workaround to Polaris12 as well	Marek Olšák	2017-05-05	1	-1/+2
\| \| \| \| \| \|	Cc: 17.1 <[email protected]> Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: fix build with GCC 4.8	Samuel Pitoiset	2017-05-05	1	-1/+1
\| \| \| \| \| \| \|	Fixes: 7088b655e8 ("radeonsi: constify a bunch of the perfcounter structs.") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100937 Signed-off-by: Samuel Pitoiset <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	winsys/amdgpu: fix Polaris12 (RX 550) breakage	Marek Olšák	2017-05-05	1	-0/+1
\| \| \| \| \| \| \|	reported by Greg White. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100892 Cc: 17.1 <[email protected]>
*	radeonsi/gfx9: allow the scratch buffer in HS and GS	Marek Olšák	2017-05-05	1	-10/+0
\| \| \| \| \| \|	It works now. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: prevent race conditions when doing scratch patching	Marek Olšák	2017-05-05	1	-2/+30
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: separate scratch state patching code into its own function	Marek Olšák	2017-05-05	1	-46/+55
\| \| \| \| \| \| \|	Picked from a different branch. When we stop using the scratch patching, this function will not be called. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi/gfx9: also apply scratch relocations to the 1st shader of merged ↵	Marek Olšák	2017-05-05	1	-0/+3
\| \| \| \| \| \|	shaders Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi/gfx9: set correct LLVM calling conventions for merged shaders	Marek Olšák	2017-05-05	2	-2/+18
\| \| \| \| \| \|	for scratch support Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: remove unused parameters from si_shader_apply_scratch_relocs	Marek Olšák	2017-05-05	4	-10/+6
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: inline si_llvm_shader_type into si_llvm_create_func	Marek Olšák	2017-05-05	3	-33/+22
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: don't use util_memcpy_cpu_to_le32 for shader uploads	Marek Olšák	2017-05-05	1	-7/+8
\| \| \| \| \| \|	at least I think this is correct. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: make si_compile_llvm static	Marek Olšák	2017-05-05	2	-16/+8
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: fold surrounding code into si_llvm_finalize_module	Marek Olšák	2017-05-05	3	-21/+12
\| \| \| \| \| \|	and rename to si_llvm_optimize_module. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: don't call eliminate_const_vs_outputs in shaders without VS exports	Marek Olšák	2017-05-05	1	-4/+4
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: drop support for LLVM 3.8	Marek Olšák	2017-05-05	5	-83/+26
\| \| \| \| \| \| \| \| \| \| \| \|	LLVM 3.8: - had broken indirect resource indexing - didn't have scratch coalescing - was the last user of problematic v16i8 - only supported OpenGL 4.1 This leaves us with LLVM 3.9 and LLVM 4.0 support for Mesa 17.2. Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: stop using v16i8	Marek Olšák	2017-05-05	3	-12/+8
\| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi/gfx9: make some PA & DB registers match the closed Vulkan driver	Marek Olšák	2017-05-05	1	-3/+18
\| \| \| \| \|	Cc: 17.1 <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	freedreno/a5xx: compute shader support	Rob Clark	2017-05-04	9	-5/+264
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno: core compute state support	Rob Clark	2017-05-04	7	-6/+216
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: compute shader support	Rob Clark	2017-05-04	4	-19/+154
\| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
*	freedreno/a5xx: SSBO support	Rob Clark	2017-05-04	3	-7/+114
\| \| \| \| \| \| \| \| \| \|	To simplify things for now, since all the gfx shader stages share a single SSBO state block, only advertise SSBO support for fragment shader (and compute when we have that). We could possibly use a fixed- partitioning of the SSBO index space to support SSBOs on other stages without having to resort to shader variants. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: SSBO/atomic support	Rob Clark	2017-05-04	6	-13/+309
\| \| \| \| \| \| \| \|	TODO cwabbott pointed out a write-after-read hazzard, which effects both this and arrays. A write needs to depend on all reads since the last write, not just the last read. Signed-off-by: Rob Clark <[email protected]>
*	freedreno: core SSBO support	Rob Clark	2017-05-04	5	-0/+73
\| \| \| \| \| \|	The generation-independent support for binding shader buffer objects. Signed-off-by: Rob Clark <[email protected]>
*	freedreno/ir3: resync instr-a3xx.h/disasm-a3xx.c	Rob Clark	2017-05-04	7	-20/+161
\| \| \| \| \| \| \|	Sync to the same files from freedreno.git to correct decoding of ldgb/ stgb instructions. Signed-off-by: Rob Clark <[email protected]>
*	targets/libgl-xlib: remove unneeded GLX_SHARED_GLAPI define	Emil Velikov	2017-05-04	1	-2/+0
\| \| \| \| \| \| \|	There's no users in-tree that use it. Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/dri: remove unneeded HAVE_SHARED_GLAPI guard	Emil Velikov	2017-05-04	1	-5/+1
\| \| \| \| \| \| \| \| \| \| \|	Always true, since the dri modules required shared glapi. With earlier commit (da410e6afad "configure: explicitly require shared glapi for enable-dri") we even made that explicit during the configure stage. Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	gallium/dri: always link against shared glapi	Emil Velikov	2017-05-04	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In the early days of Xorg and Mesa we had multiple providers of the GLAPI. All of those were the ones responsible for dlopening the DRI module. Hence it was perfectly fine, and actually expected, for the DRI modules to have unresolved symbols. Since then we've moved the API to a separate shared library and no other libraries provide the symbols. Here comes the picky part: It's possible that one uses old Xorg (where libglx.so provides the GLAPI) and new Mesa (with DRI modules linking against libglapi.so). That should still work, since the the libglx.so symbols will take precedence over the libglapi.so ones. I've verified this while running 1.14 series Xorg alongside this (and next) patch. It may seem a bit fragile, but that's of reasonably OK since all of the affected Xorg versions have been EOL for years. The final one being the 1.14 series, which saw its final bug fix release 1.14.7 in June 2014. To ensure that the binaries do not have unresolved symbols add -no-undefined and $(LD_NO_UNDEFINED), just like we do everywhere else throughout mesa. Cc: [email protected] Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98428 Signed-off-by: Emil Velikov <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi: constify a bunch of the perfcounter structs.	Dave Airlie	2017-05-04	3	-52/+46
\| \| \| \| \| \| \| \|	This moves the structs from the data segment to the rodata segment, which seems like the more correct place for them. Reviewed-by: Nicolai Hähnle <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	radeonsi/gfx9: fix gl_ViewportIndex	Marek Olšák	2017-05-03	2	-8/+40
\| \| \| \| \| \| \|	v2: remove unnecessary LLVMBuildAnd calls Cc: 17.1 <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	radeonsi/gfx9: set VGT_REUSE_OFF = 0	Marek Olšák	2017-05-03	1	-3/+7
\| \| \| \| \| \|	same as Vulkan Reviewed-by: Nicolai Hähnle <[email protected]>
*	etnaviv: add L8A8_UNORM texture format	Christian Gmeiner	2017-05-03	1	-0/+2
\| \| \| \| \| \| \| \|	No piglit regressions. CC: <[email protected]> Signed-off-by: Christian Gmeiner <[email protected]> Reviewed-by: Philipp Zabel <[email protected]>
*	ac: rename ac_eliminate_const_vs_outputs -> ac_optimize_vs_outputs	Marek Olšák	2017-05-03	1	-5/+5
\| \| \| \| \|	Reviewed-by: Dave Airlie <[email protected]> Reviewed-by: Nicolai Hähnle <[email protected]>
*	vc4: Use runtime CPU detection for whether NEON is available.	Eric Anholt	2017-05-02	2	-14/+16
\| \| \| \| \| \| \| \|	This will allow Raspbian's ARMv6 builds to take advantage of the new NEON code, and could prevent problems if vc4 ends up getting used on a v7 CPU without NEON. v2: Drop dead NEON_SUFFIX (noted by Erik Faye-Lund)
*	vc4: Use a wrapper file to set VC4_BUILD_NEON instead of CFLAGS.	Eric Anholt	2017-05-02	4	-8/+31
\| \| \| \| \| \| \| \| \|	Android.mk was setting the flag across the entire driver, so we didn't have non-NEON versions getting built. This was going to be a problem with the next commit, when I start auto-detecting NEON support and use the non-NEON version when appropriate. Reviewed-by: Rob Herring <[email protected]>
*	gallium: Enable ARM NEON CPU detection.	Eric Anholt	2017-05-02	3	-0/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I wrote this code with reference to pixman, though I've only decided to cover Linux (what I'm testing) and Android (seems obvious enough). Linux has getauxval() as a cleaner interface to the /proc entry, but it's more glibc-specific and I didn't want to add detection for that. This will be used to enable NEON at runtime on ARMv6 builds of vc4. v2: Actually initialize the temp vars in the Android path (noticed by daniels) v3: Actually pull in the cpufeatures library (change by robher). Use O_CLOEXEC. Break out of the loop when we find our feature. v4: Drop VFP code, which was confused about what it was detecting and not actually used yet. Reviewed-by: Grazvydas Ignotas <[email protected]>