mesa.git

	Commit message	Author	Age	Files	Lines
*	nir/opt_algebraic: Remove the encoding line	Jason Ekstrand	2016-04-14	1	-1/+0
\| \| \| \|	This is an unneeded diff between the vulkan and master branches
*	spirv: Move to compiler/	Jason Ekstrand	2016-04-14	11	-8/+8
\| \| \| \| \| \|	While it does rely on NIR, it's not really part of the NIR core. At the moment, it still builds as part of libnir but that can be changed later if desired.
*	nir: Remove some pointless delta between vulkan and master	Jason Ekstrand	2016-04-14	2	-3/+0
\|
*	Merge remote-tracking branch 'public/master' into vulkan	Jason Ekstrand	2016-04-13	281	-2798/+4973
\|\
\| *	clover: Fix build against LLVM SVN >= r266163	Michel Dänzer	2016-04-14	1	-0/+14
\| \| \| \| \| \| \| \| \| \| \| \|	createInternalizePass now takes a callback instead of a StringSet. Reviewed-by: Francisco Jerez <[email protected]>
\| *	nir/dead_variables: Configurably work with any variable mode	Jason Ekstrand	2016-04-13	5	-13/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The old version of the pass only worked on globals and locals and always left inputs, outputs, uniforms, etc. alone. Reviewed-by: Kenneth Graunke <[email protected]>
\| *	i965: Switch to NIR for ldexp lowering.	Kenneth Graunke	2016-04-13	2	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The old GLSL IR based lowering doesn't quite work right in all cases, and fails several dEQP-GLES31 and Vulkan CTS tests. Jason's new approach in NIR passes all the tests. There's not likely to be a ton of advantage to lowering early in GLSL IR anyway, so...switch. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Matt Turner <[email protected]>
\| *	nir/algebraic: Add lowering for ldexp	Jason Ekstrand	2016-04-13	1	-0/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The algorithm used is different from both the naive suggestion from the GLSL spec and the one used in GLSL IR today. Unfortunately, the GLSL IR implementation that we have today doesn't handle denormals (for those that care) or the case where the float source is +-inf. Reviewed-by: Matt Turner <[email protected]>
\| *	i965: Implement the new imod and irem opcodes	Jason Ekstrand	2016-04-13	2	-0/+72
\| \| \| \| \| \| \| \|	Reviewed-by: Matt Turner <[email protected]>
\| *	nir: Add more modulus opcodes	Jason Ekstrand	2016-04-13	2	-1/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	These are all needed for SPIR-V Reviewed-by: Rob Clark <[email protected]> Reviewed-by: Matt Turner <[email protected]>
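For context, SPIR-V distinguishes a signed remainder (result takes the sign of the dividend) from a signed modulus (result takes the sign of the divisor). The sketch below assumes the new irem and imod opcodes follow that same split; the function names are illustrative, and a zero divisor is not handled.

```cpp
// Remainder: sign follows the dividend. This is what C++ '%' computes.
int irem_sketch(int a, int b) {
    return a % b;
}

// Modulus: sign follows the divisor. Start from the remainder and
// shift it by one divisor when the signs disagree.
int imod_sketch(int a, int b) {
    int r = a % b;
    if (r != 0 && ((r < 0) != (b < 0)))
        r += b;
    return r;
}
```

The two only differ when the operands have opposite signs: for -7 and 3 the remainder is -1 while the modulus is 2.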
\| *	i965/vec4: Inline get_pull_constant_offset	Jason Ekstrand	2016-04-13	2	-25/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	It's not really doing enough anymore to justify a helper function. Reviewed-by: Eduardo Lima Mitev <[email protected]> Reviewed-by: Kristian Høgsberg <[email protected]>
\| *	nir/lower_io: Allow for a full bitmask of modes	Jason Ekstrand	2016-04-13	2	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Acked-by: Eric Anholt <[email protected]> Reviewed-by: Rob Clark <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
\| *	nir/lower_indirect: nir_variable_mode is now a bitfield	Jason Ekstrand	2016-04-13	2	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \|	Acked-by: Eric Anholt <[email protected]> Reviewed-by: Rob Clark <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
\| *	nir: Convert nir_variable_mode to a bitfield	Jason Ekstrand	2016-04-13	2	-17/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are several passes where we need to specify some set of variable modes that the pass needs to operate on. This lets us easily do that. Acked-by: Eric Anholt <[email protected]> Reviewed-by: Rob Clark <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
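A minimal sketch of why a bitfield helps here: once each mode is its own bit, a pass can take one mask describing every mode it should touch. The enumerator names and values below are hypothetical stand-ins, not the actual `nir_variable_mode` definition.

```cpp
#include <cstdint>

// Hypothetical mode enum: one bit per variable mode.
enum VarMode : std::uint32_t {
    MODE_SHADER_IN  = 1u << 0,
    MODE_SHADER_OUT = 1u << 1,
    MODE_GLOBAL     = 1u << 2,
    MODE_LOCAL      = 1u << 3,
    MODE_UNIFORM    = 1u << 4,
};

// A pass receives a mask of modes and only processes variables whose
// mode bit is set in that mask.
bool pass_applies_to(std::uint32_t mode_mask, VarMode var_mode) {
    return (mode_mask & var_mode) != 0;
}
```

A caller that wants the old "globals and locals only" behaviour would pass `MODE_GLOBAL | MODE_LOCAL`, while another caller could add inputs or outputs to the same mask without a new pass variant.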
\| *	gallium/swr: Make flat shading tris work.	George Kyriazis	2016-04-13	2	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	- Incorporate flatshade flag into the shader generation - Use provoking vertex (vc) in shader when flat shading. Reviewed-by: Bruce Cherniak <[email protected]>
\| *	Revert "freedreno/a4xx: better occlusion/sample counting"	Rob Clark	2016-04-13	1	-6/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit 62fa868728c729152af0d7cecd1d3e47e831cb7d. dEQP-GLES3.functional.occlusion_query.* was unhappy about that change. Still not really sure what the other slots in the sample results buffer are. Signed-off-by: Rob Clark <[email protected]>
\| *	freedreno/a4xx: rasterizer_discard support	Rob Clark	2016-04-13	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This one is slightly annoying, since trying to write RBRC from draw would clobber values set in the tiling/gmem code. We could do command- stream patching for RBRC, as is done on a3xx. Although since it seems to be a rarely used feature, it is easier just to do RMW to set/clear the bit. Fixes dEQP-GLES3.functional.rasterizer_discard.basic.write_depth_triangles and related tests. a3xx still needs the same feature, although there it probably makes more sense to take advantage of the existing cmdstream patching which is required for RBRC for other reasons. Signed-off-by: Rob Clark <[email protected]>
\| *	freedreno/ir3: fix array textures on a4xx	Rob Clark	2016-04-13	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Seems like a4xx needs offset added to array index for all arrays, whereas a3xx only for cubemap arrays. Fixes a whole swath of dEQP fails (roughly sampler2darray). Signed-off-by: Rob Clark <[email protected]>
\| *	freedreno: fix stream-out offset handling for lines/tris	Rob Clark	2016-04-13	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We need to increment offset by # of vertices, not by # of prims. Fixes a bunch of dEQP fails involving prims other than points. For example, dEQP-GLES3.functional.transform_feedback.position.lines_separate Signed-off-by: Rob Clark <[email protected]>
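The distinction between vertices and primitives can be sketched as follows, assuming the offset is counted in vertices and only plain point, line, and triangle lists are involved. `Prim`, `verts_per_prim`, and `advance_offset` are illustrative names, not driver code.

```cpp
enum class Prim { Points, Lines, Triangles };

// Vertices written to the stream-out buffer for each primitive.
unsigned verts_per_prim(Prim p) {
    switch (p) {
    case Prim::Points:    return 1;
    case Prim::Lines:     return 2;
    case Prim::Triangles: return 3;
    }
    return 0;
}

// Advance the offset by the number of vertices emitted, not by the
// number of primitives.
unsigned advance_offset(unsigned offset, Prim p, unsigned num_prims) {
    return offset + num_prims * verts_per_prim(p);
}
```

For points the two counts coincide, which is consistent with the failures only showing up for primitives other than points.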
\| *	freedreno: fix handling for stream-out offsets	Rob Clark	2016-04-13	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If changed && append, we shouldn't be resetting the internal offset back to zero. This fixes issues w/ sequences like: glBeginTransformFeedback() glDraw() glPauseTransformFeedback() glDraw() glResumeTransformFeedback() glDraw() glEndTransformFeedback() Fixes dEQP-GLES3.functional.transform_feedback.array.separate.points.lowp_vec3 and related tests. Signed-off-by: Rob Clark <[email protected]>
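A rough model of the rule described above, assuming a resume re-binds the target with an "append" request. The struct and function are hypothetical and much simpler than the driver's state tracking.

```cpp
// Hypothetical per-target state: how far into the buffer we have written.
struct StreamOutTarget {
    unsigned offset = 0;
};

// Sketch of a bind hook. 'changed' means the bound target differs from
// the previous one; 'append' means the caller asked to continue where
// the last write stopped (as on resume).
void bind_target(StreamOutTarget &t, bool changed, bool append) {
    (void)changed;  // a changed binding alone must not reset the offset
    if (!append)
        t.offset = 0;  // only a non-append bind restarts at the beginning
}
```

With this rule the pause/resume sequence keeps accumulating into the buffer instead of overwriting the output of the first draw.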
\| *	freedreno: fix prims-emitted query	Rob Clark	2016-04-13	3	-2/+12
\| \| \| \| \| \| \| \| \| \| \| \|	This should only count when TF is not paused. Signed-off-by: Rob Clark <[email protected]>
\| *	freedreno: fix max-line-width	Rob Clark	2016-04-13	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	dEQP noticed that we were advertising completely bogus values. The actual maximum is 127.0f. But we have to use an artificially low maximum to work around a bug in the dEQP test, which gets confused when the max line width is too large and lines start going off-screen. Signed-off-by: Rob Clark <[email protected]>
\| *	freedreno: add flag to enable dEQP hacks	Rob Clark	2016-04-13	2	-0/+2
\| \| \| \| \| \| \| \|	Signed-off-by: Rob Clark <[email protected]>
\| *	freedreno/ir3: hack to avoid getting stuck in a loop	Rob Clark	2016-04-13	1	-1/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are still some edge cases which result in a neighbor-loop. Which needs to be fixed, but this hack at least makes deqp tests finish. Signed-off-by: Rob Clark <[email protected]>
\| *	freedreno/ir3: use (ss) instead of (sy) for ldlv	Rob Clark	2016-04-13	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixes a bunch of flat-varying fail on a4xx (where we need to use ldlv to read the un-interpolated varying). Signed-off-by: Rob Clark <[email protected]>
\| *	freedreno/ir3: cleanup double cmps.s from frontend	Rob Clark	2016-04-13	1	-0/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since we cannot mov into a predicate register, the frontend uses a 'cmps.s p0.x, cond, 0' as a stand-in for mov to p0.x. It does this since it has no way to know that the source cond instruction (ie. for a kill, br, etc) will only be used to write the predicate reg. Detect this, and re-write the instruction writing p0.x to skip the original cmps.[sfu]. (It is done like this, rather than re-writing the dest of the first cmps.[sfu] in case the first cmps.[sfu] actually has other users.) Signed-off-by: Rob Clark <[email protected]>
\| *	glsl: Rename "vertex_input_slots" -> "is_vertex_input"	Matt Turner	2016-04-13	6	-14/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vertex_input_slots would be an appropriate name for an integer, but not a bool. Also remove a cond ? true : false from a count_attribute_slots() call site, noticed during the rename. Reviewed-by: Timothy Arceri <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
\| *	gallivm: Workaround LLVM PR 27332.	Jose Fonseca	2016-04-13	1	-3/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The credit for finding and isolating this bug goes to Vinson and Roland. The buggy LLVM versions were found by doing opt -instcombine llvm-pr27332.ll > /dev/null where llvm-pr27332.ll is the IR from https://llvm.org/bugs/show_bug.cgi?id=27332#c3 Reviewed-by: Roland Scheidegger <[email protected]>
\| *	gallium/radeon: move a comment to the correct place	Marek Olšák	2016-04-13	1	-3/+3
\| \| \| \| \| \| \| \|	trivial
\| *	radeonsi: gate PIPE_CAP_SHADER_BUFFER_OFFSET_ALIGNMENT by LLVM version	Nicolai Hähnle	2016-04-13	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Otherwise we incorrectly claim ARB_ssbo support even with older LLVM versions. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94917 Reviewed-by: Michel Dänzer <[email protected]> Reviewed-by: Bas Nieuwenhuizen <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
\| *	gallium/radeon: add an env variable to force a level of aniso filtering	Marek Olšák	2016-04-13	5	-9/+31
\| \| \| \| \| \| \| \|	Reviewed-by: Nicolai Hähnle <[email protected]>
\| *	llvmpipe: Test rounding of x.5.	Jose Fonseca	2016-04-13	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Leverage nearbyintf function, which should be available on all C99 implementations. Trivial.
\| *	gallivm: use llvm.nearbyint instead of llvm.round.	Roland Scheidegger	2016-04-13	1	-98/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We used to use sse roundps intrinsic directly, but switched to use the llvm intrinsics for rounding with e4f01da15d8c6ce3e8c77ff3ff3d2ce2574a3f7b. However, llvm semantics follows standard math lib round function which is specced to do roundNearestAwayFromZero but we really want roundNearestEven (moreover, using round generates atrocious code since the cpu can't do it directly and it results in scalar calls to libm __roundf). So, use llvm.nearbyint instead, which does exactly the right thing, and even has the advantage of being available with llvm 3.3 too. (I've verified it actually generates a roundps instruction with llvm 3.3.) This fixes https://bugs.freedesktop.org/show_bug.cgi?id=94909 Reviewed-by: Jose Fonseca <[email protected]>
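The difference between the two rounding behaviours can be seen with the C++ standard library equivalents. This sketch assumes the default floating-point rounding mode (round to nearest, ties to even); the wrapper names are illustrative.

```cpp
#include <cmath>

// Ties round away from zero: 0.5 -> 1, 1.5 -> 2, 2.5 -> 3.
float round_away(float x) {
    return std::round(x);
}

// Uses the current rounding mode. Under the default mode ties go to
// the even neighbour: 0.5 -> 0, 1.5 -> 2, 2.5 -> 2.
float round_even(float x) {
    return std::nearbyint(x);
}
```

The two functions agree everywhere except on exact x.5 inputs, which is the case the llvmpipe rounding test in the neighbouring commit exercises.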
\| *	nv50/ra: `isinf()` is in namespace `std` since C++11.	Pierre Moreau	2016-04-13	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This fixes a compile error while building Nouveau with C++11 enabled (and glibc >= 2.23). This happens if SWR is enabled, as it forces C++11. Signed-off-by: Pierre Moreau <[email protected]> Signed-off-by: Jose Fonseca <[email protected]> https://bugs.freedesktop.org/show_bug.cgi?id=94907
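The four added lines are not shown in this log. One portable way to avoid this kind of error, sketched here as an assumption rather than as the commit's actual change, is to qualify the call so it always resolves to the `<cmath>` overloads.

```cpp
#include <cmath>

// Qualifying the call with std:: makes it resolve the same way whether
// or not the C library also exposes an unqualified isinf.
bool is_infinite(float v) {
    return std::isinf(v);
}
```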
\| *	scons: Allow building with Address Sanitizer.	Jose Fonseca	2016-04-13	2	-6/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	libasan is never linked to shared objects (which doesn't go well with -z,defs). It must either be linked to the main executable, or (more practically for OpenGL drivers) be pre-loaded via LD_PRELOAD. Otherwise works. I didn't find anything with llvmpipe. I suspect the fact that the JIT compiled code isn't instrumented means there are lots of errors it can't catch. But for non-JIT drivers, the Address/Leak Sanitizers seem like a faster alternative to Valgrind. Usage (Ubuntu 15.10): scons asan=1 libgl-xlib export LD_LIBRARY_PATH=$PWD/build/linux-x86_64-debug/gallium/targets/libgl-xlib LD_PRELOAD=libasan.so.2 any-opengl-application Acked-by: Roland Scheidegger <[email protected]>
\| *	mesa: Change an error code in glSamplerParameterI[iu]v().	Kenneth Graunke	2016-04-12	1	-4/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is supposed to be INVALID_OPERATION in ES. We already did this for the fv/iv variants, but not Iiv/Iuv, which are new in ES 3.2 (or extensions). Fixes: ES31-CTS.texture_border_clamp.samplerparameteri_non_gen_sampler_error Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ilia Mirkin <[email protected]>
\| *	softpipe: Free tgsi.image elements on context destruction.	Jose Fonseca	2016-04-13	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Courtesy of address sanitizer. [airlied: free buffers as well] Reviewed-by: Roland Scheidegger <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
\| *	softpipe: Enable ARB_framebuffer_no_attachments	Edward O'Callaghan	2016-04-13	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Edward O'Callaghan <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
\| *	vc4: Work around hardware limits on the number of verts in a single draw.	Eric Anholt	2016-04-12	1	-18/+92
\| \| \| \| \| \| \| \| \| \|	Fixes rendering failures in glmark2's refract and bump:render-mode=high-poly demos, and partially in its terrain demo.
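The message does not say how the limit is worked around or what the limit is. One common approach, sketched here purely as an assumption, is to split an oversized draw into several smaller ones. For a triangle list each chunk has to be a multiple of three vertices so that no triangle straddles two draws. `split_triangle_draw` and `max_verts` are hypothetical names.

```cpp
#include <utility>
#include <vector>

// Split a triangle-list draw into (start, count) chunks of at most
// max_verts vertices, each chunk a whole number of triangles.
std::vector<std::pair<unsigned, unsigned>>
split_triangle_draw(unsigned start, unsigned count, unsigned max_verts) {
    std::vector<std::pair<unsigned, unsigned>> chunks;
    unsigned step = max_verts - (max_verts % 3);  // round down to a multiple of 3
    if (step == 0)
        return chunks;  // limit too small to hold a single triangle
    while (count > 0) {
        unsigned n = count < step ? count : step;
        chunks.push_back({start, n});
        start += n;
        count -= n;
    }
    return chunks;
}
```

Strips and fans would need overlapping vertices between chunks, which this sketch does not attempt.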
\| *	softpipe: avoid buffer overflow	Thomas Hindoe Paaboel Andersen	2016-04-13	2	-8/+8
\| \| \| \| \| \| \| \|	Signed-off-by: Dave Airlie <[email protected]>
\| *	tgsi: fix buffer overflow	Thomas Hindoe Paaboel Andersen	2016-04-13	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Increase r to four channels as rgba is written to it Reviewed-by: Edward O'Callaghan <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
\| *	swr: handle pci cap requests	Tim Rowley	2016-04-12	1	-0/+4
\| \| \| \| \| \| \| \|	Reviewed-by: George Kyriazis <[email protected]>
\| *	swr: support samplers in vertex shaders	Tim Rowley	2016-04-12	6	-105/+228
\| \| \| \| \| \| \| \|	Reviewed-by: George Kyriazis <[email protected]>
\| *	radeonsi: enable GLSL 4.20 and therefore OpenGL 4.2	Nicolai Hähnle	2016-04-12	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the last necessary bit for OpenGL 4.2 support. All driver-specific functionality has already been implemented as part of extensions. Reviewed-by: Edward O'Callaghan <[email protected]>
\| *	va: check null context in vlVaDestroyContext	Iurie Salomov	2016-04-13	1	-0/+4
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Iurie Salomov <[email protected]> Reviewed-by: Julien Isorce <[email protected]>
\| *	nir/clone: Copy bit size when cloning registers	Jason Ekstrand	2016-04-12	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Reported-by: Mark Janes <[email protected]> Reviewed-by: Eduardo Lima Mitev <[email protected]>
\| *	radeonsi: fix a critical SI hang since PIPELINESTAT_START/STOP was added	Marek Olšák	2016-04-13	2	-9/+2
\| \| \| \| \| \| \| \| \| \|	For some reason unknown to me, SI hangs if the event is written after CONTEXT_CONTROL.
\| *	glsl: Don't copy propagate or tree graft precise values.	Kenneth Graunke	2016-04-12	3	-1/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is kind of a hack. We currently track precise requirements by decorating ir_variables. Propagating or grafting the RHS of an assignment to a precise value into some other expression tree can lose those decorations. In the long run, it might be better to replace these ir_variable decorations with an "exact" decoration on ir_expression nodes, similar to what NIR does. In the short run, this is probably good enough. It preserves enough information for glsl_to_nir to generate "exact" decorations, and NIR will then handle optimizing these expressions reasonably. Fixes ES31-CTS.gpu_shader5.precise_qualifier. v2: Drop invariant handling, as it shouldn't be necessary (caught by Jason Ekstrand). Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
\| *	i965/tiled_memcpy: Fix rgba8_copy_16_aligned_dst() typo	Kristian Høgsberg Kristensen	2016-04-12	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Copy and paste error in commit eafeb8db66dae7619ff3cb039706b990d718cba7: i965/tiled_memcpy: Unroll bytes==64 case. Signed-off-by: Kristian Høgsberg Kristensen <[email protected]> Reviewed-by: Matt Turner <[email protected]>
\| *	glsl/linker: Recurse on struct fields when adding shader variables	Kristian Høgsberg Kristensen	2016-04-12	1	-6/+45
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ARB_program_interface_query requires that we add struct fields recursively down to basic types. Fixes 52 struct test cases in dEQP-GLES31.functional.program_interface_query.* Signed-off-by: Kristian Høgsberg Kristensen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
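The recursion can be pictured with a toy type tree: a struct contributes no entry of its own, and each leaf of basic type is listed under its dotted path. `TypeSketch` and `add_variables` are hypothetical and ignore arrays, locations, and everything else the linker tracks.

```cpp
#include <string>
#include <vector>

// Hypothetical type node: a basic type has no fields, a struct has some.
struct TypeSketch {
    std::string name;
    std::vector<TypeSketch> fields;
};

// Walk down to basic types, collecting names such as "s.inner.b".
void add_variables(const TypeSketch &t, const std::string &prefix,
                   std::vector<std::string> &out) {
    std::string full = prefix.empty() ? t.name : prefix + "." + t.name;
    if (t.fields.empty()) {
        out.push_back(full);  // basic type: record it
        return;
    }
    for (const TypeSketch &f : t.fields)
        add_variables(f, full, out);  // struct: recurse into each field
}
```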