mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	i965: Consolidate is_3src() functions.	Matt Turner	2015-11-12	3	-8/+7
\| \| \| \|	Otherwise I'll have to add another later in this series.
*	st/wgl: add a comment about recursive locking in stw_make_current()	Brian Paul	2015-11-12	1	-0/+4
\| \| \| \| \|	Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]>
*	st/wgl: add a lock assertion in stw_framebuffer_from_hwnd_locked()	Brian Paul	2015-11-12	1	-0/+1
\| \| \| \| \|	Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]>
*	st/wgl: add some mutex checking code	José Fonseca	2015-11-12	1	-0/+26
\| \| \| \| \| \| \| \| \| \| \|	This would have caught the locking bug that was fixed in the earlier "st/wgl: fix locking issue in stw_st_framebuffer_present_locked()" patch. v2: minor coding style changes by Brian. Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]>
*	st/wgl: rename stw_framebuffer_release() to stw_framebuffer_unlock()	Brian Paul	2015-11-12	5	-19/+19
\| \| \| \| \| \| \|	To match the new stw_framebuffer_lock() function. Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]>
*	st/wgl: reimplement stw_framebuffer::mutex with CRITICAL_SECTION	Brian Paul	2015-11-12	4	-29/+32
\| \| \| \| \| \| \| \|	v2: update comments on the stw_framebuffer::mutex field regarding locking order. Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]>
*	st/wgl: include u_debug.h	Brian Paul	2015-11-12	3	-0/+6
\| \| \| \| \| \| \| \|	To get declaration for debug_printf() directly instead of getting it indirectly through os_thread.h Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]>
*	st/wgl: reimplement stw_device::fb_mutex with CRITICAL_SECTION	Brian Paul	2015-11-12	3	-15/+29
\| \| \| \| \|	Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]>
*	st/wgl: re-implement stw_device::ctx_mutex with CRITICAL_SECTION	Brian Paul	2015-11-12	3	-19/+34
\| \| \| \| \| \| \| \| \|	This is Windows-only code so we can use the native Win32 functions for critical sections. This will also allow us to (cleanly) add some mutex check/debug code in subsequent patches. Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]>
*	gallium/hud: add cpu graph support for Windows	Brian Paul	2015-11-12	1	-0/+54
\| \| \| \| \| \| \| \| \| \|	We support "cpu" but not "cpu#" because there's no good way of querying per-cpu usage. Also, the cpu usage is for the process, not the whole system. Original code cobbled together by Brian and then fixed/polished by Jose. Signed-off-by: Brian Paul <[email protected]>
*	glsl: set matrix_stride for non matrices with atomic counter buffers	Tapani Pälli	2015-11-12	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Patch sets matrix_stride as 0 for non matrix uniforms that are in a atomic counter buffer. Matrix stride calculation for actual matrix uniforms is done during link_assign_uniform_locations. From ARB_program_interface_query specification: GL_MATRIX_STRIDE: "For active variables not declared as a matrix or array of matrices, zero is written to <params>. For active variables not backed by a buffer object, -1 is written to <params>, regardless of the variable type." Signed-off-by: Tapani Pälli <[email protected]> Reviewed-by: Marta Lofstedt <[email protected]>
*	mesa: validate precision of varyings during ValidateProgramPipeline	Tapani Pälli	2015-11-12	3	-0/+80
\| \| \| \| \| \| \| \| \| \| \|	Fixes following failing ES3.1 CTS tests: ES31-CTS.sepshaderobjs.InterfacePrecisionMatchingFloat ES31-CTS.sepshaderobjs.InterfacePrecisionMatchingInt ES31-CTS.sepshaderobjs.InterfacePrecisionMatchingUInt Signed-off-by: Tapani Pälli <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	glsl: do not lose precision information when packing varyings	Tapani Pälli	2015-11-12	1	-0/+1
\| \| \| \| \| \| \| \|	This information will be used by cross stage validation of varyings for pipeline objects. Signed-off-by: Tapani Pälli <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]>
*	glsl: Add precision information to ir_variable	Iago Toral Quiroga	2015-11-12	4	-24/+303
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We will need this later on when we implement proper support for precision qualifiers in the drivers and also to do link time checks for uniforms as indicated by the spec. This patch also adds compile-time checks for variables without precision information (currently, Mesa only checks that a default precision is set for floats in fragment shaders). As indicated by Ian, the addition of the precision information to ir_variable has been done using a bitfield and pahole to identify an available hole so that memory requirements for ir_variable stay the same. v2 (Ian): - Avoid if-ladders by defining arrays of supported sampler names and indexing into them with type->sampler_array + 2 * type->sampler_shadow - Make the code that selects the precision qualifier to use an utility function - Fix a typo v3 (Tapani): - rebased - squashed in "Precision qualifiers are not allowed on structs" - fixed select_gles_precision for sampler arrays - fixed precision_qualifier_allowed for arrays of structs v4 (Tapani): - add atomic_uint handling - do not allow precision qualifier on images (issues reported by Marta) v5 (Tapani): - support precision qualifier on image types v6 (Tapani): - set precision qualifier on interface block members Reviewed-by: Samuel Iglesias Gonsálvez <[email protected]>
*	glsl: Move the definition of precision_qualifier_allowed	Iago Toral Quiroga	2015-11-12	1	-36/+35
\| \| \| \| \| \| \|	We will need this to build later patches Reviewed-by: Tapani Pälli <[email protected]> Reviewed-by: Samuel Iglesias Gonsálvez <[email protected]>
*	glsl: Add user-defined default precision qualifiers to the symbol table	Iago Toral Quiroga	2015-11-12	1	-19/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Notice that the spec requires that a default precision has been set for every type used by a shader that can use a precision qualifier and does not have a predefined precision, however, at the moment, Mesa only checks this for floats in the fragment shader. This is probably because the GLSL ES 1.0 specs mentions this case specifically, but GLSL ES 3.0 clarifies that the same applies to other types: "The fragment language has no default precision qualifier for floating point types. Hence for float, floating point vector and matrix variable declarations, either the declaration must include a precision qualifier or the default float precision must have been previously declared. Similarly, there is no default precision qualifier for the following sampler types in either the vertex or fragment language: sampler3D; samplerCubeShadow; sampler2DShadow; sampler2DArray; sampler2DArrayShadow; isampler2D; isampler3D; isamplerCube; isampler2DArray; usampler2D; usampler3D; usamplerCube; usampler2DArray;" we will fix this in a later patch. Reviewed-by: Tapani Pälli <[email protected]> Reviewed-by: Samuel Iglesias Gonsálvez <[email protected]>
*	glsl: Add default precision qualifiers to the symbol table	Iago Toral Quiroga	2015-11-12	1	-0/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The GLSL ES spec specifies default precision qualifiers for certain types, so populate the symbol table with these. Notice that the desktop GLSL spec also indicates defaults for some types but this is not really useful since precision qualifiers are completely ignored in desktop GLSL. v2: simplify and add samplerExternalOES, specified by OES_EGL_image_external (Tapani) v3: add atomic_uint (reported missing by Marta) Reviewed-by: Tapani Pälli <[email protected]>
*	glsl: Add API to put default precision qualifiers in the symbol table	Iago Toral Quiroga	2015-11-12	2	-0/+26
\| \| \| \| \| \| \| \|	These have scoping rules that match the ones defined for other things such as variables, so we want them in the symbol table. Reviewed-by: Tapani Pälli <[email protected]> Reviewed-by: Samuel Iglesias Gonsálvez <[email protected]>
*	i965/fs/nir: fix the number of register written by FS_OPCODE_GET_BUFFER_SIZE	Samuel Iglesias Gonsálvez	2015-11-12	1	-2/+14
\| \| \| \| \| \| \| \| \| \| \|	FS_OPCODE_GET_BUFFER_SIZE is calculated with a resinfo's sampler message. This patch adjusts the number of registers written by the opcode following what the PRM spec says about the number of registers written by the SIMD8 and SIMD16's writeback messages for sampler messages. Signed-off-by: Samuel Iglesias Gonsálvez <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	i965/skl/gt4: Fix URB programming restriction.	Ben Widawsky	2015-11-11	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The comment in the code details the restriction. Thanks to Ken for having a very helpful conversation with me, and spotting the blurb in the link I sent him :P. There are still stability problems for me on GT4, but this definitely helps with some of the failures. v2: Comment fixes Cc: [email protected] Reviewed-by: Jordan Justen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	nv50,nvc0: add ARB_clear_texture support	Ilia Mirkin	2015-11-11	7	-8/+103
\| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]>
*	st/mesa: implement ARB_clear_texture	Ilia Mirkin	2015-11-11	2	-0/+30
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	gallium: add PIPE_CAP_CLEAR_TEXTURE and clear_texture prototype	Ilia Mirkin	2015-11-11	18	-0/+31
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	glsl: add helper to check for enhanced layouts support	Timothy Arceri	2015-11-12	1	-0/+5
\| \| \| \|	Reviewed-by: Emil Velikov <[email protected]>
*	mesa: add ARB_enhanced_layouts	Timothy Arceri	2015-11-12	5	-0/+6
\| \| \| \|	Reviewed-by: Emil Velikov <[email protected]>
*	r600: initialised PGM_RESOURCES_2 for ES/GS	Dave Airlie	2015-11-12	2	-0/+6
\| \| \| \| \| \| \| \| \| \| \|	This fixes the corruption on rendering that we are seeing in certain geometry shaders. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=91780 Reviewed-by: Alex Deucher <[email protected]> Tested / Reviewed-by: Glenn Kennard <[email protected]> Cc: "10.6" "11.0" <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	i965: Split nir_emit_intrinsic by stage with a general fallback.	Kenneth Graunke	2015-11-11	2	-277/+381
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Many intrinsics only apply to a particular stage (such as discard). In other cases, we may want to interpret them differently based on the stage (such as load_primitive_id or load_input). The current method isn't that pretty - we handle all intrinsics in one giant function. Sometimes we assert on stage, sometimes we forget. Different behaviors are handled via if-ladders based on stage. This commit introduces new nir_emit_<stage>_intrinsic() functions, and makes nir_emit_instr() call those. In turn, those fall back to the generic nir_emit_intrinsic() function for cases they don't want to handle specially. This makes it clear which intrinsics only exist in one stage, and makes it easy to handle inputs/outputs differently for various stages. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Jordan Justen <[email protected]>
*	mesa/copyimage: allow width/height to not be multiples of block	Ilia Mirkin	2015-11-11	1	-3/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For compressed textures, the image size is not necessarily a multiple of the block size (e.g. the last mip levels). Section 18.3.2 (Copying Between Images) of the OpenGL 4.5 Core Profile spec says: An INVALID_VALUE error is generated if the dimensions of either subregion exceeds the boundaries of the corresponding image object, or if the image format is compressed and the dimensions of the subregion fail to meet the alignment constraints of the format. and Section 8.7 (Compressed Texture Images) says: An INVALID_OPERATION error is generated if any of the following conditions occurs: * width is not a multiple of four, and width + xoffset is not equal to the value of TEXTURE_WIDTH. * height is not a multiple of four, and height + yoffset is not equal to the value of TEXTURE_HEIGHT. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92860 Signed-off-by: Ilia Mirkin <[email protected]> Acked-by: Alex Deucher <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Ian Romanick <[email protected]> Cc: [email protected]
*	i965/brw_reg: Add a brw_VxH_indirect helper	Jason Ekstrand	2015-11-11	1	-0/+11
\| \| \| \|	Reviewed-by: Kenneth Graunke <[email protected]>
*	mesa: remove old comments in arrayobj.c	Brian Paul	2015-11-11	1	-5/+0
\|
*	st/wgl: clarify code in stw_framebuffer_from_hwnd_locked()	Brian Paul	2015-11-11	1	-2/+2
\| \| \| \| \| \| \| \| \|	Just a minor code change to make it obvious that NULL is returned when we don't find the given HWND. Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]> Reviewed-by: José Fonseca <[email protected]>
*	st/wgl: improve some function comments	Brian Paul	2015-11-11	1	-6/+30
\| \| \| \| \| \| \| \| \|	In particular, explain when stw_framebuffer objects are locked/unlocked/etc. Reviewed-by: Sinclair Yeh <[email protected]> Reviewed-by: Charmaine Lee <[email protected]> Reviewed-by: José Fonseca <[email protected]>
*	st/wgl: whitespace/formatting fixes	Brian Paul	2015-11-11	3	-63/+48
\|
*	st/wgl: fix locking issue in stw_st_framebuffer_present_locked()	Brian Paul	2015-11-11	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	When stw_st_framebuffer_present_locked() is called, the stw_framebuffer's mutex will already be locked. Normally, the stw_framebuffer_present_locked() function calls stw_framebuffer_release() to unlock the mutex when it's done. But if for some reason the 'resource' pointer in stw_st_framebuffer_present_locked() is null, we'd return without unlocking the stw_framebuffer. This fixes that to avoid potential deadlocks. Reviewed-by: Charmaine Lee <[email protected]>
*	i965: Print force_writemask_all in dump_instructions().	Kenneth Graunke	2015-11-11	2	-0/+6
\| \| \| \| \|	Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965: Combine BRW_NEW_*_BINDING_TABLE dirty bits.	Kenneth Graunke	2015-11-11	5	-26/+14
\| \| \| \| \| \| \| \| \| \| \|	A while back, we moved to directly emitting the Gen7+ state when constructing the binding tables. These flags are only used on Gen4-6, which emit all the binding table pointers at once. We gain nothing by having separate flags, so combine them. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Topi Pohjolainen <[email protected]>
*	i965: Map GL_PATCHES to 3DPRIM_PATCHLIST_n.	Kenneth Graunke	2015-11-11	2	-1/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Inspired by a patch by Fabian Bieler. Fabian defined a _3DPRIM_PATCHLIST_0 macro (which isn't actually a valid topology type); I instead chose to make a macro that takes an argument. He also took the number of patch vertices from _mesa_prim (which was set to ctx->TessCtrlProgram.patch_vertices) - I chose to use it directly to avoid the need for the VBO patch. v2: Change macro to 0x20 + (n - 1) instead of 0x1F + n to better match the documentation (suggested by Ian). Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	docs: add news item and link release notes for 11.0.5	Emil Velikov	2015-11-11	2	-0/+7
\| \| \| \|	Signed-off-by: Emil Velikov <[email protected]>
*	docs: add sha256 checksums for 11.0.5	Emil Velikov	2015-11-11	1	-1/+2
\| \| \| \| \|	Signed-off-by: Emil Velikov <[email protected]> (cherry picked from commit 66c949d0a19b1e601243be22b6506528b866388b)
*	docs: add release notes for 11.0.5	Emil Velikov	2015-11-11	1	-0/+173
\| \| \| \| \|	Signed-off-by: Emil Velikov <[email protected]> (cherry picked from commit ee57c22141c42d9b511a7dfa5971c4428cd1c6e7)
*	r600g: Pass conservative depth parameters to hw	Glenn Kennard	2015-11-11	7	-1/+53
\| \| \| \| \| \| \| \|	Supported on R700 and up. Signed-off-by: Glenn Kennard <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	Revert "r600g: Pass conservative depth parameters to hw"	Dave Airlie	2015-11-11	6	-46/+0
\| \| \| \| \| \|	This reverts commit a1fc78911e9a6439db94d6ae91d5672c76e5fb1c. I pushed the wrong patch.
*	r600g: Implement ARB_texture_view	Glenn Kennard	2015-11-11	4	-9/+20
\| \| \| \| \| \|	Signed-off-by: Glenn Kennard <[email protected]> Reviewed-by: Marek Olšák <[email protected]> Signed-off-by: Dave Airlie <[email protected]>
*	r600g: Pass conservative depth parameters to hw	Glenn Kennard	2015-11-11	6	-0/+46
\| \| \| \| \| \| \|	Supported on R700 and up. Signed-off-by: Glenn Kennard <[email protected]> Reviewed-by: Marek Olšák <[email protected]>
*	i965/nir/opt_peephole_ffma: Bypass fusion if any operand of fadd and fmul is ↵	Eduardo Lima Mitev	2015-11-10	1	-0/+31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a const When both fadd and fmul instructions have at least one operand that is a constant and it is only used once, the total number of instructions can be reduced from 3 (1 ffma + 2 load_const) to 2 (1 fmul + 1 fadd); because the constants will be progagated as immediate operands of fmul and fadd. This patch detects these situations and prevents fusing fmul+fadd into ffma. Shader-db results on i965 Haswell: total instructions in shared programs: 6235835 -> 6225895 (-0.16%) instructions in affected programs: 1124094 -> 1114154 (-0.88%) total loops in shared programs: 1979 -> 1979 (0.00%) helped: 7612 HURT: 843 GAINED: 4 LOST: 0 Reviewed-by: Jason Ekstrand <[email protected]>
*	util: Add list_is_singular() helper function	Eduardo Lima Mitev	2015-11-10	1	-0/+8
\| \| \| \| \| \|	Returns whether the list has exactly one element. Reviewed-by: Matt Turner <[email protected]>
*	nir/nir_opt_peephole_ffma: Move this lowering pass to the i965 driver	Eduardo Lima Mitev	2015-11-10	6	-9/+10
\| \| \| \| \| \| \| \| \|	Because the next patch will add an optimization that is specific to i965, we want to move this loweing pass to that driver altogether. This is safe because i965 is the only consumer. Reviewed-by: Jason Ekstrand <[email protected]>
*	glsl: Use array deref for access to vector components	Kristian Høgsberg Kristensen	2015-11-10	10	-68/+138
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We've assumed that we could lower per-component vector access from vec[i] = scalar to vec = ir_triop_vector_insert(vec, scalar, i) but with SSBOs (and compute shader SLM and tesselation outputs) this is no longer valid. If a vector is "externally visible", multiple threads can write independent components simultaneously. With lowering to ir_triop_vector_insert, each thread read the entire vector, changes one component, then writes out the entire vector. This is racy. Instead of generating a ir_binop_vector_extract when we see v[i], we generate ir_dereference_array. We then add a lowering pass to lower the ir_dereference_array to ir_binop_vector_extract for rvalues and for to vector_insert for lvalues in a separate lowering pass. The resulting IR is the same as before, but we now have a window between ast->ir conversion and the lowering pass where v[i] appears in the IR as an array deref. This lets us run lowering passes that lower the vector access to I/O (eg for SSBO load/store) before we lower the per-component access to full vector writes. Reviewed-by: Jordan Justen <[email protected]> Signed-off-by: Kristian Høgsberg Kristensen <[email protected]>
*	glsl: Lower UBO and SSBO access in glsl linker	Kristian Høgsberg Kristensen	2015-11-10	6	-3/+13
\| \| \| \| \| \| \| \| \| \| \|	All GLSL IR consumers run this lowering pass so we can move it to the linker. This moves the pass up quite a bit, but that's the point: it needs to run before we throw away information about per-component vector access. Reviewed-by: Iago Toral Quiroga <[email protected]> Reviewed-by: Jordan Justen <[email protected]> Signed-off-by: Kristian Høgsberg Kristensen <[email protected]>
*	glsl: Drop exec_list argument to lower_ubo_reference	Kristian Høgsberg Kristensen	2015-11-10	4	-5/+5
\| \| \| \| \| \| \| \| \| \| \|	We always pass in shader->ir and we already pass in the shader, so just drop the exec_list. Most passes either take just a exec_list or a shader, so this seems more consistent. Reviewed-by: Timothy Arceri <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]> Reviewed-by: Jordan Justen <[email protected]> Signed-off-by: Kristian Høgsberg Kristensen <[email protected]>