mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	glapi: add GL_EXT_polygon_offset_clamp	Ilia Mirkin	2015-02-02	4	-1/+24
\| \| \| \| \|	Signed-off-by: Ilia Mirkin <[email protected]> Reviewed-by: Glenn Kennard <[email protected]>
*	glsl: Pick ast_conditional branch regardless of op1/2 being constant.	Kenneth Graunke	2015-02-02	1	-4/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the ?: operator's condition is a constant value, and both branches were pure expressions, we can just make the resulting value one or the other. Previously, we only did this if op[1] and op[2] were also constant values - but there's no actual reason for that restriction. No changes in shader-db, probably because we usually optimize this later anyway. But it does make us generate less stupid code up front. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	i965: Add a better PRM citation for the IMS dimension mangling.	Kenneth Graunke	2015-02-02	1	-1/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Paul originally had to reverse engineer these formulas based on the description about how the sampler works. The description here is not the easiest to follow - especially given that it's from the Sandybridge era, when the hardware only did 4x multisampling. Jordan and I recently found another part of the documentation where they simply state that IMS dimensions must be adjusted by a set of formulas. Quoting this section provides an easy to follow explanation for the code, including 2x/4x/8x/16x. Signed-off-by: Kenneth Graunke <[email protected]> Reviewed-by: Chad Versace <[email protected]>
*	swrast: Whitespace fixes.	Laura Ekstrand	2015-02-02	3	-12/+12
\| \| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	DD: Refactor BlitFramebuffer.	Laura Ekstrand	2015-02-02	20	-83/+130
\| \| \| \| \| \| \| \| \|	In preparation for glBlitNamedFramebuffer, the DD table function BlitFramebuffer needs to accept two arbitrary framebuffer objects rather than assuming ctx->ReadBuffer and ctx->DrawBuffer. Reviewed-by: Jason Ekstrand <[email protected]> Reviewed-by: Brian Paul <[email protected]>
*	i965: Don't use tiled_memcpy to download from RGBX or BGRX surfaces	Jason Ekstrand	2015-02-02	2	-0/+14
\| \| \| \| \| \|	Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=88841 Reviewed-by: Anuj Phogat <[email protected]>
*	dir-locals.el: Don't set variables for non-programming modes	Neil Roberts	2015-02-02	8	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This limits the style changes to modes inherited from prog-mode. The main reason to do this is to avoid setting fill-column for people using Emacs to edit commit messages because 78 characters is too many to make it wrap properly in git log. Note that makefile-mode also inherits from prog-mode so the fill column should continue to apply there. v2: Apply to all the .dir-locals.el files, not just the one in the root directory. Acked-by: Michel Dänzer <[email protected]>
*	i965: Fix intel_miptree_copy_teximage for GL_TEXTURE_1D_ARRAY	Iago Toral Quiroga	2015-02-02	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \|	For GL_TEXTURE_1D_ARRAY targets we store the depth of the array in the Height field and leave Depth=1 in the underlying texture object. When we call intel_miptree_copy_teximage in the process of re-creating a miptree (possibily because the number of miplevels has changed) we didn't account for this, so we where only copying texture images for the first slice. Reviewed-by: Jason Ekstrand <[email protected]>
*	vc4: Kill a bunch of color write calculation when colormask is all off.	Eric Anholt	2015-02-01	1	-8/+35
\| \| \| \| \| \| \| \| \| \| \| \| \|	I could have done this in the bit that generates the ANDs and ORs, but it's probably generally useful. Sadly, I still need this even if I move to NIR, because I can't yet express my read of the destination color in NIR, which I would need to move my blend/logicop/colormask handling into NIR. total uniforms in shared programs: 13497 -> 13455 (-0.31%) uniforms in affected programs: 101 -> 59 (-41.58%) total instructions in shared programs: 40797 -> 40296 (-1.23%) instructions in affected programs: 1639 -> 1138 (-30.57%)
*	vc4: Dump the VPM read index in QIR disasm.	Eric Anholt	2015-02-01	1	-4/+9
\| \| \| \| \|	Since the VPM reads have to be in order, it's useful to see their indices in the dump.
*	i965/pixel_read: Don't try to do a tiled_memcpy from a multisampled buffer	Jason Ekstrand	2015-01-31	1	-0/+7
\| \| \| \| \| \| \| \| \| \|	The GL spec guarantees that glGetTexImage will never get a multisampled texture, but this is not true for glReadPixels. If we get a multisampled buffer, we have to do a multisample resolve on it before we can pull the data down for the user. Since this isn't practical to handle in tiled_memcpy, we just fall back to the other paths that can handle this. Reviewed-by: Kenneth Graunke <[email protected]>
*	i965: Enable L3 caching of buffer surfaces.	Francisco Jerez	2015-01-31	4	-9/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	And remove the mocs argument of the emit_buffer_surface_state vtbl hook. Its semantics vary greatly from one generation to another, so it kind of encourages the caller to pass 0 which is the only valid setting across generations. After this commit the hardware-specific code decides what the best cacheability settings are for buffer surfaces, just like we do for textures. This together with some additional changes coming is expected to improve performance of pull constants, buffer textures, atomic counters and image objects on Gen7 and up. Reviewed-by: Kenneth Graunke <[email protected]>
*	egl: Pass the correct X visual depth to xcb_put_image().	José Fonseca	2015-01-31	1	-11/+13
\| \| \| \| \| \| \| \| \| \|	The dri2_x11_add_configs_for_visuals() function happily matches a 32 bits EGLconfig with a 24 bits X visual. However it was passing 32bits depth to xcb_put_image(), making X server unhappy: https://github.com/apitrace/apitrace/issues/313#issuecomment-70571911 Cc: "10.4" <[email protected]>
*	intel/pixel_read: Properly flip the results for window system buffers	Jason Ekstrand	2015-01-30	1	-0/+15
\| \| \| \| \| \|	Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=88841 Reviewed-by: Chad Versace <[email protected]>
*	i965/tiled_memcpy: Support a signed linear pitch	Jason Ekstrand	2015-01-30	2	-17/+17
\| \| \| \|	Reviewed-by: Chad Versace <[email protected]>
*	main: Add STENCIL_INDEX formats to base_tex_format	Jason Ekstrand	2015-01-30	1	-0/+10
\| \| \| \| \| \| \| \|	This fixes a bug on BDW when our meta-based stencil blit path assert-fails due to an invalid internal format even though we do support the ARB_stencil_texturing extension. Reviewed-by: Matt Turner <[email protected]>
*	teximage: Don't indent switch cases	Jason Ekstrand	2015-01-30	1	-146/+146
\| \| \| \| \| \|	No functional change. Reviewed-by: Matt Turner <[email protected]>
*	mesa: remove some dead display list code	Brian Paul	2015-01-30	1	-80/+11
\| \| \| \| \| \| \|	The size of a Node is always four bytes so no need for the old code that was used when sizeof(Node)==8. Reviewed-by: Matt Turner <[email protected]>
*	mesa: remove stale comment in dlist.c code	Brian Paul	2015-01-30	1	-1/+1
\| \| \| \| \| \|	sizeof(Node) is always 4 bytes. Reviewed-by: Matt Turner <[email protected]>
*	mesa: s/union gl_dlist_node/Node/ in dlist.c code	Brian Paul	2015-01-30	1	-3/+3
\| \| \| \| \| \|	Just minor clean-up. Reviewed-by: Matt Turner <[email protected]>
*	mesa: fix display list 8-byte alignment issue	Brian Paul	2015-01-30	3	-7/+74
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The _mesa_dlist_alloc() function is only guaranteed to return a pointer with 4-byte alignment. On 64-bit systems which don't support unaligned loads (e.g. SPARC or MIPS) this could lead to a bus error in the VBO code. The solution is to add a new _mesa_dlist_alloc_aligned() function which will return a pointer to an 8-byte aligned address on 64-bit systems. This is accomplished by inserting a 4-byte NOP instruction in the display list when needed. The only place this actually matters is the VBO code where we need to allocate a 'struct vbo_save_vertex_list' which needs to be 8-byte aligned (just as if it were malloc'd). The gears demo and others hit this bug. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=88662 Cc: "10.4" <[email protected]> Reviewed-by: José Fonseca <[email protected]>
*	util/u_atomic: Provide a _InterlockedCompareExchange8 for older MSVC.	José Fonseca	2015-01-30	1	-1/+29
\| \| \| \| \| \| \| \|	Fixes build with Windows SDK 7.0.7600. Tested with u_atomic_test, both on x86 and x86_64. Reviewed-by: Roland Scheidegger <[email protected]>
*	util/u_atomic: Use _Interlocked* intrinsics for non 64bits.	José Fonseca	2015-01-30	1	-12/+12
\| \| \| \| \| \| \|	The intrinsics are universally available, whereas older Windows SDKs (e.g. 7.0.7600) don't have the non-intrisic entrypoint. Reviewed-by: Roland Scheidegger <[email protected]>
*	i965/skl: Force a BINDING_TABLE_POINTER_* after push constant command	Neil Roberts	2015-01-30	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	According to the SKL bspec the 3DSTATE_CONSTANT_* commands only take effect on the next corresponding 3DSTATE_BINDING_TABLE_POINTER_* command. This patch just makes it set the BRW_NEW_SURFACES state when uploading the push constants to ensure the binding tables will be updated. This fixes the fbo-blending-formats Piglit test and possibly others. Reviewed-by: Kristian Høgsberg <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]>
*	meta: Don't write depth when decompressing tex-images	Topi Pohjolainen	2015-01-30	1	-1/+1
\| \| \| \| \| \|	Signed-off-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	meta: Don't write depth when generating miptrees	Topi Pohjolainen	2015-01-30	1	-1/+1
\| \| \| \| \| \|	Signed-off-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	meta/blit: Compile programs with and without depth	Topi Pohjolainen	2015-01-30	2	-5/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When color buffers alone are concerned the depth is not needed. No regression on BDW where meta blit is used instead of blorp. I also disabled blorp temporarily for fbo-blits on IVB and saw no regressions there either. I also compared several graphics benchmarks on BDW and saw neither regressions or improvements. Signed-off-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	meta/blit: Write depth only when asked for	Topi Pohjolainen	2015-01-30	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implementing an idea from Ken, on i965 the shader program for 2D blits becomes significantly simpler. Before: pln(8) g6<1>F g4<0,1,0>F g2<8,8,1>F { align1 1Q compacted }; pln(8) g7<1>F g4.4<0,1,0>F g2<8,8,1>F { align1 1Q compacted }; send(8) g2<1>UW g6<8,8,1>F sampler (1, 0, 0, 1) mlen 2 rlen 4 { align1 1Q }; mov(8) g123<1>F g2<8,8,1>F { align1 1Q compacted }; mov(8) g124<1>F g3<8,8,1>F { align1 1Q compacted }; mov(8) g125<1>F g4<8,8,1>F { align1 1Q compacted }; mov(8) g126<1>F g5<8,8,1>F { align1 1Q compacted }; mov(8) g127<1>F g2<8,8,1>F { align1 1Q compacted }; nop ; sendc(8) null g123<8,8,1>F render RT write SIMD8 LastRT Surface = 0 mlen 5 rlen 0 { align1 1Q EOT }; After: pln(8) g6<1>F g4<0,1,0>F g2<8,8,1>F { align1 1Q compacted }; pln(8) g7<1>F g4.4<0,1,0>F g2<8,8,1>F { align1 1Q compacted }; send(8) g124<1>UW g6<8,8,1>F sampler (1, 0, 0, 1) mlen 2 rlen 4 { align1 1Q }; sendc(8) null g124<8,8,1>F render RT write SIMD8 LastRT Surface = 0 mlen 4 rlen 0 { align1 1Q EOT }; v2 (Matt): Removed unintended white-space change Signed-off-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	meta/blit: Add plumbing for shaders without depth	Topi Pohjolainen	2015-01-30	4	-3/+5
\| \| \| \| \| \| \| \| \|	Currently all blit programs are unconditionally compiled with gl_FragDepth. Signed-off-by: Topi Pohjolainen <[email protected]> Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Matt Turner <[email protected]>
*	nir/opt_algebraic: Add some constant bcsel reductions	Jason Ekstrand	2015-01-29	1	-2/+28
\| \| \| \| \| \| \| \|	total instructions in shared programs: 5998190 -> 5997603 (-0.01%) instructions in affected programs: 54276 -> 53689 (-1.08%) helped: 293 Reviewed-by: Kenneth Graunke <[email protected]>
*	nir/opt_algebraic: Add some boolean simplifications	Jason Ekstrand	2015-01-29	1	-4/+5
\| \| \| \| \| \| \| \|	total instructions in shared programs: 5998321 -> 5998287 (-0.00%) instructions in affected programs: 4520 -> 4486 (-0.75%) helped: 8 Reviewed-by: Kenneth Graunke <[email protected]>
*	nir/algebraic: Support specifying variable as constant or by type	Jason Ekstrand	2015-01-29	2	-6/+26
\| \| \| \|	Reviewed-by: Kenneth Graunke <[email protected]>
*	nir/algebraic: Fail to compile of a variable is used in a replace but not ↵	Jason Ekstrand	2015-01-29	1	-0/+7
\| \| \| \| \| \|	the search Reviewed-by: Kenneth Graunke <[email protected]>
*	nir/search: Allow for matching variables based on types	Jason Ekstrand	2015-01-29	2	-0/+23
\| \| \| \| \| \| \| \| \| \| \|	This allows you to match on an unknown value but only if it is of a given type. 90% of the uses of this are for matching only booleans, but adding the generality of arbitrary types is no more complex. nir_algebraic.py doesn't handle this yet but that's ok because the C language will ensure that the default type on all variables is void. Reviewed-by: Kenneth Graunke <[email protected]>
*	nir/search: Add support for matching unknown constants	Jason Ekstrand	2015-01-29	2	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \|	There are some algebraic transformations that we want to do but only if certain things are constants. For instance, we may want to replace a * (b + c) with (a * b) + (a * c) as long as a and either b or c is constant. While this generates more instructions, some of it will get constant folded. nir_algebraic.py doesn't handle this yet, but that's ok because the C language will make sure that false is the default for now. Reviewed-by: Kenneth Graunke <[email protected]>
*	nir: Add an invalid type	Jason Ekstrand	2015-01-29	1	-0/+1
\| \| \| \| \| \|	This allows us to indicate a concept of an invalid type. Reviewed-by: Kenneth Graunke <[email protected]>
*	gallium/docs: fix docs wrt ARL/ARR/FLR	Roland Scheidegger	2015-01-29	1	-10/+8
\| \| \| \| \| \| \| \| \|	since the address reg holds integer values, ARL/ARR do an implicit float-to-int conversion, so clarify that. Thus it is also incorrect to say that FLR really does the same as ARL. Reviewed-by: Alex Deucher <[email protected]> Reviewed-by: Jose Fonseca <[email protected]>
*	nir: Add variants of some of the comparison simplifications.	Eric Anholt	2015-01-29	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \|	We end up with these from TGSI-to-NIR because the pass generating the comparisons doesn't know if the arg is actually a bool input or not. vc4 results: total instructions in shared programs: 41801 -> 41508 (-0.70%) instructions in affected programs: 4253 -> 3960 (-6.89%) Reviewed-by: Matt Turner <[email protected]>
*	vc4: Fix point size handling when it's the first output.	Eric Anholt	2015-01-29	1	-1/+1
\|
*	nir: Don't try to to-SSA ALU instructions that are already SSA.	Eric Anholt	2015-01-29	1	-0/+3
\| \| \| \|	Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: Fix a bit of broken indentation.	Eric Anholt	2015-01-29	1	-1/+1
\| \| \| \| \|	Reviewed-by: Matt Turner <[email protected]> Reviewed-by: Jason Ekstrand <[email protected]>
*	nir: Add a couple of helpers for glsl types.	Eric Anholt	2015-01-29	2	-1/+16
\| \| \| \| \| \| \| \| \| \|	This will be used by tgsi_to_nir, which needs to get vec4 types for declaring shader input/output variables. v2: Add a missing space. Reviewed-by: Matt Turner <[email protected]> (v2) Reviewed-by: Jason Ekstrand <[email protected]>
*	Mesa: Advertise GL_OES_texture_float extensions support with i965.	Kalyan Kondapally	2015-01-29	1	-0/+5
\| \| \| \| \| \| \| \| \|	This patch advertises support for GL_OES_texture_float extensions when using i965 drivers. Signed-off-by: Kevin Rogovin <[email protected]> Signed-off-by: Kalyan Kondapally <[email protected]> Reviewed-by: Tapani Pälli <[email protected]>
*	Mesa: Add support for HALF_FLOAT_OES type.	Kalyan Kondapally	2015-01-29	4	-6/+52
\| \| \| \| \| \| \| \| \|	This patch adds needed support for accepting HALF_FLOAT_OES as valid type for TexImageD and TexSubImageD when Texture FLoat extensions are supported. Signed-off-by: Kevin Rogovin <[email protected]> Signed-off-by: Kalyan Kondapally <[email protected]> Reviewed-by: Tapani Pälli <[email protected]>
*	Mesa: Add support for GL_OES_texture_float extensions.	Kalyan Kondapally	2015-01-29	4	-0/+132
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch series adds support for following GLES2 Texture Float extensions: 1)GL_OES_texture_float, 2)GL_OES_texture_half_float, 3)GL_OES_texture_float_linear, 4)GL_OES_texture_half_float_linear. This patch adds basic infrastructure and needed boolean flags to advertise support for these extensions, by default the support is disabled. Next patch in the series introduces support for HALF_FLOAT_OES token. v4: take assert away and make valid_filter_for_float conditional (Tapani), fix the alphabetical order (Emil) Signed-off-by: Kevin Rogovin <[email protected]> Signed-off-by: Kalyan Kondapally <[email protected]> Reviewed-by: Tapani Pälli <[email protected]>
*	nir: Make vec-to-movs handle src/dest aliasing.	Eric Anholt	2015-01-28	1	-10/+72
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It now emits vector MOVs instead of a series of individual MOVs, which should be useful to any vector backends. This pushes the problem of src/dest aliasing of channels on a scalar chip to the backend, but if there are any vector operations in your shader then you needed to be handling this already. Fixes fs-swap-problem with my scalarizing patches. v2: Rename to insert_mov(), and add a comment about what it does. v3: Rewrite the comment. Reviewed-by: Connor Abbott <[email protected]> (v3)
*	gallium: Replace u_simple_list.h with util/simple_list.h	Eric Anholt	2015-01-28	25	-228/+23
\| \| \| \| \| \| \|	The code was exactly the same, except util/ has c++ guards and a struct simple_node declaration. Reviewed-by: Marek Olšák <[email protected]>
*	mesa: Port a variant of 68afbe89c72d085dcbbf2b264f0201ab73fe339e to util/	Eric Anholt	2015-01-28	1	-0/+1
\| \| \| \| \| \| \| \|	The idea is that after a remove_from_list(), you might want to be able to do a remove_from_list() on it again or an is_empty_list(). This is apparently relied on by r300g. Reviewed-by: Marek Olšák <[email protected]>
*	mesa: Move simple_list.h to src/util.	Eric Anholt	2015-01-28	31	-28/+29
\| \| \| \| \| \|	We have two copies of it in the tree, I'm going to delete one. Reviewed-by: Marek Olšák <[email protected]>
*	radeonsi: Enable VGPR spilling for all shader types v5	Tom Stellard	2015-01-28	8	-52/+217
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	v2: - Only emit write SPI_TMPRING_SIZE once per packet. - Use context global scratch buffer. v3: - Patch shaders using WRITE_DATA packet instead of map/unmap. - Emit ICACHE_FLUSH, CS_PARTIAL_FLUSH, PS_PARTIAL_FLUSH, and VS_PARTIAL_FLUSH when patching shaders. v4: - Code cleanups. - Remove unnecessary multiplies. v5: - Patch shaders in system memory and re-upload to vram. Reviewed-by: Michel Dänzer <[email protected]>