| Commit message | Author | Age | Files | Lines |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
When downsampling from an MSAA image to a single-sampled image, it is
inevitable that some loss of numerical precision will occur, since we
have to use 32-bit floating point registers to hold the intermediate
results while blending. However, it seems reasonable to expect that
when all samples corresponding to a given pixel have the exact same
color value, there will be no loss of precision.
Previously, we averaged samples as follows:
blend = (((sample[0] + sample[1]) + sample[2]) + sample[3]) / 4
This had the potential to lose numerical precision when all samples
have the same color value, since ((sample[0] + sample[1]) + sample[2])
may not be precisely representable as a 32-bit float, even if the
individual samples are.
This patch changes the formula to:
blend = ((sample[0] + sample[1]) + (sample[2] + sample[3])) / 4
This avoids any loss of precision in the event that all samples are
the same, by ensuring that each addition operation adds two equal
values.
As a side benefit, this puts the formula in the form we will need in
order to implement correct blending of integer formats.
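A minimal standalone C illustration of the two orderings, assuming all four
samples hold the same value (a toy model; the real blend is emitted as shader
code, not C):

    #include <stdio.h>

    int main(void)
    {
        float s = 0.1f;
        float sample[4] = { s, s, s, s };

        /* Old order: the running sum 3*s may not be exactly representable,
         * so rounding can creep in before the divide. */
        float serial = (((sample[0] + sample[1]) + sample[2]) + sample[3]) / 4.0f;

        /* New order: each add combines two equal values, so the sum is
         * exactly 4*s and the divide by 4 recovers s with no error. */
        float pairwise = ((sample[0] + sample[1]) + (sample[2] + sample[3])) / 4.0f;

        printf("serial=%.9g pairwise=%.9g exact=%.9g\n", serial, pairwise, s);
        return 0;
    }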
Reviewed-by: Anuj Phogat <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
From the Ivy Bridge PRM, Vol4 Part3 p152:
"The avg instruction performs component-wise integer average of
src0 and src1 and stores the results in dst. An integer average
uses integer upward rounding. It is equivalent to increment one to
the addition of src0 and src1 and then apply an arithmetic right
shift to this intermediate value."
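A scalar C model of that definition for the unsigned case (a hypothetical
helper, not driver code):

    #include <stdint.h>

    /* avg(src0, src1) with integer upward rounding: add one to the sum,
     * then shift right by one.  The widened sum avoids overflow. */
    static uint32_t int_avg_round_up(uint32_t src0, uint32_t src1)
    {
        return (uint32_t)(((uint64_t)src0 + src1 + 1) >> 1);
    }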
Reviewed-by: Kenneth Graunke <[email protected]>
Reviewed-by: Anuj Phogat <[email protected]>
|
|
|
|
|
|
|
| |
The kill_emitted variable was duplicating the functionality of
gl_fragment_program::UsesKill. There's no need for both.
Reviewed-by: Eric Anholt <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Previously, the code for setting this flag for GLSL programs was
duplicated in three places: brw_link_shader(), glsl_to_tgsi_visitor,
and ir_to_mesa_visitor. In addition to the unnecessary duplication,
there was a performance problem on i965: brw_link_shader() set the
flag before doing its final round of optimizations, which meant that
if the optimizations managed to eliminate all the discard operations,
the flag would still be set, resulting (at least in theory) in slower
performance.
This patch consolidates all of the code that sets UsesKill for GLSL
programs into do_set_program_inouts(), which already is doing a
similar job for UsesDFdy, and which occurs after i965's final round of
optimizations.
Non-GLSL programs (ARB programs and the state tracker's glBitmap
program) are unaffected.
Reviewed-by: Eric Anholt <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
| |
The i965 back-end needs to compile dFdy() differently for FBOs and
window system framebuffers, because Y coordinates are flipped between
the two (see commit 82d2596: i965: Compute dFdy() correctly for FBOs).
This patch avoids unnecessarily recompiling shaders that don't use
dFdy(), by only setting render_to_fbo in the wm program key if the
shader actually uses dFdy().
Reviewed-by: Kenneth Graunke <[email protected]>
Reviewed-by: Ian Romanick <[email protected]>
|
|
|
|
|
|
| |
Reviewed-by: Brian Paul <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
Reviewed-by: Ian Romanick <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
| |
The i965 back-end needs to compile dFdy() differently for FBOs and
window system framebuffers, because Y coordinates are flipped between
the two (see commit 82d2596: i965: Compute dFdy() correctly for FBOs).
This boolean will allow it to avoid unnecessarily recompiling shaders
that don't use dFdy().
Reviewed-by: Brian Paul <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
Reviewed-by: Ian Romanick <[email protected]>
|
|
|
|
|
|
|
|
|
|
| |
The previous commit implemented the workaround, cited a bug report
about OilRush, but actually only enabled the workaround for the demos.
Turn it on for OilRush too.
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50291
Signed-off-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Unigine Heaven (at least) has a bug where it incorrectly uses the
GL_ARB_blend_func_extended extension.
Dual source blending allows two color outputs per render target;
individual shader outputs can be assigned to be either the first or
second blending input by setting the 'index' via one of two methods:
- An API call: glBindFragDataLocationIndexed()
- The GLSL 'layout' qualifier provided by GL_ARB_explicit_attrib_location
Both of these only work on user defined fragment shader outputs; it's an
error to use either on built-in outputs like gl_FragData.
Unigine uses gl_FragData and gl_FragColor exclusively, and doesn't even
attempt to use either method to set index == 1. However, it does set
the blending function to SRC1 enums, which requires a fragment shader
output with index == 1 or else rendering is undefined.
In other words, enabling ARB_blend_func_extended causes Unigine to
render incorrectly, resulting in an apparent regression, even though our
driver code (as far as I can tell) is perfectly fine.
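For contrast, a sketch of the supported way to route a second, user-defined
output to blend input index 1 through the API (the program handle and output
names are placeholders; the layout(location = 0, index = 1) qualifier is the
GLSL-side equivalent):

    /* Assumes an extension loader or GL_GLEXT_PROTOTYPES exposes the
     * GL 3.3 / ARB_blend_func_extended entry point. */
    #define GL_GLEXT_PROTOTYPES
    #include <GL/gl.h>
    #include <GL/glext.h>

    static void bind_dual_source_outputs(GLuint prog)
    {
        /* Both names are user-defined fragment shader outputs; binding an
         * index is not allowed for built-ins such as gl_FragData. */
        glBindFragDataLocationIndexed(prog, 0, 0, "main_color");   /* SRC_COLOR inputs  */
        glBindFragDataLocationIndexed(prog, 0, 1, "second_color"); /* SRC1_COLOR inputs */
        glLinkProgram(prog); /* bindings take effect at the next link */
    }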
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=50291
Signed-off-by: Kenneth Graunke <[email protected]>
Reviewed-by: Chad Versace <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
|
| |
|
|
|
|
|
|
|
| |
In a few cases, remove unneeded casts, and fix a few other const-correctness issues.
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
| |
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Previously, if we were spilling the result of a texture call, we would store
all 4 regs, then for each use of one of those regs as the source of an
instruction, we would unspill all 4 regs even though only one was needed.
In both lightsmark and l4d2 with my current graphics config, the shaders that
produce spilling do so on split GRFs, so this doesn't help them out. However,
in a capture of the l4d2 shaders with a different snapshot and playing the
game instead of using a demo, it reduced one shader from 2817 instructions to
2179, due to choosing a now-cheaper texture result to spill instead of piles
of texcoords.
v2: Fix comment noted by Ken, and fix the if condition associated with it for
the current state of what constitutes a partial write of the destination.
Reviewed-by: Kenneth Graunke <[email protected]> (v1)
|
|
|
|
|
|
|
|
|
|
| |
There's one instance of a potential behavior change: propagate_constants may
now propagate into a part of a vgrf after a different part of it was
overwritten by a send that returns multiple registers. I don't think we ever
generate IR that meets that condition, but it's something to note if we ever
bisect a behavior change to this commit.
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
| |
In these places, we care about any sort of send that hits more than one reg,
not just textures. We don't yet have anything else returning more than one
reg, so there's no change.
v2: Use mlen instead of is_tex() for the is-it-a-send check.
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
| |
"count" is a more useful name, since most of the time we're using it for
looping over the variables.
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
| |
This was accidentally copy-and-pasted inside.
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
integer format
The OpenGL 3.3 specification, section 4.1.3 (page 196), says:
"If drawbuffer zero is not NONE and the buffer it references has an
integer format, the SAMPLE_ALPHA_TO_COVERAGE and SAMPLE_ALPHA_TO_ONE
operations are skipped."
This should work properly even if there are other draw buffers that
are not in integer format.
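A condensed sketch of the resulting rule, with hypothetical types standing in
for the driver's framebuffer state (only the rule from the spec text above is
taken from the patch):

    #include <stdbool.h>
    #include <stddef.h>

    struct renderbuffer { bool is_integer_format; };
    struct framebuffer  { struct renderbuffer *draw_buffers[8]; };

    static bool apply_alpha_to_coverage_and_one(const struct framebuffer *fb)
    {
        const struct renderbuffer *rb0 = fb->draw_buffers[0];

        /* Skip both operations only when draw buffer zero is bound and has
         * an integer format; other draw buffers do not affect the decision. */
        return !(rb0 != NULL && rb0->is_integer_format);
    }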
This patch makes the following piglit tests pass on Mesa:
int-draw-buffers-alpha-to-coverage
int-draw-buffers-alpha-to-one
Reviewed-by: Chad Versace <[email protected]>
Reviewed-by: Ian Romanick <[email protected]>
Signed-off-by: Anuj Phogat <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
From the GL 3.0 spec, section 4.3.3, in the documentation for
CopyPixels():
"An INVALID_OPERATION error will be generated if the object bound
to READ_FRAMEBUFFER_BINDING is framebuffer complete and the value
of SAMPLE_BUFFERS is greater than zero."
The same applies to CopyTexImage...() and CopyTexSubImage...()
functions, since they are defined in terms of CopyPixels().
Previously we were generating an INVALID_FRAMEBUFFER_OPERATION error
in these cases.
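A condensed sketch of the corrected check, using hypothetical types (only the
choice of error code comes from the spec language above):

    #include <stdbool.h>

    enum copy_error { COPY_OK, COPY_INVALID_OPERATION,
                      COPY_INVALID_FRAMEBUFFER_OPERATION };

    struct read_framebuffer { bool complete; int sample_buffers; };

    static enum copy_error validate_copy_source(const struct read_framebuffer *fb)
    {
        if (!fb->complete)
            return COPY_INVALID_FRAMEBUFFER_OPERATION; /* incomplete read FBO */

        if (fb->sample_buffers > 0)
            return COPY_INVALID_OPERATION; /* multisampled source, GL 3.0 4.3.3 */

        return COPY_OK;
    }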
Fixes piglit tests
"EXT_framebuffer_multisample/negative-{copypixels,copyteximage}".
Reviewed-by: Brian Paul <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
| |
Reviewed-by: Chad Versace <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
| |
Reviewed-by: Chad Versace <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
| |
It's been broken (using NULL getBuffersWithFormat() instead of
getBuffers()) due to a copy and paste error for a year now.
GetBuffersWithFormat has been around since 2009, so I don't feel any
guilt in not supporting it.
Reviewed-by: Chad Versace <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
| |
Reviewed-by: Chad Versace <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This means that GLX buffer sharing of these no longer works. On the
other hand, just *look* at this code reduction.
v2:
- [chad] Fix intelCreateBuffer for gen < 6. When the branch for
!screen->hw_has_separate_stencil was taken,
intel_create_private_renderbuffer was incorrectly not used.
- [chad] Remove all code in intel_process_dri2_buffer for processing
depth, stencil, and hiz buffers. That code is now dead.
CC: Eric Anholt <[email protected]>
Signed-off-by: Chad Versace <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
| |
Reviewed-by: Chad Versace <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
| |
To fix the MSVC build.
|
|
|
|
|
|
|
|
|
|
|
|
| |
a112ca5d rather crassly smashed all the compiler flags together into AM_CFLAGS.
Separate them out the way they were before, putting pre-processor flags into
AM_CPPFLAGS, so assembly source gets preprocessed with the correct pre-processor
flags as well.
Also, remove unneeded CFLAGS from AM_CFLAGS and CXXFLAGS from AM_CXXFLAGS.
Signed-off-by: Jon TURNEY <[email protected]>
Tested-by: Brian Paul <[email protected]>
|
|
|
|
|
|
|
| |
I suck at resolving merge conflicts and broke the build in a5a34b1.
This patch adds the missing field intel_mipmap_tree::wraps_etc1.
Signed-off-by: Chad Versace <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Enable it for all hardware.
No current hardware supports ETC1, so this patch implements it by
translating the ETC1 data to RGBX data during the call to
glCompressedTexImage2D(). For details, see the doxygen for
intel_mipmap_tree::wraps_etc1.
Passes the Piglit test spec/OES_compressed_ETC1_RGB8_texture/miptree and
the ETC1 test in the GLES2 conformance suite.
Reviewed-by: Kenneth Graunke <[email protected]>
Signed-off-by: Chad Versace <[email protected]>
|
|
|
|
|
|
|
|
|
| |
Add function _mesa_etc1_unpack_rgba8888. It is intended to be used by
glCompressedTexSubImage2D to decode ETC1 textures into RGBA.
CC: Chia-I <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
Signed-off-by: Chad Versace <[email protected]>
|
|
|
|
|
|
|
|
|
|
| |
Move the body of util_etc1_rgb8_unpack_rgba_unorm8 into a new function
that can be shared between gallium and dri drivers,
texcompress_etc_tmp.h:etc1_unpack_rgba8888.
CC: Chia-I <[email protected]>
Reviewed-by: Kenneth Graunke <[email protected]>
Signed-off-by: Chad Versace <[email protected]>
|
|
|
|
|
| |
This generalizes and replaces gbm_bo_create_for_egl_image. gbm_bo_import
will create a gbm_bo from either an EGLImage or a struct wl_buffer.
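A usage sketch of the EGLImage case, assuming the import types that come with
this interface in gbm.h ('gbm' and 'egl_image' are caller-supplied
placeholders; GBM_BO_IMPORT_WL_BUFFER is the analogous type for a wl_buffer):

    #include <gbm.h>

    /* 'gbm' comes from gbm_create_device(); 'egl_image' from
     * eglCreateImageKHR().  Returns NULL on failure, like other gbm calls. */
    static struct gbm_bo *import_egl_image_bo(struct gbm_device *gbm,
                                              void *egl_image)
    {
        return gbm_bo_import(gbm, GBM_BO_IMPORT_EGL_IMAGE, egl_image,
                             GBM_BO_USE_SCANOUT | GBM_BO_USE_RENDERING);
    }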
|
|
|
|
|
|
|
|
|
|
| |
When we don't intend to texture from or render to a __DRIimage we
use __DRI_IMAGE_FORMAT_NONE. In that case, we just create the __DRIimage
to reference the underlying buffer, and will create usable __DRIimages
from it using createSubImage later.
If we try to use _mesa_get_format_bytes() on MESA_FORMAT_NONE in
a debug build, we hit an assertion, so let's not do that.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Tested by running piglit draw-instanced, and by forcing llvmpipe to advertise
no native integer support, which now produces:
VERT
DCL IN[0]
DCL SV[0], INSTANCEID
DCL OUT[0], POSITION
DCL OUT[1], COLOR
DCL CONST[0..19]
DCL TEMP[0], LOCAL
DCL TEMP[1], LOCAL
DCL TEMP[2], LOCAL
DCL ADDR[0]
0: U2F TEMP[0].x, SV[0]
1: ARL ADDR[0].x, TEMP[0].xxxx
2: MOV TEMP[1].xy, CONST[ADDR[0].x+8].xyxx
3: ADD TEMP[2].x, IN[0].xxxx, TEMP[1].xxxx
4: ADD TEMP[1].x, IN[0].yyyy, TEMP[1].yyyy
5: MUL TEMP[2], CONST[16], TEMP[2].xxxx
6: MAD TEMP[2], CONST[17], TEMP[1].xxxx, TEMP[2]
7: MAD TEMP[2], CONST[18], IN[0].zzzz, TEMP[2]
8: MAD TEMP[2], CONST[19], IN[0].wwww, TEMP[2]
9: ARL ADDR[0].x, TEMP[0].xxxx
10: MOV TEMP[1], CONST[ADDR[0].x]
11: MOV OUT[0], TEMP[2]
12: MOV OUT[1], TEMP[1]
13: END
|
|
|
|
|
|
|
|
|
|
| |
The libmesa convenience library is linked with the libglsl convenience
library. libOsmesa is linked with libmesa, and also directly with libglsl.
When using libtool, this gives rise to duplicate symbol errors.
Signed-off-by: Jon TURNEY <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
Tested-by: Matt Turner <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
* "configure substitutions are not allowed in _SOURCES variables" in automake,
so remove the AC_SUBST'ed GLAPI_ASM_SOURCES and instead use some AM_CONDITIONALS
to choose which asm sources are used
* Change GLAPI_LIB to point to the .la file in other Makefile.am files, and make a link
to the .a file for the convenience of other Makefiles which have not yet been converted
to automake
v2:
- Use AM_CPPFLAGS for cleaner build output
- EXTRA_SOURCES is not needed
- Remove libglapi.a compatibility link on clean
Signed-off-by: Jon TURNEY <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
Tested-by: Matt Turner <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
* "configure substitutions are not allowed in _SOURCES variables" in automake, so instead of
MESA_ASM_FILES, use some AM_CONDITIONALS to choose which architecture's asm sources are used
in libmesa_la_SOURCES. (Can't remove MESA_ASM_FILES autoconf variable as it's still used in
sources.mak)
* Update to link with the .la file in other Makefile.am files, and make a link to the
.a file for the convenience of other Makefiles which have not yet been converted to automake
v2: Remove stray -static from LDFLAGS
v3: Remove .a compatibility link on clean
Signed-off-by: Jon TURNEY <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
Tested-by: Matt Turner <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
| |
Automake can't handle having both clip.S and clip.c, even though they have different paths
"src/mesa/Makefile.am: object `clip.lo' created by `$(SRCDIR)/sparc/clip.S' and `$(SRCDIR)/main/clip.c'"
Signed-off-by: Jon TURNEY <[email protected]>
Reviewed-by: Matt Turner <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
Tested-by: Matt Turner <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
v2: Use AM_V_GEN to silence generated code rules. Add BUILT_SOURCES to CLEANFILES
v3:
- Fix an accidental // in a path
- Use automake make rules for lex/yacc rather than writing our own
- Update .gitignore appropriately
- Build a libglcpp convenience library rather than awkwardly including
the files in libglsl and delegating the generation
- Remove libglsl.a compatibility link on clean
v4:
- Automake's rules for lex/yacc make .cc if source is .ll or .yy, and apparently we
must use those extensions "because of scons", so update everywhere glsl_parser.cpp
-> glsl_parser.cc and glsl_lexer.cpp -> glsl_lexer.cc. This fixes 'make tarballs'
and building with dricore enabled.
Signed-off-by: Jon TURNEY <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
Tested-by: Matt Turner <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This also fixes the installation of libOSmesa.
v2: Remove old Makefile, libOSmesa is now versioned, fix typos
v3: Keep config substitution alphabetized
v4: Update .gitignore
v5: Libraries will be in the builddir, not the srcdir.
Reviewed-by: Jon TURNEY <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
Tested-by: Matt Turner <[email protected]>
|
|
|
|
|
|
|
|
|
|
| |
This was not implemented before because the spec was changed only recently;
everything else was already in place.
Gallium has PIPE_FORMAT_B5G6R5_UNORM, while Mesa has MESA_FORMAT_RGB565.
Reviewed-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
| |
The whole reason I avoided this was because it might operate on a
brw_vertex_program or a brw_fragment_program. However, that isn't a
problem: all we need is the gl_program base type.
This avoids awkwardly passing the loop counter 'i' as a parameter,
simplifies both callers, and also plumbs prog in place for future use.
Signed-off-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
If alpha-testing is enabled, we need to send alpha down the pipeline
even if nr_color_buffers == 0. However, tracking whether alpha-testing
is enabled in the WM program key is expensive: it causes us to compile
multiple specializations of the same shader, using program cache space.
This patch removes the check for alpha-testing, and simply emits alpha
whenever nr_color_buffers == 0. We believe this will also be necessary
for alpha-to-coverage, and it should add minimal overhead to an uncommon
case. Saving the recompiles should more than make up the difference.
Reviewed-by: Eric Anholt <[email protected]>
Signed-off-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
| |
Previously we only did this pre-Gen6, and used pwrite on Gen6+.
In one workload, this cuts a significant amount of overhead.
v2: Simplify the function based on Eric's suggestions.
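A simplified sketch of the two upload paths in terms of the libdrm Intel
buffer-manager API (the batch pointer and size are placeholders for the
driver's batchbuffer state):

    #include <string.h>
    #include <intel_bufmgr.h>

    /* Old Gen6+ path: copy the CPU-side batch into the BO with pwrite. */
    static void upload_with_pwrite(drm_intel_bo *bo,
                                   const void *batch, size_t used)
    {
        drm_intel_bo_subdata(bo, 0, used, batch);
    }

    /* New path: map the BO and memcpy into it directly. */
    static void upload_with_map(drm_intel_bo *bo,
                                const void *batch, size_t used)
    {
        drm_intel_bo_map(bo, 1 /* write */);
        memcpy(bo->virtual, batch, used);
        drm_intel_bo_unmap(bo);
    }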
Reviewed-by: Eric Anholt <[email protected]>
Signed-off-by: Kenneth Graunke <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
It had many problems:
- The shadow comparison was done post-filtering.
- It required state-dependent recompiles whenever the comparison
function changed.
- It didn't even work: many cases hit assertion failures.
- I never implemented it for the VS.
The new lowering pass which converts textureGrad to textureLod by
computing the LOD value works much better.
Signed-off-by: Kenneth Graunke <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Intel hardware doesn't natively support textureGrad with shadow
comparisons. So we need to generate code to handle it somehow.
Based on the equations on page 205 of the OpenGL 3.0 specification,
it's possible to compute the LOD value that would be selected given the
gradient values. Then, we can simply convert the TXD to a TXL.
Currently, this passes 34/46 of oglconform's shadow-grad subtests;
four cubemap tests are regressed. We should investigate this in the
future.
v2: Apply abs() to the scalar case (thanks to Eric).
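A scalar C sketch of the 2D case of those equations (texture size and
gradients are placeholders; the actual pass emits equivalent IR, and the 1D
reduction is where the abs() mentioned in v2 comes in):

    #include <math.h>

    /* lambda = log2(rho), where rho is the larger of the two screen-space
     * gradient lengths after scaling texcoord derivatives into texel space. */
    static float lod_from_gradients(float dudx, float dvdx,
                                    float dudy, float dvdy,
                                    float width, float height)
    {
        float sx = dudx * width, tx = dvdx * height;
        float sy = dudy * width, ty = dvdy * height;
        float rho = fmaxf(sqrtf(sx * sx + tx * tx),
                          sqrtf(sy * sy + ty * ty));

        return log2f(rho);
    }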
Signed-off-by: Kenneth Graunke <[email protected]>
Reviewed-by: Eric Anholt <[email protected]>
|
| |
|
|
|
|
| |
Acked-by: Ian Romanick <[email protected]>
|
|
|
|
| |
Acked-by: Ian Romanick <[email protected]>
|
|
|
|
| |
Acked-by: Ian Romanick <[email protected]>
|