summaryrefslogtreecommitdiffstats
path: root/src/mesa
Commit message (Collapse)AuthorAgeFilesLines
* glsl: Add a new ir_unop_round_even opcode for GLSL 1.30's roundEven.Kenneth Graunke2010-10-141-0/+1
| | | | Also, update ir_to_mesa's "1.30 is unsupported" case to "handle" it.
* i965: Clean up a warning in the old fragment backend.Kenneth Graunke2010-10-141-1/+5
| | | | Hopefully this code can just go away soon.
* i965: Enable the new FS backend on pre-gen6 as well.Eric Anholt2010-10-141-12/+1
| | | | | | | | | | | It is now to the point where we have no regressing piglit tests. It also fixes Yo Frankie! and Humus DynamicBranching, probably due to the piglit bias tests that work that didn't on the Mesa IR backend. As a downside, performance takes about a 5-10% performance hit at the moment (e.g. nexuiz 19.8fps -> 18.8fps), which I plan to resolve by reintroducing 16-wide fragment shaders where possible. It is a win, though, for fragment shaders using flow control.
* i965: Correctly emit the RNDZ instruction.Kenneth Graunke2010-10-142-3/+31
| | | | | | | | | | | | | | Simply using RNDU, RNDZ, or RNDE does not produce the desired result. Rather, the RND* instructions place a value in the destination register that may be 1 less than the correct answer. They can also set per-channel "increment bits" in a flag register, which, if set, mean dest needs to be incremented by 1. A second instruction - a predicated add - completes the job. Notably, RNDD always produces the correct answer in a single instruction. Fixes piglit test glsl-fs-trunc.
* i965: Use RNDZ for ir_unop_trunc in the new FS.Kenneth Graunke2010-10-141-1/+1
| | | | The existing code used RNDD, which rounds down, rather than toward zero.
* i965: Use logical-not when emitting ir_unop_ceil.Kenneth Graunke2010-10-141-1/+1
| | | | Fixes piglit test glsl-fs-ceil.
* i965: Add peepholing of conditional mod generation from expressions.Eric Anholt2010-10-141-0/+88
| | | | | | | | | | This cuts usually 2 out of 3 instructions for flag reg generation (if statements, conditional assignment) by producing the conditional mod in the expression representing the boolean value. Fixes glsl-fs-vec4-indexing-temp-dst-in-nested-loop-combined (register allocation no longer fails for the conditional generation proliferation)
* i965: Add a function for handling the move of boolean values to flag regs.Eric Anholt2010-10-142-8/+19
| | | | | | This will be a place to peephole comparisions directly to the flag regs, and for now avoids using MOV with conditional mod on gen6, which is now illegal.
* Only install vtxfmt tables for OpenGLKristian Høgsberg2010-10-141-2/+4
| | | | | | GLES1 and GLES2 install their own exec pointers and don't need the Save table. Also, the SET_* macros use different indices for the different APIs so the offsets used in vtxfmt.c are actually wrong for the ES APIs.
* i965: Add a pass to the FS to split virtual GRFs to float channels.Eric Anholt2010-10-142-2/+116
| | | | Improves nexuiz performance 0.91% (+/- 0.54%, n=8)
* i965: Update the live interval when coalescing regs.Eric Anholt2010-10-141-0/+4
|
* i965: Set class_sizes[] for the aligned reg pair class.Eric Anholt2010-10-141-0/+1
| | | | So far, I've only seen this be a valgrind warning and not a real failure.
* Drop the "neutral" tnl moduleKristian Høgsberg2010-10-145-120/+9
| | | | | | | Just always check for FLUSH_UPDATE_CURRENT and call Driver.BeginVertices when necessary. By using the unlikely() macros, this ends up as a 10% performance improvement (for isosurf, anyway) over the old, complicated function pointer swapping.
* Revert "i965: fallback lineloop on sandybridge for now"Zhenyu Wang2010-10-141-7/+0
| | | | This reverts commit 73dab75b4165f7d2214a68d4ba8e3cb7aab9b4ac.
* i965: Fix GS hang on SandybridgeZhenyu Wang2010-10-144-14/+46
| | | | | | Don't use r0 for FF_SYNC dest reg on Sandybridge, which would smash FFID field in GS payload, that cause later URB write fail. Also not use r0 in any URB write requiring allocate.
* i965: Add support for rescaling GL_TEXTURE_RECTANGLE coords to new FS.Eric Anholt2010-10-131-5/+40
|
* mesa: Clean up various 'unused parameter' warnings in shaderapiIan Romanick2010-10-131-6/+14
|
* mesa: Clean up two 'comparison between signed and unsigned' warningsIan Romanick2010-10-131-2/+2
|
* mesa: Refactor validation of shader targetsIan Romanick2010-10-131-10/+33
| | | | | | | | | Actually validate that the implementation supports the particular shader target as well. Previously if a driver only supported vertex shaders, for example, glCreateShaderObjectARB would gladly create a fragment shader. NOTE: this is a candidate for the 7.9 branch.
* mesa: Silence unused variable warningIan Romanick2010-10-131-0/+1
|
* x11: fix breakage from gl_config::visualType removalBrian Paul2010-10-133-5/+6
|
* Drop GLcontext typedef and use struct gl_context insteadKristian Høgsberg2010-10-13780-5680/+5677
|
* Drop GLframebuffer typedef and just use struct gl_framebufferKristian Høgsberg2010-10-1353-127/+126
|
* Rename GLvisual and __GLcontextModes to struct gl_configKristian Høgsberg2010-10-1389-178/+165
|
* gl: Remove unused GLcontextModes fieldsKristian Høgsberg2010-10-138-48/+7
|
* Get rid of GL/internal/glcore.hKristian Høgsberg2010-10-137-10/+131
| | | | | | | __GLcontextModes is always only used as an implementation internal struct at this point and we shouldn't install glcore.h anymore. Anything that needs __GLcontextModes should just include the struct in its headers files directly.
* tdfx: Silence unused variable warning on non-debug builds.Vinson Lee2010-10-121-0/+1
| | | | | | Fixes this GCC warning. tdfx_texman.c: In function 'tdfxTMMoveOutTM_NoLock': tdfx_texman.c:897: warning: unused variable 'shared'
* r300: Silence uninitialized variable warning.Vinson Lee2010-10-121-0/+1
| | | | | | | Fixes this GCC warning. r300_state.c: In function 'r300InvalidateState': r300_state.c:2247: warning: 'hw_format' may be used uninitialized in this function r300_state.c:2247: note: 'hw_format' was declared here
* mesa: reformatting, comments, code movementBrian Paul2010-10-121-78/+99
|
* mesa: remove assertion w/ undeclared variable texelBytesBrian Paul2010-10-121-1/+0
|
* st/mesa: enable stencil shader export extension if supportedDave Airlie2010-10-131-0/+4
|
* glsl: add support for shader stencil exportDave Airlie2010-10-132-0/+2
| | | | | This adds proper support for the GL_ARB_shader_stencil_export extension to the GLSL compiler. Thanks to Ian for pointing out where I need to add things.
* st/mesa: use shader stencil export to accelerate shader drawpixels.Dave Airlie2010-10-134-57/+158
| | | | | | If the pipe driver has shader stencil export we can accelerate DrawPixels using it. It tries to pick an S8 texture and works its way to X24S8 and S8X24 if that isn't supported.
* st/mesa: add option to choose a texture format that we won't render to.Dave Airlie2010-10-133-8/+22
| | | | | | | We need a texture to put the drawpixels stuff into, an S8 texture is less memory/bandwidth than the 32-bit X24S8, but we might not be able to render directly to an S8, so this lets us specify we won't be rendering to this texture.
* mesa: improve texstore for 8/24 formats and add texstore for S8.Dave Airlie2010-10-131-119/+144
| | | | | | | | | | | | | this improves mesa texstore for 8/24 so it can create S24X8/X24S8 variants by keeping the depth bits static. it also adds a texstore for S8 so we can write out an S8 texture to use in the sampler for accel draw pixels to save memory bw. The logic seems sound here, I've worked it out a few times on paper, though it would be good to have some review. Signed-off-by: Dave Airlie <[email protected]>
* mesa: add support for FRAG_RESULT_STENCIL.Dave Airlie2010-10-131-2/+3
| | | | | | this is needed to add support for stencil shader export. Signed-off-by: Dave Airlie <[email protected]>
* i965: Don't rebase the index buffer to min 0 if any arrays are in VBOs.Eric Anholt2010-10-124-11/+15
| | | | | | | | | There was a check to only do the rebase if we didn't have everything in VBOs, but nexuiz apparently hands us a mix of VBOs and arrays, resulting in blocking on the GPU to do a rebase. Improves nexuiz 800x600, high-settings performance on my Ironlake 41% (+/- 1.3%), from 14.0fps to 19.7fps.
* intel: Allow CopyTexSubImage to InternalFormat 3/4 textures, like RGB/RGBA.Eric Anholt2010-10-121-0/+2
| | | | | | The format selection of the CopyTexSubImage is pretty bogus still, but this at least avoids software fallbacks in nexuiz, bringing performance from 7.5fps to 12.8fps on my machine.
* i965: Fix missing "break;" in i2b/f2b, and missing AND of CMP result.Eric Anholt2010-10-121-2/+3
| | | | Fixes glsl-fs-i2b.
* glsl: Fix incorrect assertionIan Romanick2010-10-121-1/+1
| | | | | | | This assertion was added in commit f1c1ee11, but it did not notice that the array is accessed with 'size-1' instead of 'size'. As a result, the assertion was off by one. This caused failures in at least glsl-orangebook-ch06-bump.
* mesa: Validate assembly shaders when GLSL shaders are usedIan Romanick2010-10-121-12/+40
| | | | | | | | | | If an GLSL shader is used that does not provide all stages and assembly shaders are provided for the missing stages, validate the assembly shaders. Fixes bugzilla #30787 and piglit tests glsl-invalid-asm0[12]. NOTE: this is a candidate for the 7.9 branch.
* ir_to_mesa: assorted clean-ups, const qualifiers, new commentsBrian Paul2010-10-121-14/+45
|
* nouveau: Get larger push buffers.Francisco Jerez2010-10-122-2/+2
| | | | | Useful to amortize the command submission/reloc overhead (e.g. etracer goes from 72 to 109 FPS on nv4b).
* dri/nouveau: Initialize tile_flags when allocating a render target.Francisco Jerez2010-10-122-6/+14
|
* i965: Always use the new FS backend on gen6.Eric Anholt2010-10-111-2/+7
| | | | | | | | | | It's now much more correct for gen6 than the old backend, with just 2 regressions I've found (one of which is common with pre-gen6 and will be fixed by an array splitting IR pass). This does leave the old Mesa IR backend getting used still when we don't have GLSL IR, but the plan is to get GLSL IR input to the driver for the ARB programs and fixed function by the next release.
* i965: Fix gen6 pixel_[xy] setup to avoid mixing int and float src operands.Eric Anholt2010-10-111-6/+15
| | | | | | | | | | | | | Pre-gen6, you could mix int and float just fine. Now, you get goofy results. Fixes: glsl-arb-fragment-coord-conventions glsl-fs-fragcoord glsl-fs-if-greater glsl-fs-if-greater-equal glsl-fs-if-less glsl-fs-if-less-equal
* i965: Don't compute-to-MRF in gen6 VS math.Eric Anholt2010-10-111-7/+15
| | | | | There was code to do this for pre-gen6 already, this just enables it for gen6 as well.
* i965: Expand uniform args to gen6 math to full registers to get hstride == 1.Eric Anholt2010-10-111-0/+25
| | | | | | | | | | This is a hw requirement in math args. This also is inefficient, as we're calculating the same result 8 times, but then we've been doing that on pre-gen6 as well. If we're doing math on uniforms, though, we'd probably be better served by having some sort of mechanism for precalculating those results into another uniform value to use. Fixes 7 piglit math tests.
* i965: Don't compute-to-MRF in gen6 math instructions.Eric Anholt2010-10-111-0/+16
|
* i965: Add a couple of checks for gen6 math instruction limits.Eric Anholt2010-10-111-0/+26
|