summaryrefslogtreecommitdiffstats
path: root/src
Commit message (Collapse)AuthorAgeFilesLines
* tgsi: remove set-but-unused variablesMarek Olšák2011-05-011-4/+0
|
* draw: remove set-but-unused variableMarek Olšák2011-05-011-2/+1
|
* rbug: remove set-but-unused variablesMarek Olšák2011-05-013-28/+0
|
* ir_to_mesa: remove set-but-unused variablesMarek Olšák2011-05-011-3/+2
|
* mesa: remove set-but-unused variables in texcompress_s3tcMarek Olšák2011-05-011-16/+0
|
* mesa: remove set-but-unused variable in bind_attrib_locationMarek Olšák2011-05-011-9/+1
|
* mesa: remove unused-but-set variable in extract_uint_rgbaMarek Olšák2011-05-011-3/+0
|
* r300/compiler: remove set-but-unused variablesMarek Olšák2011-05-012-5/+1
|
* r600g: remove set-but-unused variableMarek Olšák2011-05-011-4/+0
|
* r600g: remove FIXME commentMarek Olšák2011-05-011-3/+0
| | | | DONTBLOCK is correctly handled in r600_bo_map.
* mesa: GL_PROVOKING_VERTEX_EXT is a GLenum, not GLboolean.José Fonseca2011-04-301-1/+1
|
* r300/compiler: Cleanups from the regalloc mergeTom Stellard2011-04-303-8/+4
|
* r300/compiler: Add return statement to rc_list()Tom Stellard2011-04-301-0/+2
|
* r300g: Fix scons buildTom Stellard2011-04-301-0/+1
| | | | | Broken by the dependency on ralloc introduced by fe622bac0c1b5b9f2a9fcf9f35b51232a06bea42
* i915g: enable X-tiling for s3tc texturesDaniel Vetter2011-04-301-2/+1
| | | | | Tested-by: Christopher Egert <[email protected]> Signed-off-by: Daniel Vetter <[email protected]>
* r300/compiler: Enable swizzle packing in the allocator for r300 and r400Tom Stellard2011-04-306-41/+120
|
* r300/compiler: Handle loops in the register allocatorTom Stellard2011-04-304-19/+103
|
* r300/compiler: Handle loops in rc_get_readers()Tom Stellard2011-04-304-75/+150
|
* r300/compiler: Rewrite register allocatorTom Stellard2011-04-3019-236/+1548
| | | | | | | The new allocator uses ra and does swizzle packing. Also, a data structure (struct rc_variable) and associated functions have been added for generating UD and DU chains.
* ra: Add ra_set_node_reg()Tom Stellard2011-04-302-4/+25
| | | | | | | | This function can be used to avoid creating single register classes for input/payload registers. This makes optimistic coloring less likely to fail. Reviewed-by: Eric Anholt <[email protected]>
* r300/compiler: Use RC_MASK_W when converting RGB to Alpha instructionsTom Stellard2011-04-301-1/+1
|
* r300/compiler: rc_reader_data: Add ExitOnAbort flagTom Stellard2011-04-305-3/+10
|
* r300/compiler: Add more info to struct rc_readerTom Stellard2011-04-305-16/+41
| | | | | For pair instructions we need a reference to both the arg and source.
* r300/compiler: Add remove dead sources passTom Stellard2011-04-305-0/+66
| | | | | | | | The instruction scheduler will sometimes leave orphaned sources when converting instructions from RGB to Alpha. If one of these orphaned sources has an index greater than the maximum temporary register index, then the compiler will incorrectly report "Too many hardware temporaries used". The dead sources pass cleans up these orphaned sources.
* st/mesa: expose ARB_ES2_compatibility if GL_FIXED vertex format is supportedMarek Olšák2011-04-301-0/+6
| | | | | | Tested with softpipe and llvmpipe. Reviewed-by: Brian Paul <[email protected]>
* mesa: accept GL_FIXED in glVertexAttribPointer (ARB_ES2_compatibility)Marek Olšák2011-04-301-10/+13
| | | | | | | | GL_FIXED should not be accepted in the other gl*Pointer calls in OpenGL. There is a new piglit for this: arb_es2_compatibility-fixed-type. Reviewed-by: Brian Paul <[email protected]>
* mesa: fix error string, remove out of date commentBrian Paul2011-04-291-12/+1
|
* mesa: remove commented-out #includeBrian Paul2011-04-291-1/+0
|
* i965/gen6: Fix LogicOp handling for GL_COPY and/or floating-point RTs.Eric Anholt2011-04-291-5/+13
| | | | | | | | We were accidentally leaving blending enabled for LogicOp GL_COPY, which ARB_color_buffer_float/GL_RGBA32F-render (and friends) caught. Additionally, the GL spec says that no LogicOp should be done to floating-point targets, and the GPU gets really angry even if you say to LogicOp GL_COPY to float.
* i965: Remove dead entrypoints to state cache, rename the one that's left.Eric Anholt2011-04-298-157/+54
| | | | | | | | | | As we expanded the usage of the state cache, it grew extra functionality. However, with the recent state streaming rework, we're back to the state cache being used only for shader kernels, which is the piece of GPU state that's actually expensive to compute again from scratch, since it involves compiling. Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen4: Move the GS state to state streaming.Eric Anholt2011-04-295-98/+45
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen4: Move clip state to state streamingEric Anholt2011-04-294-106/+60
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965: Drop the now unused brw_cache_data() function.Eric Anholt2011-04-292-47/+0
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965: Fix up state dumping for SF state.Eric Anholt2011-04-291-2/+3
| | | | | | It was moved to state streaming a while back and this was left over. Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen4: Move VS state to state streaming.Eric Anholt2011-04-295-88/+53
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* mesa: Add a bunch of documentation to the register allocator.Eric Anholt2011-04-291-3/+65
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Use the dynamic state base address to reduce relocations.Eric Anholt2011-04-299-46/+55
| | | | | | | | | | Now that all the dynamic state is streamed through the top of the batchbuffer, we can cut out many of our relocations to that state by using the base address. Improves 3DMMES taiji performance 3.3% +/- 0.4% (n=15). Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Move scissor state to state streaming.Eric Anholt2011-04-297-52/+26
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965: Move sampler state to state streaming.Eric Anholt2011-04-299-238/+143
| | | | | | | | | | | Overall, across this series since the last set of numbers, gen6 3DMMES taiji performance has dropped 0.8% +/- 0.3% (n=15), probably due to the increased reissuing of state from some of the state objects that otherwise never changed, and increased occurrence of the per-batch overhead as we've increased how much we put in the batch BO without increasing the batch BO's size. Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen4: Move WM state to state streaming.Eric Anholt2011-04-295-192/+125
| | | | | | | The samplers are about to become streamed for gen6 performance, which would cause this unit to blow out the state cache. Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Move the depth/stencil state to state streaming.Eric Anholt2011-04-295-131/+50
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Move the blend state to state streaming.Eric Anholt2011-04-294-111/+40
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Move the color calc state to state streaming.Eric Anholt2011-04-294-78/+21
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965: Move the CC VP to state streaming.Eric Anholt2011-04-2910-99/+32
| | | | | | | | This is in a way a revert of f5bb775fd1f333d8e579d07a5cac1ded2bd54a2f. The tiny win that had will be overwhelmed by the win of using the gen6 dynamic state base address. Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Move the SF VP to state streaming.Eric Anholt2011-04-295-30/+20
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Move the clip VP to state streaming.Eric Anholt2011-04-294-21/+22
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Stream the VS push constants.Eric Anholt2011-04-295-35/+62
| | | | | | Improves 3DMMES taiji demo performance by 10.1% +/- 0.9% (n=15). Reviewed-by: Kenneth Graunke <[email protected]>
* i965/gen6: Stream the WM push constants.Eric Anholt2011-04-294-22/+19
| | | | | | | Improves 3DMMES taiji demo performance by 5.1% +/- 1.9% (n=15), by reducing CPU time spent thrashing around those tiny little constant BOs. Reviewed-by: Kenneth Graunke <[email protected]>
* i965: Give the push constants and pull constants unique function names.Eric Anholt2011-04-292-4/+4
| | | | | | This helps clarify profiling results. Reviewed-by: Kenneth Graunke <[email protected]>
* i965: Fix fragcoord_w on gen6 with 16-wide.Eric Anholt2011-04-291-5/+5
| | | | | | | | | | | | The payload regs can go all the way up to register 60+, so just give them 8 bits to be addressed by instead of 3-4 (which made source_w_reg of 8 end up 0). There's no reason to aggressively pack these fields, as they are just used as compiler information, where being easier to access is probably more important than shaving a byte or two off of the structure. Fixes piglit fragcoord_w. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=36649