summaryrefslogtreecommitdiffstats
path: root/src
Commit message (Collapse)AuthorAgeFilesLines
* nv50/ir: temporarily exclude nv50 code emitter from buildChristoph Bumiller2012-04-141-1/+0
| | | | | It's not used yet and shouldn't have been included in the first place.
* nv50/ir: copy value size in SSA-rename passChristoph Bumiller2012-04-141-0/+1
|
* nv50/ir/opt: improve post-multiply and check target for supportChristoph Bumiller2012-04-144-35/+90
|
* nv50/ir: add setFlagsDef/Src helperChristoph Bumiller2012-04-144-3/+29
| | | | Will be used by nv50 target.
* nv50/ir: add isAccessSupported check for memory access coalescingChristoph Bumiller2012-04-146-5/+19
|
* nv50/ir: add function for splitting a BasicBlockChristoph Bumiller2012-04-143-9/+78
| | | | Fixes to initial implementation by Francisco Jerez.
* nv50/ir: Allow attaching two nodes when either one is already inside the graph.Francisco Jerez2012-04-141-5/+5
|
* nv50/ir: Allow inserting isolated nodes to a graph.Francisco Jerez2012-04-141-6/+4
|
* nv50/ir: Fix memory corruption in Function::orderInstructions().Francisco Jerez2012-04-141-3/+7
| | | | | "iter" doesn't reference a BasicBlock directly, but a Node::Graph, i.e. BasicBlock::get() is casting to the wrong pointer type.
* nv50/ir: Fix up insertion of PHI instructions using bb->insertHead().Francisco Jerez2012-04-141-1/+1
|
* nv50/ir: fix insertHead and remove for BBs with PHI ops onlyChristoph Bumiller2012-04-141-3/+10
|
* nv50/ir: Don't crash on zero sized BitSets.Francisco Jerez2012-04-141-0/+1
|
* nv50/ir: Fix Interval::clear().Francisco Jerez2012-04-141-0/+1
|
* nv50/ir/tgsi: handle inferSrcType(NOT) to be u32Christoph Bumiller2012-04-141-0/+1
|
* nv50/ir/opt: Fix OP_NOT to modifier conversion.Francisco Jerez2012-04-141-1/+1
|
* r600g: disable dual-src hangs evergreen for some reason.Dave Airlie2012-04-141-1/+1
| | | | | | This did work previously, so I've broken something. Signed-off-by: Dave Airlie <[email protected]>
* r300/compiler: Exit immediately from rc_vert_fc() if there is an errorTom Stellard2012-04-141-0/+4
| | | | | | | | This way we correctly report "Too many temporaries" errors. https://bugs.freedesktop.org/show_bug.cgi?id=48680 Note: This is a candidate for the stable branches.
* r300/compiler: Copy all instruction attributes during local transfomsTom Stellard2012-04-141-31/+42
| | | | | | | | | | | | | | | | | | | | | Instruction attributes like WriteALUResult and ALUResultCompare were being discarded during the some of the local transformations. This fixes the following piglit tests: glsl1-inequality (vec2, pass) loopfunc fs-any-bvec2-using-if fs-op-ne-bvec2-bvec2-using-if fs-op-ne-ivec2-ivec2-using-if fs-op-ne-mat2-mat2-using-if fs-op-ne-vec2-vec2-using-if fs-op-ne-mat2x3-mat2x3-using-if fs-op-ne-mat2x4-mat2x4-using-if https://bugs.freedesktop.org/show_bug.cgi?id=45921 NOTE: This is a candidate for the stable branches.
* r300/compiler: Fix nested flow control in r500 vertex shadersTom Stellard2012-04-1311-178/+438
|
* r300/compiler: Clear loop registers in vertex shaders w/o loopsTom Stellard2012-04-132-16/+13
| | | | | | | | | | | | | | The loop registers weren't being cleared, so any shader that was executed after a shader containing loops was at risk of having a loop randomly inserted into it. This fixes over one hundred piglit tests, although these test only failed during full piglit runs and would pass if run individually. The exact number of piglit tests that this patch fixes will vary depending on the version of piglit and the order the tests are run. NOTE: This is a candidate for the stable branches.
* glsl: If an "if" has no "then" or "else" code left, remove it.Eric Anholt2012-04-131-0/+8
| | | | | | | Cuts 8/1068 instructions from glyphy's fragment shaders on i965. Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
* glsl: Add a helper for generating temporary variables in ir_builder.Eric Anholt2012-04-133-31/+23
| | | | Reviewed-by: Kenneth Graunke <[email protected]>
* glsl: Add a helper for ir_builder to make dereferences for assignments.Eric Anholt2012-04-133-49/+59
| | | | | | v2: Fix writemask setup for non-vec4 assignments. Reviewed-by: Kenneth Graunke <[email protected]>
* glsl: Make a little tracking class for emitting IR lists.Eric Anholt2012-04-133-50/+42
| | | | | | | This lets us significantly shorten p->instructions->push_tail(ir), and will be used in a few more places. Reviewed-by: Kenneth Graunke <[email protected]>
* glsl: Add common swizzles to ir_builder.Eric Anholt2012-04-133-48/+110
| | | | | | | | | | Now we can fold a bunch of our expression setup in ff_fragment_shader into single-line, parseable commits. v2: Make it actually work. I wasn't setting num_components in the mask structure, and not setting up a mask structure is way easier. Reviewed-by: Kenneth Graunke <[email protected]>
* glsl: Let ir_builder expressions take un-dereferenced variables.Eric Anholt2012-04-133-29/+46
| | | | | | | | | | | Having to explicitly dereference is irritating and bloats the code, when the compiler can detect and do the right thing. v2: Use a little shim class to produce the automatic dereference generation at compile time as opposed to runtime, while also allowing compile-time type checking. Reviewed-by: Kenneth Graunke <[email protected]>
* glsl: Create an ir_builder helper for hand-generating IR.Eric Anholt2012-04-134-82/+151
| | | | | | | | | | | | | | The C++ constructors with placement new, while functional, are extremely verbose, leading to generation of simple GLSL IR expressions like (a * b + c * d) expanding to many lines of code and using lots of temporary variables. By creating a new ir_builder.h that puts simple generators in our namespace and taking advantage of ralloc_parent(), we can generate much more compact code, at a minor runtime cost. v2: Replace ir_instruction usage with just ir_rvalue. v3: Drop remaining missed as_rvalue() in v2. Reviewed-by: Kenneth Graunke <[email protected]>
* nv50,nvc0: fix handling of user vbufs with stride < access sizeChristoph Bumiller2012-04-146-35/+63
|
* nvc0: prefix all macro methods with MACROChristoph Bumiller2012-04-146-35/+36
| | | | Some of them have non-macro counterparts.
* nvc0: replace VERTEX_DATA push mode with translate to bufferChristoph Bumiller2012-04-145-57/+748
| | | | | While pushing vertices through the FIFO is relatively fast on nv50, it's horribly slow on nvc0.
* nvc0: improve vertex state validationChristoph Bumiller2012-04-146-97/+159
| | | | Now updating vertex attribute format only when necessary.
* nvc0: track texture dirty state individuallyChristoph Bumiller2012-04-145-40/+81
|
* nv50,nvc0: use new scratch buffers codeChristoph Bumiller2012-04-148-84/+62
|
* nouveau: add new shared scratch buffersChristoph Bumiller2012-04-143-0/+185
|
* nvc0: only force early fragment tests if requested by shaderChristoph Bumiller2012-04-145-14/+7
|
* nv50,nvc0: hold references to the framebuffer surfacesChristoph Bumiller2012-04-144-2/+32
|
* r300g: align vertex buffer suballocations to 4Marek Olšák2012-04-131-1/+1
|
* u_blitter: align vertex buffer suballocations to 4Marek Olšák2012-04-131-1/+1
|
* xorg/nouveau: switch to libdrm_nouveau-2.0Marcin Slusarz2012-04-131-3/+3
|
* targets/{egl-static,gbm}: further clean-up the nvfx remainsMartin Peres2012-04-142-2/+2
|
* nvc0: remove include of old libdrm_nouveau's nouveau_reloc.hChristoph Bumiller2012-04-141-2/+0
|
* nv50,nvc0: handle PIPE_CAP_MAX_DUAL_SOURCE_RENDER_TARGETSChristoph Bumiller2012-04-142-0/+4
|
* nv30: s/DUAL_SOURCE_BLEND/MAX_DUAL_SOURCE_RENDER_TARGETSChristoph Bumiller2012-04-141-1/+1
| | | | Merge accident.
* nv30: import new driver for GeForce FX/6/7 chipsets, and Quadro variantsBen Skeggs2012-04-1444-7/+13724
| | | | | | | | | | | | | | | | | | The primary motivation for this rewrite was to have a maintainable driver going forward, as nvfx was quite horrible in a lot of ways. The driver is heavily based on the design of the nv50/nvc0 3d drivers we already have, and uses the same common buffer/fence code. It also passes a HEAP more piglit tests than nvfx did, supports a couple more features, and a few more to come still probably. The CPU footprint of this driver is far far less than nvfx, and translates into far greater framerates in a lot of applications (unless you're using a CPU that's way way newer than the GPUs of these generations....) Basically, we once again have a maintained driver for these chipsets \o/ Feel free to report bugs now!
* nouveau: switch to libdrm_nouveau-2.0Christoph Bumiller2012-04-1457-3433/+3714
|
* nvc0: remove obsolete nvc0_push2.cChristoph Bumiller2012-04-142-334/+0
| | | | | Slower version of nvc0_push.c, was only used to ascertain that bugs were not the new version's fault.
* nouveau: remove automatic buffer migration heuristicsChristoph Bumiller2012-04-144-36/+0
|
* nvfx: completely remove this driver (GeForce FX/6/7)Ben Skeggs2012-04-1446-14900/+5
| | | | | | | | | | This driver hasn't been maintained properly for a very long time, and for many very good reasons. It's horrible. A new driver supporting these chipsets will appear with the commits that port vieux/nv50/nvc0 to libdrm_nouveau-2.0. Signed-off-by: Ben Skeggs <[email protected]>
* nouveau: rework and simplify nv04/nv05 driver a bitBen Skeggs2012-04-148-300/+215
| | | | | | | | | | | | | | | TEXTURED_TRIANGLE and MULTITEX_TRIANGLE are both a bit special in that if you use any other graph object in the meantime they'll forget their state and spew a lovely METHOD_CNT error at you when you try to draw. The pre-newlib driver has a flush_notify() hook which does this state re-emit, and a number of random workarounds like extra flushes and state dirtying after various operations to solve this issue. I'm taking a slightly different approach to things instead, which has the nice side-effect of removing the divergent code-paths for ttri/mtri, the flush/dirty workarounds and the need for flush_notify. Also gives a few FPS boost in OA, yay.
* nouveau/vieux: switch to libdrm_nouveau-2.0Ben Skeggs2012-04-1446-1890/+1844
|