Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | nv50: handle VP without inputs | Marcin Slusarz | 2012-05-07 | 1 | -0/+11 |
| | |||||
* | nv50/ir/opt: don't lose saturation in tryCollapseChainedMULs | Christoph Bumiller | 2012-05-06 | 1 | -2/+3 |
| | |||||
* | nvc0/ir: fix lowering of textureGrad | Christoph Bumiller | 2012-05-06 | 1 | -4/+4 |
| | |||||
* | nv50: fix typo causing NULL-deref in nv50_resource_resolve | Christoph Bumiller | 2012-05-04 | 1 | -1/+1 |
| | Introduced in b328949a37fee7b0f68ed3e068ffc4426c083042. | ||||
* | nv50/ir: move expansion of IMUL to later stage and handle memory operands | Christoph Bumiller | 2012-05-04 | 4 | -17/+51 |
| | |||||
* | nv50: implement stream output | Christoph Bumiller | 2012-05-04 | 12 | -33/+468 |
| | |||||
* | nv50: enable array textures | Christoph Bumiller | 2012-05-04 | 2 | -3/+4 |
| | |||||
* | nv50,nvc0: fix depth/stencil resolve | Christoph Bumiller | 2012-04-29 | 4 | -26/+65 |
| | Cannot sample depth/stencil with a single view, and needed to use different shader code for nve4. | ||||
* | nvc0/ir/opt: INTERP does not support JOIN | Christoph Bumiller | 2012-04-29 | 1 | -0/+2 |
| | |||||
* | nv50/ir/opt: try to convert ABS(SUB) to SAD | Christoph Bumiller | 2012-04-29 | 5 | -15/+162 |
| | |||||
* | nvc0/ir: initial implementation of nve4 scheduling hints | Christoph Bumiller | 2012-04-29 | 5 | -4/+141 |
| | |||||
* | nvc0/ir: implement better placement of texture barriers | Christoph Bumiller | 2012-04-29 | 7 | -6/+58 |
| | Put them before first uses instead of right after the texturing instruction and cull unnecessary barriers. | ||||
* | nv50/ir/tgsi: fix handling of early RET | Christoph Bumiller | 2012-04-29 | 1 | -4/+5 |
| | We have to actually emit RET, too, of course, not just the PRERET. | ||||
* | nv50: fix typo in nv50_fragprog_assign_slots | Christoph Bumiller | 2012-04-24 | 1 | -1/+1 |
| | |||||
* | nv50,nvc0: handle new vertex fetch caps | Christoph Bumiller | 2012-04-24 | 2 | -0/+9 |
| | |||||
* | nv50,nvc0: don't initialize the draw module, we don't use it | Christoph Bumiller | 2012-04-24 | 3 | -2/+13 |
| | But some day we might (e.g. for blending 16 bpp formats on nv50). | ||||
* | nv50: enable shader watchdog timer | Christoph Bumiller | 2012-04-24 | 1 | -0/+5 |
| | |||||
* | gallium drivers: report that user vertex buffers are supported | Marek Olšák | 2012-04-24 | 1 | -0/+2 |
| | |||||
* | nv50,nvc0: prevent multiple flushes when user spins on get_query_result | Christoph Bumiller | 2012-04-19 | 1 | -1/+6 |
| | |||||
* | nv50/ir/opt: swap VP inputs to first source where possible | Christoph Bumiller | 2012-04-19 | 1 | -0/+17 |
| | |||||
* | nv50: specify location of UCPs to code generator | Christoph Bumiller | 2012-04-17 | 1 | -0/+2 |
| | This was made configurable in e44089b2f79aa2dcaacf348911433d1e21235c0c for Kepler, but nv50 was not updated to match. | ||||
* | nv50: use correct semantic map value for undefined PointSize output | Christoph Bumiller | 2012-04-15 | 1 | -7/+8 |
| | |||||
* | nvc0: add initial support for nve4+ (Kepler) chipsets | Christoph Bumiller | 2012-04-15 | 9 | -9/+26 |
| | Most things that work on Fermi should work on Kepler too. There are a few performance optimizations left to do, like better placement of texture barriers and adding scheduling data to the shader instructions (without them, a thread group will be masked for 32 cycles after each single instruction issue). | ||||
* | nv50/ir/opt: extend handleCVT for nv50's SET u32 to f32 chain | Christoph Bumiller | 2012-04-14 | 1 | -1/+17 |
| | |||||
* | nv50/ir: print interpolation mode | Christoph Bumiller | 2012-04-14 | 1 | -0/+22 |
| | |||||
* | nv50: update FP linkage state on one/two-sided lighting switch | Christoph Bumiller | 2012-04-14 | 2 | -1/+12 |
| | |||||
* | nv50: delete old shader compiler files | Christoph Bumiller | 2012-04-14 | 7 | -7252/+0 |
| | |||||
* | nv50: hook up to new shader code generator | Christoph Bumiller | 2012-04-14 | 10 | -664/+262 |
| | |||||
* | nv50/ir: import nv50 target | Christoph Bumiller | 2012-04-14 | 12 | -219/+2475 |
| | |||||
* | nv50/ir: fix off-by-ones in CSE and nvc0 insnCanLoad | Christoph Bumiller | 2012-04-14 | 1 | -1/+1 |
| | |||||
* | nv50/ir/tgsi: generate UCPs with actual outputs instead of SVs | Christoph Bumiller | 2012-04-14 | 1 | -4/+20 |
| | gl_ClipDistance is treated the same way; this is just nicer and makes it easier to assign slots for them on nv50. | ||||
* | nv50/ir: Fix type of the instruction created by mkCmp() for dst in FILE_FLAGS. | Francisco Jerez | 2012-04-14 | 1 | -1/+2 |
| | |||||
* | nv50/ir: fix Instruction::isCommutationLegal for WAW | Christoph Bumiller | 2012-04-14 | 1 | -4/+14 |
| | |||||
* | nv50/ir/opt: Add isOptSupported() check in logical arith optimization. | Francisco Jerez | 2012-04-14 | 1 | -8/+5 |
| | |||||
* | nv50/ir/ra: Fix live set propagation in the secondary passes of buildLiveSets(). | Francisco Jerez | 2012-04-14 | 1 | -3/+3 |
| | |||||
* | nv50/ir/opt: don't regard OP_WRSV as dead code | Christoph Bumiller | 2012-04-14 | 1 | -1/+2 |
| | |||||
* | nv50/ir: add isUniform query to Values | Christoph Bumiller | 2012-04-14 | 2 | -0/+24 |
| | |||||
* | nv50/ir: rewrite the register allocator as GCRA, with spilling | Christoph Bumiller | 2012-04-14 | 10 | -414/+1473 |
| | This is more flexible than the linear scan, and we don't need the separate allocation pass for constrained values anymore. | ||||
* | nv50/ir/tgsi: only export x-component of PSIZE | Christoph Bumiller | 2012-04-14 | 1 | -1/+5 |
| | |||||
* | nv50/ir: Fix BuildUtil::mkSelect and mkClobber | Francisco Jerez | 2012-04-14 | 1 | -6/+2 |
| | |||||
* | nv50/ir: fix reg file conflicts with undefined-value placeholders | Christoph Bumiller | 2012-04-14 | 1 | -10/+19 |
| | |||||
* | nv50/ir/opt: silence warning (int < Elements() signedness) | Christoph Bumiller | 2012-04-14 | 1 | -1/+1 |
| | |||||
* | nv50/ir/opt: fix combineSt access to wrong instruction | Christoph Bumiller | 2012-04-14 | 1 | -1/+1 |
| | |||||
* | nv50/ir/opt: another insn NULL check in phi elimination | Christoph Bumiller | 2012-04-14 | 1 | -0/+2 |
| | |||||
* | nv50/ir/ssa: Take into account function inputs and outputs. | Francisco Jerez | 2012-04-14 | 1 | -2/+30 |
| | |||||
* | nv50/ir: Clean up before calculating instruction ordering for a new function. | Francisco Jerez | 2012-04-14 | 2 | -0/+16 |
| | |||||
* | nv50/ir/ra: Allocate registers for function arguments. | Francisco Jerez | 2012-04-14 | 1 | -0/+6 |
| | |||||
* | nv50/ir: Take into account function args in the live range calculation code. | Francisco Jerez | 2012-04-14 | 2 | -3/+28 |
| | |||||
* | nv50/ir/ra: Use matching physical regs for function args in caller and callee. | Francisco Jerez | 2012-04-14 | 1 | -6/+83 |
| | |||||
* | nv50/ir/tgsi: Infer function inputs/outputs. | Francisco Jerez | 2012-04-14 | 2 | -0/+87 |
| | Edit: Don't do it for the main function of (graphics) shaders; its inputs and outputs always go through TGSI_FILE_INPUT/OUTPUT. This prevents all TEMPs from counting as live out and reduces register pressure. | ||||