summaryrefslogtreecommitdiffstats
path: root/src/gallium/drivers
Commit message (Collapse)AuthorAgeFilesLines
* trace/rbug: Add new contexts functions to trace rbugJakob Bornecrantz2009-06-043-1/+236
|
* softpipe: separate case for PIPE_PRIM_POLYGON in sp_vbuf_draw()Brian Paul2009-06-031-2/+12
| | | | | Because of flat shading, we can't use same code as PIPE_PRIM_TRIANGLE_FAN. This is a follow-on to commit a59575d8fbe8b0ca053cc8366ce7a42bc660158a.
* softpipe: fix incorrect tri vertex order for PIPE_PRIM_POLYGON renderingBrian Paul2009-06-031-1/+1
| | | | This fixes incorrect front/back-face orientation.
* r300-gallium: strip swtcl to the bare minimumJoakim Sindholt2009-06-021-34/+22
| | | | | | This was originally taken from i915 and it shows. Basically most the stuff in r300_render.c was never needed and shouldn't have worked in the first place
* r300-gallium: Slightly hacky fix for glxgears-style TCL.Corbin Simpson2009-06-011-0/+7
|
* trace/rbug: Add rbug integration for remote debuggingJakob Bornecrantz2009-06-016-6/+685
|
* Merge branch 'mesa_7_5_branch'Brian Paul2009-05-301-1/+15
|\
| * softpipe: fix incorrect provoking vertex color for PIPE_PRIM_POLYGONBrian Paul2009-05-301-1/+15
| | | | | | | | | | | | | | This fixes the incorrect colors seen when rendering flat-shaded polygons. Note that clipped polygons were correct, but unclipped polygons were wrong. See the glean/clipFlat test for regression testing.
* | Revert "softpipe: fix flat shading provoking vertex for PIPE_PRIM_POLYGON"Brian Paul2009-05-303-6/+1
| | | | | | | | | | | | | | This reverts commit 5d75124db480b37977c353511b4e228905b7cc95. This fixed unclipped polygons, but broke clipped polygons. A better fix from the mesa 7.5 branch will be merged next...
* | softpipe: update comments for max texture sizeBrian Paul2009-05-291-2/+2
| |
* | softpipe: increase max 2D/cube texture size to 4K x 4KBrian Paul2009-05-291-2/+2
| |
* | r300-gallium, radeon-gallium: Make add_buffer indicate when a flush is needed.Corbin Simpson2009-05-283-18/+39
| | | | | | | | On a side note, why is RADEON_MAX_BOS 24? Should ask airlied about that.
* | nv50: some mipmapping fixesBen Skeggs2009-05-284-22/+31
| |
* | nv50: negate sources directly where supportedChristoph Bumiller2009-05-281-42/+68
| |
* | nv50: introduce emit_cvt and use itChristoph Bumiller2009-05-281-40/+48
| | | | | | | | | | This makes some code cleaner, and we can now easily do CEIL and TRUNC.
* | nv50: fix TXPChristoph Bumiller2009-05-281-23/+112
| | | | | | | | | | | | | | | | For TXP we need to divide texture coords by their w component, or use the coords' 1/w in the perspective interpolation instruction. This also tries to support 1D, 3D and CUBE textures, and lets the instruction only load the components that are used.
* | nv50: use multiple constant buffersChristoph Bumiller2009-05-284-48/+105
| | | | | | | | | | | | | | Use different buffers for immds, FP params, and VP params. One has to map constant buffer indices in shader code to buffers defined via CB_DEF. In principle, we could use more buffers so we'd have to change the shader code less frequently.
* | nv50: don't look for unfreed temps in free_nv50_pcChristoph Bumiller2009-05-281-8/+0
| | | | | | | | | | Since we stopped using alloc_temp to get hw indices for FP attrs there shouldn't be any non-deallocated temps left.
* | nv50: release hw TEMPs earlyChristoph Bumiller2009-05-281-0/+19
| | | | | | | | | | Since we know when we don't use a TEMP or FP ATTR register anymore, we can release their hw resources early.
* | nv50: allow immediates for MOV, ADD and MULChristoph Bumiller2009-05-281-5/+22
| | | | | | | | | | | | Immediates are inlined now where possible, so we need to set pc->allow32 to FALSE in LIT where we have the conditional MOV, since immediates swallow the predicate bits.
* | nv50: enable half insns for MOV and MULChristoph Bumiller2009-05-281-7/+12
| |
* | nv50: make sure half-long insns are pairedChristoph Bumiller2009-05-281-0/+72
| | | | | | | | | | | | | | I chose to just convert unpaired 32 bit length instructions after parsing all instructions, although it might be possible to determine beforehand whether there would be any lone ones, and then even do some swapping to bring them together ...
* | nv50: enable KIL in register 19a8Christoph Bumiller2009-05-281-0/+1
| |
* | nv50: don't overwrite sources before they're usedChristoph Bumiller2009-05-281-12/+83
| | | | | | | | | | This would have happened in p.e. ADD TEMP[0], TEMP[0].xyxy, TEMP[1] or RCP/RSQ TEMP[i], TEMP[i].
* | nv50: put FP outputs where they belongChristoph Bumiller2009-05-281-4/+37
| | | | | | | | | | Depth output in fragment programs should end up in the first register after the color outputs.
* | nv50: modified FP attribute loadingChristoph Bumiller2009-05-282-45/+147
| | | | | | | | | | | | | | | | VP outputs that should be loadable in the FP are mapped to interpolant indices by HPOS, COL0 etc.; of course HPOS is always written, so the highest byte of 1988 is a bitmask that selects which components of HPOS are used for interpolants, i.e. the FP inputs in COL0 start at index POPCNT(1988[24:28]).
* | nv50: inspect decl semantic and interpolation modeChristoph Bumiller2009-05-281-1/+74
| | | | | | | | | | | | Record interpolation mode for attributes while parsing declarations, and also remember the indices of FP color inputs and FP depth output, which has to end up in the highest output register.
* | nv50: record last access to temp and attr regsChristoph Bumiller2009-05-281-0/+127
| | | | | | | | | | | | | | We now inspect the TGSI instructions in tx_prep to determine where temps and FP attrs are last accessed. This will enable us to reclaim some temporaries early and we also use it to omit pre-loading FP attributes that aren't used.
* | nv50: save some space in immediate bufferChristoph Bumiller2009-05-281-1/+7
| | | | | | | | | | We could do even better (like just allocating 1 value in alloc_immd), but that's fine for now I guess.
* | nv50: fix SIGN_SET case in tgsi_srcChristoph Bumiller2009-05-281-1/+1
| |
* | nv50: set dst.z,w to 0,1 in SCS and XPDChristoph Bumiller2009-05-281-0/+14
| | | | | | | | | | According to tgsi-instruction-set.txt, if they are written, z and w should be set to 0 and 1 respectively in SCS, and w to 1.0 in XPD.
* | nv50: make LRP instruction nicerChristoph Bumiller2009-05-281-6/+3
| |
* | nv50: fix some memory leaks in shader assemblerChristoph Bumiller2009-05-281-25/+63
| |
* | softpipe: commentsBrian Paul2009-05-271-0/+2
| |
* | softpipe: include sp_winsys.h to silence warning (unprototyped function)Brian Paul2009-05-271-0/+1
| |
* | softpipe: fix flat shading provoking vertex for PIPE_PRIM_POLYGONBrian Paul2009-05-273-1/+6
| | | | | | | | Use the first vertex, not the last.
* | cell: perform triangle cull a little earlierJonathan Adamczewski2009-05-211-31/+74
| | | | | | | | | | | | | | | | | | | | In spu_tri.c:setup_sort_vertices() triangles are culled after the vertices are sorted. This patch moves the check a little earlier and performs the actual check a little faster through intrinsics and a little trickery. Reduced code size and less work is done before a triangle is deemed OK to skip.
* | cell: unroll inner loop of spu_render.c:cmd_render()Jonathan Adamczewski2009-05-213-32/+89
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | It was taking approximately 50 cycles to extract the vertex indices, calculate the vertex_header pointers and call tri_draw() for each three vertices - . Unrolled, it takes less than 100 cycles to extract, unpack, calculate pointers and call tri_draw() eight times. It does have a nasty jump-tabled switch. I'm sure that there's a better way... Code size of spu_render.o gets larger due to the extra constants and work in the inner loop, there are extra stack saves and loads because there are more registers in use, and an assert. spu_tri.o gets a little smaller.
* | r300-gallium: r500-fs: POW.Corbin Simpson2009-05-201-1/+27
| | | | | | | | I feel so unclean.
* | r300-gallium: r500-fs: LRP.Corbin Simpson2009-05-201-4/+32
| | | | | | | | Goddammit. This cannot be the "easy way." :C
* | r300-gallium: r500-fs: Combine function.Corbin Simpson2009-05-201-15/+6
| |
* | r300-gallium: Prevent assert when fogcoords are present.Corbin Simpson2009-05-202-5/+14
| | | | | | | | Seems like this file is the source of all bad logic. (Pun intended.)
* | r300-gallium: Another constantbuf shader recompile test.Corbin Simpson2009-05-204-2/+14
| | | | | | | | | | | | | | Less briefly... Shaders need to be recompiled if their constantbuf offsets have changed. However, since we only change them from shaders if immediates need to be emitted, we shouldn't bother if the shader doesn't use immediates.
* | r300-gallium: Raise constantbuf limits.Corbin Simpson2009-05-201-3/+3
| | | | | | | | Still not correct, but really I don't care.
* | r300-gallium: fs: Remove cruft from way back when.Corbin Simpson2009-05-201-18/+0
| |
* | r300-gallium: Make surface_copy actually load the texture in shader.Corbin Simpson2009-05-203-3/+4
| |
* | r300-gallium: Add missing R481 PCI ID.Corbin Simpson2009-05-201-0/+1
| | | | | | | | Per 74cb2aba on xf86-video-ati.
* | r300-gallium: Make surface_copy work, and refactor buffer validation.Corbin Simpson2009-05-202-16/+47
| |
* | radeon-gallium: Don't permit reading and writing a BO in one CS.Corbin Simpson2009-05-201-2/+3
| | | | | | | | | | | | This fixes some silent problems in current libdrm_radeon. surface_copy still locks up hard.
* | trace: Improve shader wrappingJakob Bornecrantz2009-05-183-4/+29
| |