Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | gallivm: Remove unnecessary header. | Vinson Lee | 2010-10-08 | 1 | -1/+0 |
| | |||||
* | gallivm: Help for combined extraction and broadcasting. | José Fonseca | 2010-10-08 | 3 | -15/+102 |
| | | | | Doesn't change generated code quality, but saves some typing. | ||||
* | llvmpipe: First minify the texture size, then broadcast. | José Fonseca | 2010-10-08 | 3 | -32/+42 |
| | |||||
* | gallivm: Move into the as much of the second level code as possible. | José Fonseca | 2010-10-08 | 5 | -219/+120 |
| | | | | | Also, pass more stuff trhough the sample build context, instead of arguments. | ||||
* | gallivm: Warn when doing inefficient integer comparisons. | José Fonseca | 2010-10-08 | 1 | -2/+18 |
| | |||||
* | gallivm: round rather than truncate in new 4x4f->1x16ub conversion path | Keith Whitwell | 2010-10-08 | 1 | -17/+42 |
| | |||||
* | gallivm: Use the wrappers for SSE pack intrinsics. | José Fonseca | 2010-10-08 | 1 | -15/+3 |
| | | | | Fixes assertion failures on LLVM 2.6. | ||||
* | gallivm: special case conversion 4x4f to 1x16ub | Keith Whitwell | 2010-10-08 | 1 | -0/+84 |
| | | | | | Nice reduction in the number of operations required for final color output in many shaders. | ||||
* | gallivm: Implement brilinear filtering. | José Fonseca | 2010-10-08 | 1 | -1/+89 |
| | |||||
* | gallivm: Fix copy'n'paste typo in previous commit. | José Fonseca | 2010-10-08 | 1 | -2/+2 |
| | |||||
* | gallivm: Clamp mipmap level and zero mip weight simultaneously. | José Fonseca | 2010-10-08 | 4 | -14/+52 |
| | |||||
* | gallivm: Use lp_build_ifloor_fract for lod computation. | José Fonseca | 2010-10-08 | 1 | -5/+3 |
| | | | | Forgot this one before. | ||||
* | gallivm: Don't compute the second mipmap level when frac(lod) == 0 | José Fonseca | 2010-10-08 | 2 | -139/+175 |
| | |||||
* | gallivm: Simplify lp_build_mipmap_level_sizes' interface. | José Fonseca | 2010-10-08 | 4 | -67/+42 |
| | |||||
* | gallivm: Do not do mipfiltering when magnifying. | José Fonseca | 2010-10-08 | 2 | -16/+16 |
| | | | | If lod < 0, then invariably follows that ilevel0 == ilevel1 == 0. | ||||
* | gallivm: Vectorize the rho computation. | José Fonseca | 2010-10-07 | 3 | -37/+92 |
| | |||||
* | util: Cleanup util_pack_z_stencil and friends. | José Fonseca | 2010-10-06 | 1 | -22/+28 |
| | | | | | | | | | | | - Handle PIPE_FORMAT_Z32_FLOAT packing correctly. - In the integer version z shouldn't be passed as as double. - Make it clear that the integer versions should only be used for masks. - Make integer type sizes explicit (uint32_t for now, although uint64_t will be necessary later to encode f32_s8_x24). | ||||
* | gallivm: Compute lod as integer whenever possible. | José Fonseca | 2010-10-06 | 4 | -95/+158 |
| | | | | | More accurate/faster results for PIPE_TEX_MIPFILTER_NEAREST. Less FP <-> SI conversion overall. | ||||
* | gallivm: Only apply min/max_lod when necessary. | José Fonseca | 2010-10-06 | 2 | -15/+38 |
| | |||||
* | gallivm: don't apply zero lod_bias | Keith Whitwell | 2010-10-06 | 2 | -1/+7 |
| | |||||
* | gallivm: Combined ifloor & fract helper. | José Fonseca | 2010-10-06 | 4 | -28/+65 |
| | | | | The only way to ensure we don't do redundant FP <-> SI conversions. | ||||
* | gallivm: Fast implementation of iround(log2(x)) | José Fonseca | 2010-10-06 | 2 | -0/+39 |
| | | | | Not tested yet, but should be correct. | ||||
* | gallivm: Use a faster (and less accurate) log2 in lod computation. | José Fonseca | 2010-10-06 | 3 | -0/+53 |
| | |||||
* | gallivm: Take the type signedness in consideration in round/ceil/floor. | José Fonseca | 2010-10-06 | 1 | -48/+59 |
| | |||||
* | pb: fix numDelayed accounting | Dave Airlie | 2010-10-05 | 1 | -0/+1 |
| | | | | we weren't decreasing when removing from the list. | ||||
* | pb: don't keep checking buffers after first busy | Dave Airlie | 2010-10-05 | 1 | -13/+19 |
| | | | | | If we assume busy buffers are added to the list in order its unlikely we'd fine one after the first busy one that isn't busy. | ||||
* | draw: check for null sampler pointers | Brian Paul | 2010-09-30 | 1 | -4/+6 |
| | | | | http://bugs.freedesktop.org/show_bug.cgi?id=30516 | ||||
* | gallivm: added some comments | Brian Paul | 2010-09-30 | 2 | -8/+16 |
| | |||||
* | Revert "Prefer intrinsics to handrolled atomic ops." | Tom Fogal | 2010-09-30 | 1 | -2/+2 |
| | | | | | This reverts commit 5f66b340aa49c6bc8d0acb2d1a6f8e9a7ef2cb2e, quickly fixing 30514. | ||||
* | Prefer intrinsics to handrolled atomic ops. | Tom Fogal | 2010-09-30 | 1 | -2/+2 |
| | |||||
* | Implement x86_64 atomics for compilers w/o intrinsics. | Tom Fogal | 2010-09-30 | 1 | -0/+47 |
| | | | | | Really old gcc's (3.3, at least) don't have support for the intrinsics we need. This implements a fallback for that case. | ||||
* | gallivm: check for level=0 case in lp_build_minify() | Brian Paul | 2010-09-30 | 1 | -3/+10 |
| | | | | This lets us avoid the shift and max() operations. | ||||
* | gallivm: More comprehensive border usage logic. | José Fonseca | 2010-09-30 | 3 | -29/+51 |
| | |||||
* | gallium/util: remove duplicated include | Nicolas Kaiser | 2010-09-30 | 1 | -2/+0 |
| | | | | | | Remove duplicated include. Signed-off-by: Brian Paul <[email protected]> | ||||
* | gallium/rtasm: remove duplicated include | Nicolas Kaiser | 2010-09-30 | 1 | -1/+0 |
| | | | | | | Remove duplicated include. Signed-off-by: Brian Paul <[email protected]> | ||||
* | gallivm: Use SSE4.1's ROUNDSS/ROUNDSD for scalar rounding. | José Fonseca | 2010-09-29 | 1 | -21/+71 |
| | |||||
* | draw: pass sampler state down to llvm jit state | Brian Paul | 2010-09-29 | 3 | -0/+26 |
| | | | | | | | Fixes a regression caused from the change to make min/max lod dynamic state. https://bugs.freedesktop.org/show_bug.cgi?id=30437 | ||||
* | llvmpipe: fix swizzling of texture border color | Brian Paul | 2010-09-27 | 1 | -2/+2 |
| | | | | | | | | The pipe_sampler_view's swizzle terms also apply to the texture border color. Simply move the apply_sampler_swizzle() call after we fetch the border color. Fixes many piglit texwrap failures. | ||||
* | ureg: support centroid interpolation | Luca Barbieri | 2010-09-27 | 2 | -9/+31 |
| | |||||
* | util/u_blitter: fix leak | Joakim Sindholt | 2010-09-26 | 1 | -1/+1 |
| | |||||
* | gallivm: fix repeat() function for NPOT textures | Brian Paul | 2010-09-25 | 2 | -10/+17 |
| | | | | | The trick of casting the coord to an unsigned value only works for POT textures. Add a bias instead. This fixes a few piglit texwrap failures. | ||||
* | gallivm: Remove dead experimental code. | José Fonseca | 2010-09-25 | 2 | -78/+46 |
| | |||||
* | gallivm: Fetch the lod from the dynamic state when min_lod == max_lod. | José Fonseca | 2010-09-25 | 2 | -5/+4 |
| | |||||
* | draw: Fullfil the new min_lod/max_lod/lod_bias/border_color dynamic state | José Fonseca | 2010-09-25 | 3 | -0/+33 |
| | |||||
* | gallivm: optimize yuv decoding | Roland Scheidegger | 2010-09-25 | 1 | -9/+55 |
| | | | | | | | | | | | this is more a proof to show vector shifts on x86 with per-element shift count are evil. Since we can avoid the shift with a single compare/select, use that instead. Replaces more than 20 instructions (and slow ones at that) with about 3, and cuts compiled shader size with mesa's yuvsqure demo by over 10% (no performance measurements done - but selection is blazing fast). Might want to revisit that for future cpus - unfortunately AVX won't have vector shifts neither, but AMD's XOP will, but even in that case using selection here is probably not slower. | ||||
* | gallivm: don't use URem/UDiv when calculating offsets for blocks | Roland Scheidegger | 2010-09-25 | 1 | -1/+11 |
| | | | | | | | While it's true that llvm can and will indeed replace this with bit arithmetic (since block height/width is POT), it does so (llvm 2.7) by element and hence extracts/shifts/reinserts each element individually. This costs about 16 instructions (and extract is not really fast) vs. 1... | ||||
* | gallivm: fix copy&paste bug | Roland Scheidegger | 2010-09-25 | 1 | -1/+1 |
| | | | | | looks like pot_depth should be used, not pot_height (found by accident, not verified) | ||||
* | util: fix util_pack_color for B4G4R4A4 | Marek Olšák | 2010-09-25 | 1 | -1/+1 |
| | | | | NOTE: This is a candidate for the 7.9 branch. | ||||
* | llvmpipe: make texture border_color dynamic state | Brian Paul | 2010-09-24 | 3 | -25/+53 |
| | |||||
* | llvmpipe: make min/max lod and lod bias dynamic state | Brian Paul | 2010-09-24 | 4 | -20/+37 |
| | | | | | | Before, changing any of these sampler values triggered generation of new JIT code. Added a new flag for the special case of min_lod == max_lod which is hit during auto mipmap generation. |