summaryrefslogtreecommitdiffstats
path: root/src/gallium
Commit message (Collapse)AuthorAgeFilesLines
* galahad, i915g: Copy over constant buffer index check.Corbin Simpson2010-08-202-2/+9
|
* galahad, i915g: Move over a few state asserts.Corbin Simpson2010-08-202-3/+16
|
* galahad: Make it obvious on stderr that Galahad's active.Corbin Simpson2010-08-192-0/+4
|
* gallium/docs: Add GalahadJakob Bornecrantz2010-08-202-0/+13
|
* translate_sse: enable on Win64Luca Barbieri2010-08-201-2/+1
| | | | According to Vinson, enabling it causes no regressions
* u_blitter: fix a memory leakMarek Olšák2010-08-191-0/+1
|
* r300g: do not use fastfill with 16-bit zbuffersMarek Olšák2010-08-191-0/+4
| | | | | | | To my knowledge, there is no way to flush zmask and thus write the clear value. This fixes zbuffer reads, among other things.
* r600g: update comments about ALU src operandsAlex Deucher2010-08-192-2/+12
|
* translate_sse: try to fix Win64Luca Barbieri2010-08-191-1/+2
| | | | Not sure whether it works now (it is still disabled).
* translate_sse: fix emit_load_sse2Luca Barbieri2010-08-191-0/+2
|
* st/egl: Fix win32 build.Chia-I Wu2010-08-191-0/+1
| | | | Add new source files to SConscripts.
* r600g: add sin/cosDave Airlie2010-08-192-2/+123
| | | | | | | This pretty much ports the code from r600c, however it doesn't always seem to work quite perfectly, but I can't find anything in this code that is wrong. I'm guessing either literal input or constants aren't working always.
* r600g: add a chiprev type for r600/r700/evergreen instead of using familyDave Airlie2010-08-192-13/+25
|
* r600g: add SSG, SEQ, SGT and SNEDave Airlie2010-08-192-12/+88
|
* gallivm: Use proper index to lookup predicate register array.michal2010-08-181-1/+1
| | | | Doesn't fix anything, as those indices were both always 0.
* r600g: add FRC, FLR, DDX and DDYDave Airlie2010-08-181-4/+4
| | | | the first two are straight op2's and the DDX/DDY are taken from r600c.
* r600g: add SGE and SLE opcodesDave Airlie2010-08-181-2/+2
| | | | fixes fp-set-01 and glsl-fs-step
* r600g: add TXB supportDave Airlie2010-08-181-1/+1
| | | | fixes biased texturing tests
* r600g: fix TXP vs TEX in shader.Dave Airlie2010-08-182-65/+67
| | | | | | Don't do perspective for TEX, and also copy input to a temporary for TEX also add tex opcode names
* r600g: add two simple tgsi opcodes.Dave Airlie2010-08-181-2/+2
| | | | makes glsl-fs-log2 and glsl1-integer division with uniform var pass
* r600g: fix point sizeDave Airlie2010-08-181-1/+1
| | | | fixes piglit pointAtten and point-sprite tests
* r600g: fixup pitch alignment like r600c.Dave Airlie2010-08-182-3/+7
| | | | This still needs work, passes tex3d, fbo-scissor-bitmap, scissor-bitmap
* translate_sse: don't overwrite source buffer pointerLuca Barbieri2010-08-181-5/+5
| | | | | We were putting the source pointer in a register used as a temporary, breaking all paths that don't read the data in a single instruction.
* r600g: fix height calcs for miptreeDave Airlie2010-08-181-0/+1
| | | | | | | h needs to be rounded up, this probably needs revisiting when we get to tiling etc. fixes fbo-generatemipmap-npot
* r600g: emit texture level offset in CB/DB setup.Dave Airlie2010-08-181-2/+2
| | | | | | | 8 more piglit tests pass, fbo-clearmipmap, fbo-copyteximage, fbo-generatemipmap, fbo-generatemipmap-nonsquare, fbo-generatemipmap-scissor, fbo-generatemipmap-viewport, gen-teximage, gen-texsubimage
* r600g: fix fake pixel outputJerome Glisse2010-08-171-1/+1
| | | | Signed-off-by: Jerome Glisse <[email protected]>
* targets/egl: Link with DRI_LIB_DEPS.Chia-I Wu2010-08-171-3/+3
| | | | | Use DRI_LIB_DEPS when linking GL/GLES state trackers. This fixes missing talloc symbol errors, and is hopefully more future proof.
* gallivm: Fix and re-enable MMX-disabling codenobled2010-08-171-5/+1
| | | | Signed-off-by: José Fonseca <[email protected]>
* r300g: fix context destroy under hyperzDave Airlie2010-08-171-3/+3
| | | | | | we were destroying the mm before unrefing all the objects, so segfault. Signed-off-by: Dave Airlie <[email protected]>
* r600g: kill event type magic number in winsysDave Airlie2010-08-172-3/+5
| | | | these events have names, use them.
* r600g: add user clip plane support.Dave Airlie2010-08-175-13/+89
| | | | | | | | | Apart from the fact that the radeon.h/r600_states.h editing is a nightmare, this wasn't so bad. passes piglit user-clip test now also trivial tests. Signed-off-by: Dave Airlie <[email protected]>
* Merge branch 'glsl2'Ian Romanick2010-08-163-8/+7
|\ | | | | | | | | Conflicts: src/mesa/program/prog_optimize.c
| * scons: Link talloc.José Fonseca2010-08-141-0/+1
| |
| * Merge branch 'master' into glsl2Ian Romanick2010-08-13193-3812/+6087
| |\
| * | scons: Build the new glsl2 code.José Fonseca2010-08-131-0/+1
| | |
| * | gallium: Link DRI drivers with g++ and test with CXXJakob Bornecrantz2010-08-121-8/+5
| | | | | | | | | | | | | | | New shader compiler need libc++ runtime. This works already if we are using llvm so this just covers the !llvm case.
* | | util: Remove check_os_katmai_support.Vinson Lee2010-08-161-119/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | check_os_katmai_support checks that the operating system running on a SSE-capable processor supports SSE. This is necessary for unpatched 2.2.x and earlier kernels. 2.4.x and later kernels support SSE. check_os_katmai_support will disable SSE capabilities for 32-bit x86 operating systems for which there is no code path. Currently, this function handles Linux, Windows, and several BSDs. Mac OS, Cygwin, and Solaris are several operating systems with no code paths. Rather than add code for the unhandled operating systems, remove this function altogether. This will fix SSE detection on all recent 32-bit x86 operating systems. This completely breaks functionality on unpatched 2.2.x and earlier kernels, although there are likely no Gallium3D users on such operating systems.
* | | r600g: drop libdrm_radeon linkDave Airlie2010-08-172-1/+187
| | |
* | | translate: Move loop variable declaration outside for loop.Vinson Lee2010-08-161-1/+2
| | | | | | | | | | | | Fixes MSVC build.
* | | r300g: fix assert in the rasterizer block for r3xx-r4xxMarek Olšák2010-08-161-1/+1
| | | | | | | | | | | | Reported-by: Niels Ole Salscheider <[email protected]>
* | | translate: Remove unused temporary register.José Fonseca2010-08-161-1/+0
| | | | | | | | | | | | Assuming the side-effect of x86_make_reg is also unnecessary.
* | | translate: Eliminate void pointer arithmetic.José Fonseca2010-08-161-1/+1
| | | | | | | | | | | | Non-portable.
* | | st/egl: Add support for EGL_KHR_fence_sync.Chia-I Wu2010-08-173-0/+71
| | | | | | | | | | | | The extension is implemented by pipe_fence_handle.
* | | st/egl: Add support for EGL_KHR_reusable_sync.Chia-I Wu2010-08-175-0/+297
| | | | | | | | | | | | The extension is implemented by pipe_condvar.
* | | draw_llvm: fix segfaults on non-SSE2 CPUs where it is disabled (v2)Luca Barbieri2010-08-163-17/+32
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Changes in v2: - Change function name Currently draw_llvm refuses to create itself on non-SSE2 CPUs due to an alleged LLVM bug. However, this is implemented improperly, because other parts of draw still attempt to access draw->llvm, resulting in segfaults. Instead, put the check in debug_get_option_draw_use_llvm, check that before calling draw_llvm_create, and then check whether draw->llvm is non-null everywhere else.
* | | translate_sse: major rewrite (v5)Luca Barbieri2010-08-162-239/+936
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | NOTE: Win64 is untested, and is thus currently disabled. If you have such a system, please enable it and report whether it works. To enable it, change src/gallium/auxiliary/translate/translate.c Changes in v5: - On Win64, preserve %xmm6 and %xmm7 as required by the ABI - Use _WIN64 instead of WIN64 Changes in v4: - Use x86_target() and x86_target_caps() - Enable translate_sse in x86-64, but not in Win64 Changes in v3: - Win64 support (untested) - Use u_cpu_detect.h constants instead of #ifs Changes in v2: - Minimize #ifs - Give a name to magic number CHANNELS_0001 - Add support for CPUs without SSE (only memcpy and swizzles, like non SSE2) - Fixed comments translate_sse is currently very limited to the point of being useless in essentially all cases. In particular, it only support some float32 and unorm8 formats and doesn't work on x86-64. This commit rewrites it to support: 1. Dumb memory copy for any pair of identical formats 2. All formats that are swizzles of each other 3. Converting 32/64-bit floats and all 8/16/32-bit integers to 32-bit float 4. Converting unorm8/snorm8 to snorm16 and uscaled8/sscaled8 to sscaled16 5. Support for x86-64 (doesn't take advantage of it in any way though) This new translate can even be useful to translate index buffers for cards that lack 8-bit index support. It passes the testsuite I wrote, but note that this is a major change, and more testing would be great.
* | | rtasm: add minimal x86-64 support and new instructions (v5)Luca Barbieri2010-08-163-40/+551
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Changes in v5: - Add sse2_movdqa Changes in v4: - Use _WIN64 instead of WIN64 Changes in v3: - Add target and target caps functions, so that they could be different in principle from the current CPU and they don't need #ifs to check Changes in v2: - Win64 support (untested) - Use u_cpu_detect.h constants instead of #ifs This commit adds minimal x86-64 support: only movs between registers are supported for r8-r15, and x64_rexw() must be used to ask for 64-bit operations. It also adds several new instructions for the new translate_sse code. movdqa
* | | translate: add support for 8/16-bit indicesLuca Barbieri2010-08-165-19/+108
| | | | | | | | | | | | | | | Currently, only 32-bit indices are supported, but some use cases translate needs support for all types.
* | | translate_sse: remove useless generated function wrappersLuca Barbieri2010-08-161-51/+4
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Currently translate_sse puts two trivial wrappers in the translate vtable. These slow it down and enlarge the source code for no gain, except perhaps the ability to set a breakpoint there, so remove them. Breakpoints can be set on the caller of the translate functions, with no loss of functionality.
* | | translate_generic: factor out common code between linear and indexedLuca Barbieri2010-08-161-115/+62
| | | | | | | | | | | | This moves the common code into a separate ALWAYS_INLINE function.