Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | radeon/llvm: Add missing license headers | Tom Stellard | 2013-03-13 | 2 | -0/+52 |
| | | | | Signed-off-by: Tom Stellard <[email protected]> | ||||
* | radeon/llvm: Make radeon_llvm_util.cpp a C file | Tom Stellard | 2013-03-13 | 3 | -29/+8 |
| | | | | All the functions in this file are now implemented in C. | ||||
* | radeon/llvm: Optimize radeon_llvm_strip_unused_kernels() | Tom Stellard | 2013-03-13 | 2 | -17/+10 |
| | | | | | | | | Just delete unused kernels rather than marking them as internal and running the GlobalDCE pass. Also implement this function in C and inline it into radeon_llvm_get_kernel_module() | ||||
* | radeon/llvm: Implement radeon_llvm_get_kernel_module() using the C API | Tom Stellard | 2013-03-13 | 2 | -7/+15 |
| | |||||
* | radeon/llvm: Implement radeon_llvm_get_num_kernels() using the C API | Tom Stellard | 2013-03-13 | 1 | -6/+1 |
| | |||||
* | radeon/llvm: Implement radeon_llvm_parse_bitcode() using C API | Tom Stellard | 2013-03-13 | 2 | -8/+11 |
| | | | | Also make the function static since it is not used anywhere else. | ||||
* | r600g/llvm: Move llvm wrapper functions into the radeon directory | Tom Stellard | 2013-03-13 | 7 | -38/+35 |
| | |||||
* | autotools: Add missing top-level include dir. | José Fonseca | 2013-03-13 | 1 | -0/+1 |
| | | | | | Fixes autotools build failure. Not sure if there are more, as I have difficulties in building the full tree. | ||||
* | mesa: Use PACKAGE_BUGREPORT macro. | Matt Turner | 2013-03-12 | 1 | -1/+1 |
| | | | | Reviewed-by: Eric Anholt <[email protected]> | ||||
* | radeonsi: Fix off-by-one for maximum vertex element index in some cases | Michel Dänzer | 2013-03-12 | 1 | -2/+8 |
| | | | | | | | | | | | In cases where the vertex element size is smaller than the vertex buffer stride, the previous calculation could end up 1 too low. This would result in the GPU using index 0 instead of the maximum index for those elements, which would be visible as intermittent distorted triangles. NOTE: This is a candidate for the 9.1 branch. Reviewed-by: Alex Deucher <[email protected]> | ||||
* | nvc0: avoid crash on updating RASTERIZE_ENABLE state | Christoph Bumiller | 2013-03-12 | 2 | -4/+18 |
| | | | | | When doing a blit with the 3D engine, the rasterizer or zsa cso may be NULL. | ||||
* | nvc0: add MP trap handler for nve4 | Christoph Bumiller | 2013-03-12 | 4 | -15/+314 |
| | |||||
* | nvc0: they removed the NTID,NCTAID,GRIDID registers on nve4 | Christoph Bumiller | 2013-03-12 | 6 | -23/+66 |
| | |||||
* | nvc0: implement compute support for nve4 | Christoph Bumiller | 2013-03-12 | 18 | -78/+1882 |
| | |||||
* | nvc0/ir: try to fix CAS (CompareAndSwap) | Christoph Bumiller | 2013-03-12 | 2 | -1/+42 |
| | |||||
* | nv50/ir: add CCTL (cache control) op | Christoph Bumiller | 2013-03-12 | 5 | -4/+33 |
| | |||||
* | nvc0/ir/emit: fix emission of large address offsets | Christoph Bumiller | 2013-03-12 | 1 | -8/+50 |
| | |||||
* | nvc0: add SHADER/COMPUTE_RESOURCE bind flags to format table | Christoph Bumiller | 2013-03-12 | 1 | -43/+53 |
| | |||||
* | nouveau: align PIPE_BIND_SHADER,COMPUTE_RESOURCEs to 256 bytes | Christoph Bumiller | 2013-03-12 | 1 | -1/+3 |
| | |||||
* | nv50,nvc0: copy writable flag on surface creation | Christoph Bumiller | 2013-03-12 | 2 | -0/+2 |
| | |||||
* | nv50/ir: add support for different sampler and resource index on nve4 | Christoph Bumiller | 2013-03-12 | 3 | -37/+51 |
| | | | | | | | | | | And remove non-working code for indirect sampler/resource selection. Will be added back later. Includes code from "nv50/ir/tgsi: Resource indirect indexing" by Francisco Jerez (when mixing the R and S handles we can only specify them via a register, i.e. indirectly, unless we upload all the used handle combinations to c[] space, which we don't for now). | ||||
* | nv50/ir: implement splitting of 64 bit ops after RA | Christoph Bumiller | 2013-03-12 | 6 | -39/+98 |
| | |||||
* | nvc0/ir: skip back edges when determining latest sched value | Christoph Bumiller | 2013-03-12 | 1 | -3/+4 |
| | |||||
* | nvc0/ir: use large issue delay after RET, too | Christoph Bumiller | 2013-03-12 | 1 | -1/+1 |
| | |||||
* | nv50/ir: fix size adjustment for sched info for multiple functions | Christoph Bumiller | 2013-03-12 | 1 | -6/+11 |
| | |||||
* | nv50/ir: print function inputs and outputs | Christoph Bumiller | 2013-03-12 | 1 | -1/+22 |
| | |||||
* | nv50/ir/ssa: add a few comments regarding RenamePass | Christoph Bumiller | 2013-03-12 | 1 | -0/+19 |
| | |||||
* | nv50/ir/tgsi: Exclude local declarations from function prototypes. | Francisco Jerez | 2013-03-12 | 1 | -5/+28 |
| | |||||
* | nv50/ir/opt: try to make use of SUCLAMP addend | Christoph Bumiller | 2013-03-12 | 1 | -0/+45 |
| | |||||
* | nv50/ir: don't assert on type in Modifier.applyTo if it is 0 | Christoph Bumiller | 2013-03-12 | 1 | -0/+2 |
| | |||||
* | nv50/ir: add support for barriers | Christoph Bumiller | 2013-03-12 | 7 | -15/+161 |
| | | | | nv50 part by Francisco Jerez. | ||||
* | nv50/ir/tgsi: add support for atomics | Christoph Bumiller | 2013-03-12 | 1 | -0/+89 |
| | |||||
* | nv50/ir/tgsi: handle TGSI_OPCODE_LOAD,STORE | Christoph Bumiller | 2013-03-12 | 7 | -30/+303 |
| | | | | | | | | | | | | | | | | Squashed and (heavily) modified original patches by Francisco Jerez: nv50/ir/tgsi: Implement resource LOAD/STORE (wip). nv50/ir/tgsi: Emit SUST/SULD for surface access, and add CB LOAD/STORE support nv50/ir/tgsi: Fix/clean up the LOAD/STORE handling code. Left out for now: nv50/ir/tgsi: Resource indirect indexing Treating raw, read-only surfaces as constant buffers (CBs) was removed because CBs are limited to a size of 64 KiB which isn't desireable, and because this decision should probably be made by the state tracker. If we used a number of CB slots for surfaces, it might find that we cannot accomodate the advertised limit. | ||||
* | nvc0/ir: don't replace load from input in COMPUTE progs with VFETCH | Christoph Bumiller | 2013-03-12 | 1 | -2/+7 |
| | |||||
* | nvc0/ir: implement lowering of surface ops for nve4 | Christoph Bumiller | 2013-03-12 | 8 | -16/+429 |
| | |||||
* | nvc0/ir: add formatted surface load lib code, move to extra header | Christoph Bumiller | 2013-03-12 | 6 | -149/+1309 |
| | | | | | | OpenGL is nice and makes the user specify a format with an image unit. OpenCL is evil and doesn't, and what's better than adding a huge load of functions that we call indirectly to handle the conversion ? | ||||
* | nv50/ir: extend moveSources for delta < 0 | Christoph Bumiller | 2013-03-12 | 2 | -16/+31 |
| | |||||
* | nvc0/ir: lower atomics in s[] | Christoph Bumiller | 2013-03-12 | 1 | -0/+33 |
| | |||||
* | nvc0/ir/emit: implement INSBF, EXTBF, PERMT and ATOM | Christoph Bumiller | 2013-03-12 | 2 | -1/+133 |
| | |||||
* | nv50/ir/emit: handle OP_ATOM | Christoph Bumiller | 2013-03-12 | 1 | -0/+41 |
| | |||||
* | nvc0/ir/target: some ops can't be predicated, e.g. CALL | Christoph Bumiller | 2013-03-12 | 1 | -0/+8 |
| | |||||
* | nv50/ir/opt: CALLs cannot load | Christoph Bumiller | 2013-03-12 | 1 | -0/+3 |
| | |||||
* | nv50/ir: add support for indirect BRA,CALL | Christoph Bumiller | 2013-03-12 | 5 | -6/+29 |
| | |||||
* | nvc0/ir/emit: implement move to and logic ops on predicates | Christoph Bumiller | 2013-03-12 | 1 | -0/+45 |
| | |||||
* | nvc0/ir/emit: implement surface related ops | Christoph Bumiller | 2013-03-12 | 2 | -0/+301 |
| | |||||
* | nv50/ir: initialize CodeEmitters' specialized target fields | Christoph Bumiller | 2013-03-12 | 3 | -9/+10 |
| | |||||
* | nv50/ir/opt: make optimization aware of atomics, barriers, surface ops | Christoph Bumiller | 2013-03-12 | 2 | -1/+28 |
| | |||||
* | nv50/ir: add various new OPs that will be needed for compute | Christoph Bumiller | 2013-03-12 | 9 | -48/+179 |
| | |||||
* | nv50/ir: Rename "mkLoad" to "mkLoadv" for consistency. | Francisco Jerez | 2013-03-12 | 4 | -12/+21 |
| | |||||
* | nv50/ir: fix comparison of system values | Christoph Bumiller | 2013-03-12 | 1 | -0/+3 |
| |