Commit message | Author | Age | Files | Lines
* vbo: do FLUSH_CURRENT before validating and updating state | Marek Olšák | 2012-04-23 | 1 | -10/+16
    This fixes an assertion failure since:
      commit 81afdd20f3f574ce29559d8ad77df5c77652009e
      vbo: don't check twice whether it's valid to render
    FLUSH_CURRENT may set _NEW_CURRENT_ATTRIB.
    Reviewed-by: Brian Paul
* r300g/automake: add $ARCH_FLAGS and $OPT_FLAGS | Marek Olšák | 2012-04-23 | 1 | -0/+2
    Reviewed-by: Tom Stellard
* r600g/automake: add $ARCH_FLAGS and $OPT_FLAGS | Marek Olšák | 2012-04-23 | 1 | -0/+2
    Reviewed-by: Tom Stellard
* r600g: init vars to silence warnings | Brian Paul | 2012-04-23 | 1 | -2/+2
* bin/mklib: remove '-m32' for arm linux | Han Shen(沈涵) | 2012-04-23 | 1 | -2/+6
    -m32 is not a valid option for ARM.
    NOTE: This is a candidate for the 8.0 branch.
    Signed-off-by: Brian Paul
* st/mesa: no-op glCopyPixels if source region is out of bounds | Brian Paul | 2012-04-23 | 1 | -1/+9
    If the source region for a glCopyPixels is completely outside the
    source buffer bounds, no-op the copy. Fixes a failed assertion.
    NOTE: This is a candidate for the 8.0 branch.
    Reviewed-by: Jose Fonseca
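
The "completely outside the source buffer bounds" test in the glCopyPixels commit above can be sketched as a rectangle check. This is only an illustration under stated assumptions: `region_fully_outside` and its parameters are hypothetical names, the buffer origin is assumed to be (0, 0), and the actual st/mesa change may be structured differently.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical helper, not Mesa's actual code: returns true when the
 * rectangle (x, y, width, height) shares no pixel with a buffer of size
 * buf_width x buf_height whose origin is assumed to be (0, 0). */
bool
region_fully_outside(int x, int y, int width, int height,
                     int buf_width, int buf_height)
{
   assert(width >= 0 && height >= 0);
   assert(buf_width >= 0 && buf_height >= 0);

   /* Sums are done in long long so x + width cannot overflow int. */
   return x >= buf_width ||                 /* starts past the last column */
          y >= buf_height ||                /* starts past the last row */
          (long long) x + width <= 0 ||     /* ends before column 0 */
          (long long) y + height <= 0;      /* ends before row 0 */
}
```

A caller would return early (no-op) when the helper reports true and fall through to the normal, clipped copy path otherwise.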
* radeon/llvm: Don't print an error message when there is no error | Tom Stellard | 2012-04-23 | 1 | -2/+1
    A blank line with an empty error message was being printed even
    when the target lookup succeeded.
* radeon/llvm: Lower VCREATE_v4f32 for R600 and SI | Tom Stellard | 2012-04-23 | 5 | -33/+22
* r600g/llvm: Let ISel handle lowering to {INSERT,EXTRACT}_SUBREG | Tom Stellard | 2012-04-23 | 6 | -88/+37
* r600g/llvm: Only emit an instruction's explicit operands | Tom Stellard | 2012-04-23 | 1 | -2/+2
* r600g/llvm: Handle copies between vector registers | Tom Stellard | 2012-04-23 | 2 | -2/+21
* r600g/llvm: Remove debugging hack from R600InstrInfo::copyPhysReg() | Tom Stellard | 2012-04-23 | 1 | -4/+0
* r600g/llvm: Tell the code emitter to ignore KILL and BUNDLE | Tom Stellard | 2012-04-23 | 1 | -1/+3
* r600/llvm: Add LOAD_VTX instruction | Tom Stellard | 2012-04-23 | 1 | -0/+13
* r600g: Add hooks for the LLVM shader compiler | Tom Stellard | 2012-04-23 | 4 | -2/+318
    The LLVM backend can now be enabled for r600g by using the
    --enable-r600-llvm-compiler configure flag.
    If you configure with this flag, you can still use the default
    compiler by setting the environment variable R600_USE_LLVM=0.
    Reviewed-by: Alex Deucher
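
As a rough sketch of the opt-out described in the commit above, a driver built with the LLVM backend could consult `R600_USE_LLVM` at startup. The variable name comes from the commit message; the helper names and the rule that only the exact value "0" disables LLVM are assumptions, and the real r600g code may parse the variable differently.

```c
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical parser: the LLVM backend stays enabled unless the
 * variable is set to exactly "0" (an assumption, not r600g's rule). */
bool
llvm_enabled_for_value(const char *value)
{
   return value == NULL || strcmp(value, "0") != 0;
}

/* Reads the environment variable named in the commit message. */
bool
llvm_enabled(void)
{
   return llvm_enabled_for_value(getenv("R600_USE_LLVM"));
}
```

Splitting the string test from the `getenv()` call keeps the decision logic checkable without touching the process environment.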
* r600g: Add TGSI->LLVM implementation v2 | Tom Stellard | 2012-04-23 | 2 | -0/+330
    v2: Add case for ARUBA in r600_llvm_gpu_string()
    Reviewed-by: Alex Deucher
* radeon: Move radeon_llvm_emit.cpp declarations into their own header | Tom Stellard | 2012-04-23 | 5 | -19/+58
    Reviewed-by: Alex Deucher
* radeon: Remove HAVE_LLVM ifdefs | Tom Stellard | 2012-04-23 | 1 | -10/+0
    Only LLVM 3.1 is supported, so these are not necessary.
    Reviewed-by: Alex Deucher
* r600g: fix gpr number calculation | Vadim Girlin | 2012-04-23 | 1 | -0/+3
    Reviewed-by: Alex Deucher
    Signed-off-by: Tom Stellard
* configure.ac: Move HAVE_LLVM definition into configure.ac | Tom Stellard | 2012-04-23 | 2 | -5/+1
    Otherwise HAVE_LLVM won't be included in the $(DEFINES) variable
    for Automake generated Makefiles.
    Reviewed-by: Alex Deucher
* r600g: split add_reg into add_reg and add_reg_bo variants | Dave Airlie | 2012-04-23 | 4 | -215/+193
    This shaves 2k off the final dri.so, and removes lots of pointless
    NULL, 0 passing.
    Most likely pointless, but it looked nicer to me.
    Signed-off-by: Dave Airlie
* glapi: Correct size of allocated _glapi_table struct | Jonas Maebe | 2012-04-22 | 1 | -1/+1
    The __glapi_gentable_set_remaining_noop() routine treats the
    _glapi_struct as an array of _glapi_get_dispatch_table_size()
    pointers, so we have to allocate
    _glapi_get_dispatch_table_size()*sizeof(void*) bytes rather than
    sizeof(struct _glapi_struct) bytes.
    Reviewed-by: Jeremy Huddleston
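
The sizing rule in the glapi commit above can be illustrated with a small stand-in. Everything here is hypothetical (`struct dispatch_table`, `alloc_dispatch_table`, `noop_entry`); it only shows why a table that is later walked as an array of N pointers has to be allocated as N pointer-sized slots rather than `sizeof` the compile-time struct.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for a dispatch table: the struct declares only
 * the entries known at compile time; the runtime table may hold more. */
struct dispatch_table {
   void (*known_entry[4])(void);
};

typedef void (*entry_fn)(void);

static void noop_entry(void) { }

/* Allocate num_entries slots and point every slot at a no-op.
 * The size is derived from the runtime count, not from
 * sizeof(struct dispatch_table), so the fill loop stays in bounds. */
struct dispatch_table *
alloc_dispatch_table(size_t num_entries)
{
   entry_fn *slots;
   size_t i;

   assert(num_entries > 0);
   slots = malloc(num_entries * sizeof(entry_fn));
   if (!slots)
      return NULL;

   for (i = 0; i < num_entries; i++)
      slots[i] = noop_entry;

   return (struct dispatch_table *) slots;
}
```

With 8 runtime entries and a struct that only declares 4, allocating `sizeof(struct dispatch_table)` would leave the last four writes out of bounds, which is the class of bug the commit describes.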
* r600g: enable GLSL130 on all cards | Dave Airlie | 2012-04-22 | 1 | -3/+2
    Alexandre Demers sent me some cayman results with no major problems.
    I'll rip out the env var in a week or so.
    Signed-off-by: Dave Airlie
* r600g: enable dual src blending on r600 cards | Dave Airlie | 2012-04-21 | 1 | -1/+1
    Tested on my rv610 and it passes the tests with no hangs.
    Signed-off-by: Dave Airlie
* r600g: enable GLSL 1.30 for r600 class | Dave Airlie | 2012-04-21 | 1 | -1/+1
    Full piglit run on my rv610 with no regressions.
    This only leaves cayman, however my cayman is resisting my attempt
    to get through a full piglit run.
    Signed-off-by: Dave Airlie
* r600: enable glsl 1.30 on r700 | Dave Airlie | 2012-04-21 | 1 | -1/+1
    I've done a piglit run on rv740 and confirmed no regressions.
    We don't get GL3 on r700 due to transform feedback being busted still.
    Signed-off-by: Dave Airlie
* r600g: report INTEGERS cap if glsl130 is on. | Dave Airlie | 2012-04-20 | 1 | -1/+2
    This cap is used by u_blitter to decide if it can use integers in
    vertex data.
    Fixes some crashes with glsl130 in piglit.
    Signed-off-by: Dave Airlie
* r600g: enable glsl 130 on evergreen. | Dave Airlie | 2012-04-20 | 1 | -1/+3
    I've done a piglit run on my SUMO machine and I see no regressions.
    Lots of things to fix (skip->fail), but hey maybe we can fix them
    if we can see them.
    I'll try and work my way across r600,700,cayman sometime if nobody
    else gets to them.
    Signed-off-by: Dave Airlie
* r600g: disable I2F conversion for InstanceID if integers are supported | Vadim Girlin | 2012-04-20 | 1 | -11/+16
    Signed-off-by: Vadim Girlin
    Reviewed-by: Michel Dänzer
    Signed-off-by: Dave Airlie
* r600g: store glsl_feature_level in the r600_screen | Vadim Girlin | 2012-04-20 | 2 | -1/+3
    Signed-off-by: Vadim Girlin
    Reviewed-by: Michel Dänzer
    Signed-off-by: Dave Airlie
* mesa: move gl_array_attrib::NewState to gl_array_object::NewArrays | Brian Paul | 2012-04-20 | 6 | -18/+10
    The field wasn't actually used before and it's not used now either.
    But this is a more logical place for it and will hopefully allow
    doing smarter draw/array validation (per array object) in the future.
    Reviewed-by: Mathias Fröhlich
* docs: recommend 2GB of RAM for Viewperf tests | Brian Paul | 2012-04-20 | 1 | -0/+4
* i965: Convert live interval computation to using live variable analysis. | Eric Anholt | 2012-04-19 | 1 | -39/+26
    Our previous live interval analysis just said that anything in a
    loop was live for the whole loop. If you had to spill a reg in a
    loop, then we would consider the unspilled value live across the
    loop too, so you never made progress by spilling. Eventually it
    would consider everything in the loop unspillable and fail out.
    With the new analysis, things completely defined and used inside
    the loop won't be marked live across the loop, so even if you
    spill/unspill something that used to be live across the loop, you
    reduce register pressure. But you usually don't even have to spill
    any more, since our intervals are smaller than before.
    This fixes an assertion failure when compiling the shader for the
    "glyphy" text rasterizer and piglit glsl-fs-unroll-explosion.
    Improves Unigine Tropics performance 1.3% +/- 0.2% (n=5), by
    allowing more shaders to be compiled in 16-wide mode.
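
The property the commit above relies on (a value defined and used entirely inside a loop is not live across it) falls out of standard backward live-variable dataflow. The sketch below is a textbook version, not the i965 implementation: `struct block`, the one-bit-per-variable masks, and the two-successor limit are assumptions made for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define MAX_SUCCS 2

/* Hypothetical basic block: up to 32 variables, one bit per variable. */
struct block {
   uint32_t use;             /* read before any write inside the block */
   uint32_t def;             /* written inside the block */
   int succ[MAX_SUCCS];      /* successor block indices, -1 if unused */
   uint32_t livein, liveout; /* filled in by the analysis */
};

/* Iterate to a fixed point:
 *    liveout(b) = union of livein(s) over successors s
 *    livein(b)  = use(b) | (liveout(b) & ~def(b))
 */
void
compute_live_variables(struct block *blocks, int num_blocks)
{
   bool progress = true;

   assert(num_blocks >= 0);
   for (int b = 0; b < num_blocks; b++)
      blocks[b].livein = blocks[b].liveout = 0;

   while (progress) {
      progress = false;
      /* Walking blocks in reverse usually converges faster for a
       * backward problem, but any order reaches the same result. */
      for (int b = num_blocks - 1; b >= 0; b--) {
         uint32_t liveout = 0;

         for (int s = 0; s < MAX_SUCCS; s++) {
            if (blocks[b].succ[s] >= 0)
               liveout |= blocks[blocks[b].succ[s]].livein;
         }

         uint32_t livein = blocks[b].use | (liveout & ~blocks[b].def);

         if (livein != blocks[b].livein || liveout != blocks[b].liveout) {
            blocks[b].livein = livein;
            blocks[b].liveout = liveout;
            progress = true;
         }
      }
   }
}
```

For a four-block graph (entry, loop header, loop body with a back edge, exit) where variable 0 is defined in the entry and read in the exit, and variable 1 is defined and consumed inside the body, only variable 0 ends up live at the loop header and on the back edge.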
* i965: Move the old live interval analysis code next to the new live vars code. | Eric Anholt | 2012-04-19 | 2 | -122/+122
    I'm about to replace the insides of this using the new analysis.
* i965: Add support for live variable analysis using dataflow analysis. | Eric Anholt | 2012-04-19 | 3 | -0/+245
* i965: Add basic block generator. | Eric Anholt | 2012-04-19 | 5 | -0/+392
    This takes the fs_inst list generated by the visitor, and generates
    a list of basic blocks with edges between them. This is a building
    block for data-flow analysis.
* i965/fs: Suppress printing the whole loop in BRW_OPCODE_DO annotation. | Eric Anholt | 2012-04-19 | 1 | -0/+2
* glsl: Properly throw compile-time errors for conflicting FS output writes. | Eric Anholt | 2012-04-19 | 1 | -0/+66
    We were checking for these at link time previously, which is not as
    early as mandated, and would actually fail to detect conflicting
    writes if dead code removal removed some writes.
    Fixes failures in piglit
    glsl-*/compiler/fragment-outputs/write-gl_Frag*
    Reviewed-by: Kenneth Graunke
* glsl: Track in each ir_variable whether it was ever assigned. | Eric Anholt | 2012-04-19 | 3 | -13/+33
    This will be used for some compile-and-link-time error checking,
    where currently we've been doing error checking only at link time.
    Reviewed-by: Kenneth Graunke
* glsl: Convert the tests directory to automake. | Eric Anholt | 2012-04-19 | 4 | -1/+5
    This runs optimization-test and produces the usual automake test
    output, which may be interesting to automated build systems.
    This doesn't convert the tests to be individually exposed to the
    automake runner, because automake doesn't like wildcards (due to
    being nonportable in make, not that we care).
    Reviewed-by: Kenneth Graunke
* glsl: Fix up some comments in structure splitting, based on array-split work. | Eric Anholt | 2012-04-19 | 1 | -3/+8
    Reviewed-by: Kenneth Graunke
* glsl: Don't allow array splitting on function arguments. | Eric Anholt | 2012-04-19 | 1 | -1/+17
    This is the reason the declaration member existed in the reference
    visitor, but I didn't copy the code from structure splitting that
    avoided setting it.
    This wasn't currently a problem, because we don't allow splitting
    of in/out variables. But that would be nice to change some day.
    Reviewed-by: Kenneth Graunke
* glsl: Rename the "whole_array_access" member in array splitting. | Eric Anholt | 2012-04-19 | 1 | -11/+15
    This was carried over from structure splitting, without thinking
    about whether the name still made sense in this context.
    Reviewed-by: Kenneth Graunke
* glsl: Fix up a comment explaining what a visitor class does. | Eric Anholt | 2012-04-19 | 1 | -1/+4
    Ken noted that some of the "actual work" was happening in the
    caller of this class.
    Reviewed-by: Kenneth Graunke
* glsl: Use the column_types() helper method. | Eric Anholt | 2012-04-19 | 1 | -2/+1
    Ken noted this in a review of this patch that I pushed early.
    Reviewed-by: Kenneth Graunke
* glx: Use AM_CPPFLAGS to pass -I and -D to both C and C++ compiles. | Eric Anholt | 2012-04-19 | 1 | -5/+3
    Reviewed-by: Kenneth Graunke
    Reviewed-by: Adam Jackson
* Revert "mesa: only update _MaxElement when we actually need it" | Marek Olšák | 2012-04-19 | 1 | -3/+1
    This reverts commit 0de5a21470b3bff9b7c8714e5d960d5ed9d01b9c.
    I was wrong, we use it in the vbo module too.
    This fixes a performance regression in Nexuiz.
* radeonsi: Fix VGPR_BIT() definition. | Michel Dänzer | 2012-04-19 | 1 | -1/+1
    Fixes encoding of VOP3 shader instructions. The shift was wrong for
    source registers 2 and 3, and the resulting value was only 32 bits,
    so the shift in SICodeEmitter::VOPPostEncode() didn't work as
    intended.
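
The "resulting value was only 32 bits" part of the VGPR_BIT() commit above is a common C pitfall when a flag is later shifted into the upper half of a 64-bit instruction word. The sketch below is generic: `FLAG_BIT64` and `set_flag` are hypothetical names, and the bit positions are made up rather than taken from the radeonsi encoding.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical flag macro, not the radeonsi definition. Starting from a
 * 64-bit constant keeps bits at positions 32..63 from being lost, which
 * is what happens if the shift is done on a 32-bit int instead. */
#define FLAG_BIT64(pos) (UINT64_C(1) << (pos))

/* Set one flag bit in a 64-bit instruction encoding. */
uint64_t
set_flag(uint64_t encoding, unsigned pos)
{
   assert(pos < 64);
   return encoding | FLAG_BIT64(pos);
}
```

By contrast, an expression such as `1 << 40` operates on a 32-bit `int` on common platforms and has undefined behavior, so the widening has to happen before the shift, not after it.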
* radeonsi: Replace magic numbers for vertex buffer resource. | Michel Dänzer | 2012-04-19 | 1 | -4/+8
* radeonsi: (User) SGPR related cleanups. | Michel Dänzer | 2012-04-19 | 3 | -16/+33
    Use the same user SGPRs for the same purpose in vertex and pixel
    shaders.
    Better calculation of the number of SGPRs to reserve.