mesa.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	glsl: Remove unnecessary header.	Vinson Lee	2010-10-01	1	-1/+0
\|
*	glsl: Add a lowering pass for texture projection.	Eric Anholt	2010-09-30	3	-0/+102
\|
*	glsl: "Copyright", not "Constantright"	Kenneth Graunke	2010-09-28	1	-1/+1
\| \| \| \| \|	Clearly this started out as ir_copy_propagation.cpp, but the search and replace was a bit overzealous.
*	glsl: Also update implicit sizes of varyings at link time.	Eric Anholt	2010-09-28	1	-4/+7
\| \| \| \| \| \|	Otherwise, we'll often end up with gl_TexCoord being 0 length, for example. With ir_to_mesa, things ended up working out anyway, as long as multiple implicitly-sized arrays weren't involved.
*	glsl: Add validation that a swizzle only references valid channels.	Eric Anholt	2010-09-27	1	-0/+18
\| \| \| \|	Caught the bug in the previous commit.
*	glsl: Fix broadcast_index of lower_variable_index_to_cond_assign.	Eric Anholt	2010-09-27	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	It's trying to get an int smeared across all channels, not trying to get a 1:1 mapping of a subset of a vector's channels. This usually ended up not mattering with ir_to_mesa, since it just smears floats into every chan of a vec4. Fixes: glsl1-temp array with swizzled variable indexing
*	glsl: Fix copy'n'wasted ir_noop_swizzle conditions.	Eric Anholt	2010-09-22	1	-2/+2
\| \| \| \|	It considered .xyyy a noop for vec4 instead of .xyzw, and similar for vec3.
*	glsl: Rework assignments with write_masks to have LHS chan count match RHS.	Eric Anholt	2010-09-22	10	-111/+154
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It turns out that most people new to this IR are surprised when an assignment to (say) 3 components on the LHS takes 4 components on the RHS. It also makes for quite strange IR output: (assign (constant bool (1)) (x) (var_ref color) (swiz x (var_ref v) )) (assign (constant bool (1)) (y) (var_ref color) (swiz yy (var_ref v) )) (assign (constant bool (1)) (z) (var_ref color) (swiz zzz (var_ref v) )) But even worse, even we get it wrong, as shown by this line of our current step(float, vec4): (assign (constant bool (1)) (w) (var_ref t) (expression float b2f (expression bool >= (swiz w (var_ref x))(var_ref edge)))) where we try to assign a float to the writemasked-out x channel and don't supply anything for the actual w channel we're writing. Drivers right now just get lucky since ir_to_mesa spams the float value across all the source channels of a vec4. Instead, the RHS will now have a number of components equal to the number of components actually being written. Hopefully this confuses everyone less, and it also makes codegen for a scalar target simpler. Reviewed-by: Kenneth Graunke <[email protected]> Reviewed-by: Ian Romanick <[email protected]>
*	glsl2: fix typo in error msg	Brian Paul	2010-09-21	1	-1/+1
\|
*	glsl: Add definition of gl_TextureMatrix inverse/transpose builtins.	Eric Anholt	2010-09-21	1	-0/+3
\| \| \| \| \|	Fixes glsl2/builtin-texturematrix. Bug #30196.
*	glsl: Fix broken handling of ir_binop_equal and ir_binop_nequal.	Kenneth Graunke	2010-09-20	1	-25/+30
\| \| \| \| \| \| \| \|	When ir_binop_all_equal and ir_binop_any_nequal were introduced, the meaning of these two opcodes changed to return vectors rather than a single scalar, but the constant expression handling code was incorrectly written and only worked for scalars. As a result, only the first component of the returned vector would be properly initialized.
*	glsl: Add comments to clarify the types of comparison binops.	Kenneth Graunke	2010-09-20	1	-1/+2
\|
*	glsl2: silence compiler warnings in printf() calls	Brian Paul	2010-09-20	1	-2/+4
\| \| \| \| \|	Such as: "ir_validate.cpp:143: warning: format ‘%p’ expects type ‘void’, but argument 2 has type ‘ir_variable’"
*	glsl: Add doxygen comments	Ian Romanick	2010-09-20	2	-12/+121
\|
*	glsl/builtins: Switch comparison functions to just return an expression.	Kenneth Graunke	2010-09-18	5	-360/+72
\|
*	glsl/builtins: Fix equal and notEqual builtins.	Kenneth Graunke	2010-09-18	3	-48/+48
\| \| \| \| \|	Commit 309cd4115b7cba669a0bf858e7809cb6dae90ddf incorrectly converted these to all_equal and any_nequal, which is the wrong operation.
*	glsl: Properly handle nested structure types.	Kenneth Graunke	2010-09-18	2	-25/+11
\| \| \| \|	Fixes piglit test CorrectFull.frag.
*	glsl2: Fixed cloning of ir_call error instructions.	Tilman Sauerbeck	2010-09-18	1	-0/+3
\| \| \| \| \| \| \|	Those have the callee field set to the null pointer, so calling the public constructor will segfault. Signed-off-by: Tilman Sauerbeck <[email protected]>
*	glsl: Fix 'control reaches end of non-void function' warning.	Vinson Lee	2010-09-18	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \|	Fixes this GCC warning. lower_variable_index_to_cond_assign.cpp: In member function 'bool variable_index_to_cond_assign_visitor::needs_lowering(ir_dereference_array*) const': lower_variable_index_to_cond_assign.cpp:261: warning: control reaches end of non-void function
*	glsl2: Empty functions can be inlined.	Tilman Sauerbeck	2010-09-18	1	-1/+4
\| \| \| \| \|	Signed-off-by: Tilman Sauerbeck <[email protected]> Signed-off-by: Kenneth Graunke <[email protected]>
*	glsl2: Add flags to enable variable index lowering	Ian Romanick	2010-09-17	2	-8/+52
\|
*	glsl2: Refactor testing for whether a deref is of a matrix or array	Ian Romanick	2010-09-17	1	-11/+12
\|
*	glsl: add pass to lower variable array indexing to conditional assignments	Luca Barbieri	2010-09-17	5	-0/+317
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currenly GLSL happily generates indirect addressing of any kind of arrays. Unfortunately DirectX 9 GPUs are not guaranteed to support any of them in general. This pass fixes that by lowering such constructs to a binary search on the values, followed at the end by vectorized generation of equality masks, and 4 conditional assignments for each mask generation. Note that this requires the ir_binop_equal change so that we can emit SEQ to generate the boolean masks. Unfortunately, ir_structure_splitting is too dumb to turn the resulting constant array references to individual variables, so this will need to be added too before this pass can actually be effective for temps. Several patches in the glsl2-lower-variable-indexing were squashed into this commit. These patches fix bugs in Luca's original implementation, and the individual patches can be seen in that branch. This was done to aid bisecting in the future. Signed-off-by: Ian Romanick <[email protected]>
*	glsl: Don't print blank (function ...) headers for built-ins.	Kenneth Graunke	2010-09-16	1	-0/+3
\| \| \| \|	Fixes a regression caused when I added my GLSL ES support.
*	glsl: Change from has_builtin_signature to has_user_signature.	Kenneth Graunke	2010-09-16	3	-5/+5
\| \| \| \| \|	The print visitor needs this, and the only existing user can work with has_user_signature just as well.
*	glsl: Fix 'format not a string literal and no format arguments' warning.	Vinson Lee	2010-09-15	1	-1/+1
\| \| \| \| \| \|	Fix the following GCC warning. loop_controls.cpp: In function 'int calculate_iterations(ir_rvalue, ir_rvalue, ir_rvalue*, ir_expression_operation)': loop_controls.cpp:88: warning: format not a string literal and no format arguments
*	glsl2: add case for ir_unop_noise	Brian Paul	2010-09-14	1	-0/+4
\| \| \| \| \|	Silences a compiler warning. Still need to add some assertions for this case.
*	glsl2: fix comments	Brian Paul	2010-09-14	3	-3/+3
\|
*	glsl2: Port equal() and notEqual() to ir_unop_all_equal and ir_unop_any_nequal	Ian Romanick	2010-09-13	3	-240/+48
\|
*	glsl: introduce ir_binop_all_equal and ir_binop_any_equal, allow vector cmps	Luca Barbieri	2010-09-13	7	-22/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently GLSL IR forbids any vector comparisons, and defines "ir_binop_equal" and "ir_binop_nequal" to compare all elements and give a single bool. This is highly unintuitive and prevents generation of optimal Mesa IR. Hence, first rename "ir_binop_equal" to "ir_binop_all_equal" and "ir_binop_nequal" to "ir_binop_any_nequal". Second, readd "ir_binop_equal" and "ir_binop_nequal" with the same semantics as less, lequal, etc. Third, allow all comparisons to acts on vectors. Signed-off-by: Ian Romanick <[email protected]>
*	loop_unroll: unroll loops with (lowered) breaks	Luca Barbieri	2010-09-13	1	-4/+89
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the loop ends with an if with one break or in a single break unroll it. Loops that end with a continue will have that continue removed by the redundant jump optimizer. Likewise loops that end with an if-statement with a break at the end of both branches will have the break pulled out after the if-statement. Loops of the form for (...) { do_something1(); if (cond) { do_something2(); break; } else { do_something3(); } } will be unrolled as do_something1(); if (cond) { do_something2(); } else { do_something3(); do_something1(); if (cond) { do_something2(); } else { do_something3(); /* Repeat inserting iterations here.*/ } } ir_lower_jumps can guarantee that all loops are put in this form and thus all loops are now potentially unrollable if an upper bound on the number of iterations can be found. Signed-off-by: Ian Romanick <[email protected]>
*	glsl2: Add pass to remove redundant jumps	Ian Romanick	2010-09-13	6	-1/+118
\|
*	glsl: Explain file naming convention	Ian Romanick	2010-09-13	1	-0/+12
\|
*	loop_controls: fix analysis of already analyzed loops	Luca Barbieri	2010-09-13	1	-1/+8
\| \| \| \| \| \|	The loop_controls pass didn't look at the counter values it put in ir_loop on previous iterations, so while the first iteration worked, subsequent ones couldn't determine max_iterations.
*	glsl: add continue/break/return unification/elimination pass (v2)	Luca Barbieri	2010-09-13	6	-250/+548
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Changes in v2: - Base class renamed to ir_control_flow_visitor - Tried to comply with coding style This is a new pass that supersedes ir_if_return and "lowers" jumps to if/else structures. Currently it causes no regressions on softpipe and nv40, but I'm not sure whether the piglit glsl tests are thorough enough, so consider this experimental. It can be asked to: 1. Pull jumps out of ifs where possible 2. Remove all "continue"s, replacing them with an "execute flag" 3. Replace all "break" with a single conditional one at the end of the loop 4. Replace all "return"s with a single return at the end of the function, for the main function and/or other functions This gives several great benefits: 1. All functions can be inlined after this pass 2. nv40 and other pre-DX10 chips without "continue" can be supported 3. nv30 and other pre-DX10 chips with no control flow at all are better supported Note that for full effect we should also teach the unroller to unroll loops with a fixed maximum number of iterations but with the canonical conditional "break" that this pass will insert if asked to. Continues are lowered by adding a per-loop "execute flag", initialized to TRUE, that when cleared inhibits all execution until the end of the loop. Breaks are lowered to continues, plus setting a "break flag" that is checked at the end of the loop, and trigger the unique "break". Returns are lowered to breaks/continues, plus adding a "return flag" that causes loops to break again out of their enclosing loops until all the loops are exited: then the "execute flag" logic will ignore everything until the end of the function. Note that "continue" and "return" can also be implemented by adding a dummy loop and using break. However, this is bad for hardware with limited nesting depth, and prevents further optimization, and thus is not currently performed.
*	glsl: add ir_control_flow_visitor	Luca Barbieri	2010-09-13	1	-0/+17
\| \| \| \| \| \| \| \| \| \|	This is just a subclass of ir_visitor with empty implementations of all the visit methods for non-control flow nodes. Used to avoid duplicating that in ir_visitor subclasses. ir_hierarchical_visitor is another way to solve this, but is less natural for some applications.
*	glsl2: Fix scons build for all platforms	Jakob Bornecrantz	2010-09-10	1	-0/+1
\|
*	glsl2: Implement noise[1234] built-in functions using ir_unop_noise	Ian Romanick	2010-09-09	5	-104/+458
\|
*	glsl2: Add lowering pass to remove noise opcodes	Ian Romanick	2010-09-09	4	-1/+75
\|
*	glsl2: Add ir_unop_noise	Ian Romanick	2010-09-09	2	-0/+5
\|
*	glsl/builtins: normalize of a negative scalar should be -1.0.	Kenneth Graunke	2010-09-09	2	-2/+2
\|
*	glsl: add several EmitNo* options, and MaxUnrollIterations	Luca Barbieri	2010-09-08	6	-10/+12
\| \| \| \| \| \| \| \| \|	This increases the chance that GLSL programs will actually work. Note that continues and returns are not yet lowered, so linking will just fail if not supported. Signed-off-by: Ian Romanick <[email protected]>
*	glsl: Add info about talloc and optimization passes to the README.	Eric Anholt	2010-09-08	1	-0/+26
\|
*	glsl: Update README talking about multi-instruction operations.	Eric Anholt	2010-09-08	1	-4/+4
\| \| \| \| \| \|	The previous thing taking multiple instructions ended up being handled at the IR level, as we suggested would be the common result. Pick a new one.
*	glsl/builtins: Set the API in the fake context.	Kenneth Graunke	2010-09-08	2	-0/+2
\| \| \| \|	Otherwise it gets used uninitialized.
*	glsl2: Clear out profile pointers in _mesa_glsl_release_functions	Ian Romanick	2010-09-08	2	-0/+2
\| \| \| \| \| \|	Otherwise builtin_profiles contains dangling pointers the next time _mesa_read_profile is called. I suspect this may fix bugzilla #29847, but I was never able to reproduce it.
*	glsl: Fix for scalar float built-in definitions.	Kenneth Graunke	2010-09-08	3	-4/+4
\| \| \| \|	These need abs, and we need more tests.
*	glsl: regenerate builtins	Eric Anholt	2010-09-08	1	-1/+1
\|
*	glsl: Fix typo in builtin step() using a wrong channel.	Eric Anholt	2010-09-08	1	-1/+1
\|
*	ir_validate: Ensure ir_binop_dot is only used on vector types.	Kenneth Graunke	2010-09-08	1	-0/+1
\|